Gene Information |
Locus tag | Cag_1301 |
Symbol | |
ID | 3747449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1766885 |
End bp | 1769713 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637773838 |
Product | excinuclease ABC subunit A |
Protein accession | YP_379604 |
Protein GI | 78189266 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
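The coordinate and length fields above are consistent with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1766885
END_BP = 1769713


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2829
print(protein_length_aa(length))  # 942
```

Both results match the Gene Length (2829 bp) and Protein Length (942 aa) fields.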
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0773804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCTC ACGGGCAACT GACCGACACC TCCTTACCGG ATATTGTGCT GAAAGGCATT AACACCCACA ACCTCCGCAA CATTTCCGTT CGCATTCCTC GCAATAAATT TATTGTTATA ACGGGCGTTA GTGGCTCAGG CAAATCAAGC CTTGCTTTCG ACACCCTTTA CGCCGAAGGG CATCGCCGTT ATGTGGAATC GCTCTCGGCG TATGTTCGCC AATTTCTTGA GCGAATGCCT CGCCCCGATA TTGAGCACGT TGAAGGCATT GCGCCTGCCA TTGCTATTGA GCAAAAAGCA CTCCCTAAAA ATCCCCGCTC AACCGTTGGC ACCGTGTCGG AAATTTATGA CTACCTCCGC TTGCTCTATG CCCGCATTGG TAAAATTTAC TCGCGCGACA CCAACGAGTT AGTGCTCAAG CACACACCCG ATGACGTCAG CTTGCAAGCA GGTTTTATTG AGGATGGCAA AAAATTTTAT GTGGGATTTT TTTTTCCTCA CCATCATACC GCTCAACAGC TCGACTGCTC GCCCGAAGAG GAAATTGCAA ATCTCCTGAA AAAAGGCTTT TTCCGCTTGC TTGCAGGCGA TGAGCTGCTT GACCTAAACC AAGAAGCTGA CTACCAAAAA GTGCTCGACA TGCCCGCTAA GGTTCGCGCT GAACTCTTAG TGGTGGTTGA CCGCTTTGTT GCCCGCAATA ACGACAAACT CTTTAGCCGC ATTTCGCAAG CTGCCGAAAG CAGTTTTATG GAATCGGGCG GACACGCAGT GCTAAAAGTA GTTGACGGCA AAACCTACCG CTTTAGCGAT CGCCTTGAGC TGCACGATAT TGAGTATCAA GAGCCTTCGC CCCAACTCTT TGCCTTTAAC TCCCCCATTG GCGCTTGCAC CACCTGCCAA GGTTTTGGGA GAATTATGGG AATTGATGAA GATGCCGTTA TTCCCGATAA ATCACTTTCC ATTGAAGAGG GAGCAATTGC TTGCTGGAAT TCTGAAAAAT ATCGCTGGAA TTTATTGGAG CTGATGCACT ATGCGCCGAA GTTTGGTGTT CCACTACGAG AGCCTTACGA AAAGCTCACC TTTGAACAAA AAGAGATTAT TTGGAAAGGA ACTCCTGACG GAAGCTTTAA TGGCATTCGC GCTTTTTTTG CGGAAATAGA AAAAGATGCC GGTTACAAAA TGCACTACCG CGTTTTTTTA AGCCGCTACC GAGGCTACGC CATCTGCCCC GATTGCGAAG GAAGCCGCTT AAACCCCGAT GCTCTTCAGG TAAAAATTTC AGGACGCCAC ATTGGCGAAG TAACTCGCAT GAGCATTGGC GAAGTGGCTG AATTTTTCCG CAACCTCAAC ATCTCCCCCT TTGACCGCTC GGTAGCTGAA GTGATTTTGC AAGAAATTAA TCGCCGACTT GGCTACTTGC TCGACGTAGG ACTTGATTAC CTCACGCTTG ACCGCTTAAC CCACACCCTA AGCGGCGGCG AATTCCAACG CATCAACCTC TCCACCTCGC TTGGCTCACC GCTTGTAGGC ACCATGTACA TTCTTGACGA ACCAAGCATT GGGCTACACC AAAGCGACTC CGCACGCTTG ATTGCGCTGC TCCGCAAATT ACGCGACCTT GGCAACACCG TTGTGGTAGT TGAGCACGAC CGCGAAATTA TTGAAGCCGC CGATGAGGTG ATTGATCTTG GACCATTTGC TGGACGGCTT GGTGGCGAAG TAGTATTTCA AGGCAGCATG GAAGCCATGC GCTCATCGGG CACCTCGCTC ACTGCACAAT ACATGAATGG CGAACAACAA 
ATTGAGGTAC CCCAACAGCG CCGCACGGTT GATTTCTCCG CCTGCATTAC CATCAGCGGT GCCATGCAAA ACAACCTCAA AAACATTGAT GTTCAAATTC CGCTTAAAGT AATGACCTGC ATAACCGGCG TTAGTGGTTC AGGCAAATCA ACCCTCATTA ACGATATTCT TTGCAAAGGC ATTCTCCGCG AAAAACATGG AAGCCGTGGC ACCGTAGGCA CCCACCGCTC GCTAACAGGC GCATGGCTCA TTGACCGCAT TGAGCACGTT GATCAATCGC CCATTGGCAA GTCAAGCCGT AGCAATCCGG TTACCTACAT GAAAATTTTC GACGACATCC GCACCCTTTT TGCCAACACG CCCGATGCTC GCAAGAAAAA AGTAAAAGCA GGCTACTTCT CTTTTAACAT TCCCGGCGGT AGGTGCGAAG TGTGCTCGGG CGAAGGCAGC GTGCATATTG AAATGCAATT TCTTGCCGAC ATTGAAGCCG TATGCGAAGC CTGCAACGGA CTTCGCTACC AACCCGAAGC GCTTGCCATT AAGTTCAACG GTAAATCCAT TGCCGAAGTG CTCGACATGA CGGTAAGCGA AGCACTGAGC TTTTTTAAAG GCGAAAAAAA CATTGTAAAA AAACTCAGCG TTCTCGATCA AGTAGGACTT GGCTACATAC GTCTTGGGCA ATCCTCCAGC ACCTTCTCAG GCGGCGAAGC ACAACGCTTG AAGCTTGCCA CCTTTATTGC CCACGCCGAC ACCACTCACA CGCTTTTCGT GTTTGATGAA CCAACCACAG GACTACATTT TGAGGATATT AAAAAGCTCA TCCTTTGCTT TGAAAAGCTC CTTGAGCAAA ACAACAGCCT TATTATTATT GAGCACAATC TCGATATTAT TAAGCAAGCT GATTGGGTAA TTGATTTAGG ACCAGGCGCA GGCGATAAAG GTGGGCACTT GGTAGAACAA GGCACACCCG AAGAGGTTGC TCAATGCACT GAATCACTGA CGGGGCAATA TTTGCGAGGG GTGGTATAA
|
Protein sequence | MNAHGQLTDT SLPDIVLKGI NTHNLRNISV RIPRNKFIVI TGVSGSGKSS LAFDTLYAEG HRRYVESLSA YVRQFLERMP RPDIEHVEGI APAIAIEQKA LPKNPRSTVG TVSEIYDYLR LLYARIGKIY SRDTNELVLK HTPDDVSLQA GFIEDGKKFY VGFFFPHHHT AQQLDCSPEE EIANLLKKGF FRLLAGDELL DLNQEADYQK VLDMPAKVRA ELLVVVDRFV ARNNDKLFSR ISQAAESSFM ESGGHAVLKV VDGKTYRFSD RLELHDIEYQ EPSPQLFAFN SPIGACTTCQ GFGRIMGIDE DAVIPDKSLS IEEGAIACWN SEKYRWNLLE LMHYAPKFGV PLREPYEKLT FEQKEIIWKG TPDGSFNGIR AFFAEIEKDA GYKMHYRVFL SRYRGYAICP DCEGSRLNPD ALQVKISGRH IGEVTRMSIG EVAEFFRNLN ISPFDRSVAE VILQEINRRL GYLLDVGLDY LTLDRLTHTL SGGEFQRINL STSLGSPLVG TMYILDEPSI GLHQSDSARL IALLRKLRDL GNTVVVVEHD REIIEAADEV IDLGPFAGRL GGEVVFQGSM EAMRSSGTSL TAQYMNGEQQ IEVPQQRRTV DFSACITISG AMQNNLKNID VQIPLKVMTC ITGVSGSGKS TLINDILCKG ILREKHGSRG TVGTHRSLTG AWLIDRIEHV DQSPIGKSSR SNPVTYMKIF DDIRTLFANT PDARKKKVKA GYFSFNIPGG RCEVCSGEGS VHIEMQFLAD IEAVCEACNG LRYQPEALAI KFNGKSIAEV LDMTVSEALS FFKGEKNIVK KLSVLDQVGL GYIRLGQSSS TFSGGEAQRL KLATFIAHAD TTHTLFVFDE PTTGLHFEDI KKLILCFEKL LEQNNSLIII EHNLDIIKQA DWVIDLGPGA GDKGGHLVEQ GTPEEVAQCT ESLTGQYLRG VV
|
| |
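The gene and protein sequences above can be cross-checked against the GC content and translation table fields. The following is a minimal sketch in plain Python, not the method the database itself uses; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It assumes the sequence is pasted with its space-separated blocks of ten bases and contains only A, C, G, and T. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so the standard table is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
first_blocks = "ATGAACGCTC ACGGGCAACT GACCGACACC"
print(translate(first_blocks))  # MNAHGQLTDT
```

The printed residues match the first block of the protein sequence. Passing the complete gene sequence to `gc_fraction` and `translate` allows the same comparison against the GC content field and the full 942-residue protein.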