Gene Cag_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_2014 
Symbol 
ID3747987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2554198 
End bp2555586 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content47% 
IMG OID637774551 
ProductF0F1 ATP synthase subunit beta 
Protein accessionYP_380305 
Protein GI78189967 
COG category[C] Energy production and conversion 
COG ID[COG0055] F0F1-type ATP synthase, beta subunit 
TIGRFAM ID[TIGR01039] ATP synthase, F1 beta subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAAG GCAAGATTTC ACAAATCATC GGGCCCGTCG TTGATGTTGA TTTCCCTGAA 
GGACGGTTGC CATCAATTCT TGATGCGCTT ACTGTTAAAA GAGAAGATGG CTCTAAGTTG
GTGCTTGAAA CCCAACAGCA CCTTGGTGAA GAGCGTGTTC GTACCGTTGC TATGGAAAGC
ACCGATGGTT TAGTAAGAGG CATGGGCGTG GTGAATACCG GCGCTGCTAT TCAGGTGCCT
GTTGGCGCTG AAGTGCTTGG ACGCATGTTA AACGTTGTGG GCGATCCAAT TGATGGACGC
GGTCCCGTTA ACAGCAAAAA AACCTACTCC ATCCATCGTA GTGCTCCAAA GTTTGAAGAC
ATTTCAACCA AAGCTGAAAT GTTTGAAACG GGTATTAAAG TTATTGACTT ACTTGAACCA
TACTCTCGCG GTGGAAAAAC CGGTTTGTTT GGTGGTGCAG GTGTAGGCAA AACCGTGCTC
ATTATGGAGC TGATTAACAA CATTGCAAAG CAGCAGTCGG GCTTTAGCGT GTTTGCGGGC
GTAGGTGAGC GTACTCGCGA AGGTAACGAC CTTTGGCACG AAATGATGGA GTCGGGCGTT
ATTGACAAAA CCGCACTTGT GTTTGGTCAA ATGAACGAAC CTCCCGGTGC TCGTCAGCGT
GTGGCTTTAA CGGGTTTGAG TATTGCAGAA TACTTCCGTG ATGAAGAAAA TCGCGATGTG
TTGCTCTTTG TTGACAACAT TTTCCGCTTT ACGCAGGCAG GTTCAGAGGT ATCGGCACTG
CTTGGACGTA TGCCAAGTGC TGTAGGTTAC CAGCCAACGC TTGCAACCGA AATGGGTCAG
CTTCAAGATA GAATTGTTTC CACCAAAAAA GGTTCGGTTA CCTCAGTACA AGCTATTTAT
GTGCCTGCTG ATGACCTTAC CGACCCTGCT CCTGCAACAG CGTTTACCCA CTTGGATGCA
ACCACAGTGC TTTCACGTTC CATTGCAGAG CTTGGTATTT ATCCTGCGGT AGATCCACTT
GACTCCACTT CCCGTATTCT TGATCCTAAT GTTGTTGGCG ACGACCACTA CAACACCGCA
CAAGCGGTAA AGCAGTTGCT CCAGCGCTAT AAAGATTTGC AAGATATTAT TGCAATTCTT
GGTATGGACG AGTTAAGCGA TGAAGATAAG TTGGTGGTAT CGCGCGCACG TAAAGTACAG
CGCTTCCTTT CACAGCCATT CTTTGTGGCT GAAGCCTTTA CGGGTCTTGC TGGTAAGTAT
GTAAAGCTTG AAGATACTAT CAAAGGCTTT AAAGAAATTA TTGCTGGAAA GCACGATAAA
CTCCCAGAAA ATGCCTTCTA CCTTGTAGGC ACCATTGAAG AGGCTATCGA GAAAGCAAAA
ACTCTCTAA
 
Protein sequence
MQEGKISQII GPVVDVDFPE GRLPSILDAL TVKREDGSKL VLETQQHLGE ERVRTVAMES 
TDGLVRGMGV VNTGAAIQVP VGAEVLGRML NVVGDPIDGR GPVNSKKTYS IHRSAPKFED
ISTKAEMFET GIKVIDLLEP YSRGGKTGLF GGAGVGKTVL IMELINNIAK QQSGFSVFAG
VGERTREGND LWHEMMESGV IDKTALVFGQ MNEPPGARQR VALTGLSIAE YFRDEENRDV
LLFVDNIFRF TQAGSEVSAL LGRMPSAVGY QPTLATEMGQ LQDRIVSTKK GSVTSVQAIY
VPADDLTDPA PATAFTHLDA TTVLSRSIAE LGIYPAVDPL DSTSRILDPN VVGDDHYNTA
QAVKQLLQRY KDLQDIIAIL GMDELSDEDK LVVSRARKVQ RFLSQPFFVA EAFTGLAGKY
VKLEDTIKGF KEIIAGKHDK LPENAFYLVG TIEEAIEKAK TL