Gene Cag_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0140 
Symbol 
ID3747186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp154291 
End bp155871 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content44% 
IMG OID637772667 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_378461 
Protein GI78188123 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACAA CAGTCAGGCC TGATGAGGTT TCATCCATAC TTCGCAAACA GCTTGCCAAT 
TTTGAGTCAG AAGCTGACGT ATATGATGTT GGAACAGTGC TCCAGGTTGG TGACGGTATC
GCCCGTGTGT ATGGGTTGAC AAAAGTCGCA GCCGGTGAGC TTCTTGAATT TCCAAACAAT
GTAATGGGCA TGGCGCTTAA CCTCGAAGAG GATAACGTTG GTGCAGTGTT GTTTGGTGAA
TCCACCATGG TGAAGGAGGG TGATACTGTA AAGCGTTCAG GTATTTTGGC TTCTATTCCG
GTTGGTGAAG CTATGTTAGG TCGCGTTATC AATCCACTTG GTGAGCCAAT TGATGGTAAA
GGGCCTATTG ATGCTAAACT TCGTTTACCA CTTGAGCGTC GTGCTCCTGG TGTTATTTAT
CGTAAATCAG TACATGAGCC ACTGCAAACA GGCTTAAAAG CTATTGATGC TATGATTCCT
GTTGGTCGTG GTCAGCGTGA GTTGATTATT GGTGACCGTC AAACAGGTAA AACCGCTGTA
GCGCTTGATA CCATTATCAA CCAGAAAGGT AAAGGCGTTT TTTGTATTTA CGTTGCTATC
GGTTTAAAAG GTTCAACGAT TGCGCAGGTT GTAAGTACGC TTGAAAAATA TGATGCGCTT
TCTTACACCA CTGTTATTGC TGCTACAGCT TCCGATCCTG CTCCACTTCA GTTTATTGCT
CCATTTGCAG GCGCTACGCT TGGTGAGTAT TTCCGCGATA CTGGTCGCCA TGCGCTTGTT
ATATATGATG ATCTTTCAAA GCAGGCTGTT TCTTATCGTC AGGTTTCGCT CTTGCTTCGT
CGTCCACCAG GACGTGAAGC TTACCCTGGT GATGTGTTCT ACTTACACTC TCGTTTGCTT
GAGCGTGCTG CAAAAATTAC CGATGATGTT GAAGTCGCTA AAAAAATGAA CGACCTTCCT
GATGCCTTAA AGCCATTGGT GAAGGGTGGA GGTAGCTTAA CGGCATTGCC TATTATTGAA
ACACAGGCAG GTGACGTGTC GGCATACATT CCAACAAACG TTATTTCTAT TACTGACGGT
CAAATCTTCC TTGAGTCAAA CCTCTTTAAC TCAGGTCAGC GTCCTGCTAT TAACGTTGGT
ATTTCGGTAT CGCGTGTAGG TGGTGCAGCG CAAATTAAAG CAATGAAGAA AATTGCTGGT
ACGCTTCGCC TTGATTTGGC TCAGTTCCGC GAACTTGAAG CCTTCTCTAA ATTTGGTTCT
GACCTTGATA AAACAACCAA AGCGCAGCTT GATCGTGGCG CTCGCCTTGT TGAAATTTTA
AAGCAAGGGC AGTATGTGCC AATGCCCGTT GAAAAACAGG TGGCAATTAT TTTTGTAGGT
ACGCAAGGAT TGCTTGATTC CGTTGACTTG AAATTTATCC GCAAGTGTGA GGAAGAGTTC
CTTGCAATGC TTGAAATGAA GCATGCAGAT ATTCTTAGTG GAATTGCCGA GAAAGGGACG
CTTGAAGCTG ATGTAGCAAG CAAGTTGAAA GATATTGCAA CCAAGTTTAT TGCTACATTT
AAAGAGAAAA ACAAAGCCTA A
 
Protein sequence
MSTTVRPDEV SSILRKQLAN FESEADVYDV GTVLQVGDGI ARVYGLTKVA AGELLEFPNN 
VMGMALNLEE DNVGAVLFGE STMVKEGDTV KRSGILASIP VGEAMLGRVI NPLGEPIDGK
GPIDAKLRLP LERRAPGVIY RKSVHEPLQT GLKAIDAMIP VGRGQRELII GDRQTGKTAV
ALDTIINQKG KGVFCIYVAI GLKGSTIAQV VSTLEKYDAL SYTTVIAATA SDPAPLQFIA
PFAGATLGEY FRDTGRHALV IYDDLSKQAV SYRQVSLLLR RPPGREAYPG DVFYLHSRLL
ERAAKITDDV EVAKKMNDLP DALKPLVKGG GSLTALPIIE TQAGDVSAYI PTNVISITDG
QIFLESNLFN SGQRPAINVG ISVSRVGGAA QIKAMKKIAG TLRLDLAQFR ELEAFSKFGS
DLDKTTKAQL DRGARLVEIL KQGQYVPMPV EKQVAIIFVG TQGLLDSVDL KFIRKCEEEF
LAMLEMKHAD ILSGIAEKGT LEADVASKLK DIATKFIATF KEKNKA