Gene Cpha266_2544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2544 
Symbol 
ID4569734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2921353 
End bp2922933 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content52% 
IMG OID639767109 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_912956 
Protein GI119358312 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.430651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTACAA CAGTCAGGCC TGATGAGGTT TCATCCATAC TTCGCAAGCA GCTTGCCGGT 
TTTGAGTCCG AGGCTGATGT CTACGATGTG GGAACAGTTT TGCAGGTTGG TGACGGTATC
GCCCGTGTGT ATGGATTATC CAAGGCGGCT GCCGGTGAAC TTCTTGAGTT TCCCAACAAG
GTGATGGGTA TGGCGCTTAA CCTTGAAGAA GACAATGTCG GCGCGGTATT GTTCGGAGAA
TCCAATCTGG TTAAAGAGGG CGATACGGTT AAACGAACCG GTATTCTTGC CTCTATCCCG
GTTGGCGAGG CTATGCTTGG AAGGGTCATC AATCCTCTGG GAGAGCCGAT TGACGGAAAA
GGGCCGATCG AAACACAGAT CAGGCTTCCG CTCGAGCGCA GGGCACCTGG CGTTATCTAC
CGTAAATCAG TGCATGAACC GCTTCAGACC GGACTCAAGG CGATCGACTC GATGATTCCT
ATCGGCCGCG GACAGCGCGA GCTTATTATT GGCGATCGTC AGACCGGAAA GACGGCGGTT
GCAATCGATA CCATTATCAA CCAGAAGGGC AAGGGCGTTT TCTGTATCTA TGTCGCTATC
GGTCTGAAAG GGTCAACGGT GGCTCAGGTG GTCAATACGC TTGAGAAATT CGGCGCCATG
GAGTATACCA CGGTTATTAC GGCTACGGCT TCCGATCCGG CGCCTCTTCA GTTTATCGCT
CCGTTCGCAG GCGCTACTCT TGGCGAGTAT TTCCGCGATA CCGGCCGCCA CGCGCTGGTT
GTTTATGACG ATCTTTCAAA GCAGGCTGTT GCCTATCGCC AGCTCTCCCT GCTCCTTCGC
CGTCCGCCGG GACGCGAAGC ATATCCCGGC GATGTATTCT ATTTACATTC CCGTCTTCTC
GAAAGGGCTG CAAAGATTAC TGATGATATT GAGGTTGCAA GGAAGATGAA CGATCTTCCC
GATGCGCTCA AATCGATGGT TAAAGGCGGC GGCAGTCTTA CTGCGCTTCC TGTTATTGAA
ACCCAGGCGG GTGACGTTTC CGCGTATATT CCGACCAACG TTATTTCGAT TACCGATGGT
CAGATTTTTC TTGAGTCGAA CCTTTTCAAT TCGGGTCAGC GGCCTGCAAT CAACGTTGGT
ATTTCGGTAT CCCGCGTTGG TGGTTCTGCA CAGATCAAGG CAATGAAAAA GGTTGCCGGT
ACCCTGCGAC TCGATCTTGC GCAGTTCCGT GAACTTGAGG CTTTCTCGAA ATTCGGTTCG
GATCTTGATA AAACCACGAA AGCGCAGCTT GACAGAGGCG CAAGACTGGT CGAGATTCTC
AAGCAGGGCC AGTATATTCC AATGGCTGTT GAAAAACAGG TTGCCATCAT CTTCCTTGGT
ACTCAGGGTT TGCTTGATGC TGTCGATGTA ACGCGTATTC GCAAGTTTGA AGAGGAGTTC
CTTGGGTTGC TTGAGCACAA GCATCCGGAA GTGCTCAAGG CGATTGCCGA AACGGGCACT
CTTGAAACCG ATACCGCTAA CAAGATCAAG GAGGCAGCGC AGAAGTTTAT CGCTTCCTTC
AACCAGAAAG CAAAGGCGTA A
 
Protein sequence
MSTTVRPDEV SSILRKQLAG FESEADVYDV GTVLQVGDGI ARVYGLSKAA AGELLEFPNK 
VMGMALNLEE DNVGAVLFGE SNLVKEGDTV KRTGILASIP VGEAMLGRVI NPLGEPIDGK
GPIETQIRLP LERRAPGVIY RKSVHEPLQT GLKAIDSMIP IGRGQRELII GDRQTGKTAV
AIDTIINQKG KGVFCIYVAI GLKGSTVAQV VNTLEKFGAM EYTTVITATA SDPAPLQFIA
PFAGATLGEY FRDTGRHALV VYDDLSKQAV AYRQLSLLLR RPPGREAYPG DVFYLHSRLL
ERAAKITDDI EVARKMNDLP DALKSMVKGG GSLTALPVIE TQAGDVSAYI PTNVISITDG
QIFLESNLFN SGQRPAINVG ISVSRVGGSA QIKAMKKVAG TLRLDLAQFR ELEAFSKFGS
DLDKTTKAQL DRGARLVEIL KQGQYIPMAV EKQVAIIFLG TQGLLDAVDV TRIRKFEEEF
LGLLEHKHPE VLKAIAETGT LETDTANKIK EAAQKFIASF NQKAKA