Gene Cphamn1_0297 details

Gene Information

Locus tag: Cphamn1_0297
Symbol:
ID: 6373952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 288943
End bp: 290523
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 50%
IMG OID: 642682811
Product: F0F1 ATP synthase subunit alpha
Protein accession: YP_001958747
Protein GI: 189499277
COG category: [C] Energy production and conversion
COG ID: [COG0056] F0F1-type ATP synthase, alpha subunit
TIGRFAM ID: [TIGR00962] proton translocating ATP synthase, F1 alpha subunit
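
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 288943
end_bp = 290523
reported_gene_length = 1581      # bp
reported_protein_length = 526    # aa

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# counted as a residue in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1581 526
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # True True
```

Under these assumptions the reported gene length (1581 bp) and protein length (526 aa) agree with the coordinates.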


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.204325
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCTACAA CAGTCAGGCC TGATGAGGTT TCGGCAATAC TCCGCAAACA CCTGGCAGGG 
TTTGAGTCCG AAGCGGATGT CTATGATGTT GGAACCGTAC TCCAGGTAGG TGATGGCATT
GCAAGGATTT ATGGTTTATC AAAAGTTGCC GCTGGTGAAC TTCTCGAGTT TCCGGATAAT
GTCATGGGTA TGGCATTGAA CCTTGAGGAA GATAATGTCG GGGCTGTTAT GTTTGGGAAA
TCGACTGCCG TTAAGGAGGG CGATACTGTC AAGAGGACAG GTATTCTCGC GTCAATTCCT
GTCGGTGAGG CCATGCTTGG AAGGGTTATC AATCCGCTTG GCGAGCCTAT CGACGGCAAG
GGGCCGATAG AAACCAGTAT CAGGCTGCCT CTCGAACGCA AGGCTCCGGG TGTTATTTTC
CGTAAGTCGG TTAATCAGCC GCTTCAGACA GGTCTGAAGG CGATCGACGC GATGATTCCG
ATCGGTCGTG GTCAGCGAGA GTTGATTATC GGCGATCGTC AGACAGGAAA AACAGCTGTT
GCGATCGATA CCATCATCAA TCAGAAAGGA AAGGATGTAT TCTGCATCTA TGTCGCTATA
GGTCAGAAGG GATCGACAAT CGCCCAGGTT GTCAGTACCC TTGAAAAATA CGGTGCCATG
GAATATACGA CGGTGATTGC TTCATCCGCT TCTGATCCTG CGCCGATGCA GTTTATCGCT
CCCTATGCCG GTGCTGCAAT AGGCGAGTTT TTCCGTGACA CCGGACGTCA TGCGCTGGTG
ATTTACGATG ATCTTTCAAA ACAGGCGGTT GCCTATCGTC AGCTTTCTCT TCTTCTTCGT
CGTCCGCCGG GACGTGAAGC CTATCCTGGC GATGTTTTTT ATCTCCATTC GCGTCTGCTT
GAACGTGCGG CAAAGATTAC CGACGATCTG GAAACGGCCA AAAAGATGAA TGATCTTCCG
GAACCGTTGA AACCGATGGT CAAGGCGGGT GGAAGTCTTA CGGCGCTGCC GGTTATCGAG
ACCCAGGCAG GTGACGTGTC CGCTTACATC CCGACCAATG TTATTTCGAT TACCGACGGT
CAGATATTTC TTGAACCGAA TCTCTTCAAT GCCGGTCAGA GACCTGCTAT CAATGTCGGT
ATCTCGGTTT CCCGTGTGGG TGGTAGCGCG CAGATCAAGG CGATGAAAAA GATTACAGGT
ACGCTGCGTC TGGACCTTGC CCAGTTCCGT GAGCTCGAGG CGTTCTCGAA ATTCGGCTCG
GACCTTGACA AGGCGACAAA AGCCCAGCTC GACAGGGGAG CCCGTCTGGT AGAGATTCTG
AAACAGGACC AGTATGTGCC TATGGCGGTT GAAAAACAGG TCGCGATTAT CTTTGCAGGT
ACCCAGGGTG TTCTCGATCA GCTTGATTTG CAGTATATCC GCAGGTTTGA AGAGGAGTTT
CTCAGTCTTC TTGAGCACAA GCACAGTGAT ATTCTCAACA GTATTGCCGA AACAGGTCAA
ATGGATGTTG ATGTGGCAAA AAGGTTGAAA GAGGTGGCTG AGCAGTTTAT GAGCACGTTC
AAGCAGAAAG TAACAGCGTA G
 
Protein sequence
MSTTVRPDEV SAILRKHLAG FESEADVYDV GTVLQVGDGI ARIYGLSKVA AGELLEFPDN 
VMGMALNLEE DNVGAVMFGK STAVKEGDTV KRTGILASIP VGEAMLGRVI NPLGEPIDGK
GPIETSIRLP LERKAPGVIF RKSVNQPLQT GLKAIDAMIP IGRGQRELII GDRQTGKTAV
AIDTIINQKG KDVFCIYVAI GQKGSTIAQV VSTLEKYGAM EYTTVIASSA SDPAPMQFIA
PYAGAAIGEF FRDTGRHALV IYDDLSKQAV AYRQLSLLLR RPPGREAYPG DVFYLHSRLL
ERAAKITDDL ETAKKMNDLP EPLKPMVKAG GSLTALPVIE TQAGDVSAYI PTNVISITDG
QIFLEPNLFN AGQRPAINVG ISVSRVGGSA QIKAMKKITG TLRLDLAQFR ELEAFSKFGS
DLDKATKAQL DRGARLVEIL KQDQYVPMAV EKQVAIIFAG TQGVLDQLDL QYIRRFEEEF
LSLLEHKHSD ILNSIAETGQ MDVDVAKRLK EVAEQFMSTF KQKVTA
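
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not part of the original record. It assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the tables differ in which start codons are permitted, which this sketch ignores). The function name `translate` and the row variables are hypothetical. Each full row of the gene sequence holds 60 bases, so every row begins in frame.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code; alternative start codons are not modeled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGTCTACAA CAGTCAGGCC TGATGAGGTT TCGGCAATAC TCCGCAAACA CCTGGCAGGG"
last_row = "AAGCAGAAAG TAACAGCGTA G"

print(translate(first_row))  # MSTTVRPDEVSAILRKHLAG
print(translate(last_row))   # KQKVTA
```

The two results match the first 20 residues and the last 6 residues of the protein sequence listed above.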