Gene TM1040_2088 details


Gene Information

Locus tag: TM1040_2088
Symbol:
ID: 4077839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2191586
End bp: 2193127
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 60%
IMG OID: 638007407
Product: F0F1 ATP synthase subunit alpha
Protein accession: YP_614082
Protein GI: 99081928
COG category: [C] Energy production and conversion
COG ID: [COG0056] F0F1-type ATP synthase, alpha subunit
TIGRFAM ID: [TIGR00962] proton translocating ATP synthase, F1 alpha subunit
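
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive and that the gene length includes a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2191586, 2193127)
print(length)                     # 1542
print(protein_length_aa(length))  # 513
```

Both values agree with the Gene Length (1542 bp) and Protein Length (513 aa) reported in the record.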


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.509731
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTATCC AAGCAGCGGA AATTTCCGCG ATCCTTAAAG ACCAGATCAA GAATTTTGGT 
CAAGAAGCCG AAGTGGCCGA GATTGGTCGC GTGCTCTCCG TCGGTGACGG TATTGCCCGT
GTCTATGGCC TCGACAACGT CCAGGCGGGC GAAATGGTCG AATTCCCCGG CGGCATCATG
GGTATGGCGC TGAACCTGGA ATCCGACAAC GTCGGTGTGG TTATCTTCGG CTCCGACCGC
GACATTAAAG AAGGCGACAC CGTCAAGCGC ACCAACTCCA TCGTGGACGT GCCTGCTGGT
CCCGAGCTTC TGGGTCGCGT TGTCGACGGC CTCGGCAACC CGCTCGACGG CAAAGGCCCG
ATCGAAGCCA AAGAGCGCAA GGTTGCAGAC GTCAAAGCGC CGGGCATCAT CCCGCGTAAA
TCCGTGCACG AGCCGATGGC AACCGGCCTC AAGTCCGTGG ACGCGATGAT CCCCGTTGGC
CGTGGCCAGC GCGAGCTGAT CATTGGTGAC CGTCAGACCG GTAAGACCGC GATCGCTCTG
GACACCATCC TGAACCAGAA GTCCTACAAC GACGCTGCAG GCGACGACGA CTCCAAGAAA
CTCTACTGCG TCTACGTTGC TGTGGGTCAG AAGCGTTCCA CCGTGGCGCA GCTGGTGAAA
AAGCTCGAAG AGTCCGGCGC GATGGAATAC TCCATCGTTG TGGCCGCAAC CGCGTCCGAC
CCGGCACCGA TGCAGTTCCT TGCCCCCTAC GCGGCGACCG CGATGGCCGA ATACTTCCGC
GACAGCGGCA AGCACGCGCT CATCATCTAT GATGACCTCT CCAAGCAGGC TGTGGCCTAT
CGTCAGATGT CCCTGCTGCT GCGCCGCCCG CCGGGCCGTG AAGCTTATCC GGGTGACGTT
TTCTACCTCC ACTCCCGTCT GCTCGAGCGT TCCGCAAAGC TGAACGAAGA CTTCGGTGCA
GGCTCGCTGA CCGCGCTGCC GATCATCGAA ACCCAGGGCG GCGACGTGTC TGCGTTTATT
CCGACCAACG TGATCTCCAT CACCGACGGT CAGATCTTCC TTGAGACCGA ACTGTTCTAC
CAGGGCATCC GCCCCGCCGT GAACACCGGT CTGTCGGTTT CGCGTGTGGG CTCCTCGGCT
CAGACCGATG CGATGTCTTC CGTTGCGGGC CCTGTGAAAC TGTCCCTGGC TCAGTACCGC
GAAATGGCGG CCTTTGCGCA GTTCGGTTCC GACCTCGACG CCGCAACCCA GCAGCTGCTG
GCCCGTGGCG CGCGTCTCAC CGAGCTGATG AAACAGCCGC AGTATTCGCC GCTCACCAAC
TCTGAAATCG TCTGCATCAT CTTCGCGGGC ACCAACGGCT ACCTCGACAA AGTCGACGTC
AAGGAAGTGG GTCGTTATGA AGCCGAGCTT CTGACCTTCC TGCGCTCCAA GAAGGCCGAC
TTCCTGCAGT GGATCACCGA TGAGGATCCC AAGTTCAAGA AGGGTGAACC GGTCGAGAAG
ATGAAGGCTG TTCTGGACGA ATTCGCAGCA GACTTCGCGT AA
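
The GC content field can be recomputed from the gene sequence above. The record does not state how the 60% figure was derived, so the sketch below assumes the usual definition: the count of G and C bases divided by the total number of bases, after removing the spaces and line breaks used for display. The name gc_fraction is hypothetical.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display grouping."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, as an illustration only.
print(gc_fraction("ATGGGTATCC AAGCAGCGGA"))
```

Passing the full 1542 bp sequence and rounding to a whole percentage gives a value that can be compared with the reported GC content.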
 
Protein sequence
MGIQAAEISA ILKDQIKNFG QEAEVAEIGR VLSVGDGIAR VYGLDNVQAG EMVEFPGGIM 
GMALNLESDN VGVVIFGSDR DIKEGDTVKR TNSIVDVPAG PELLGRVVDG LGNPLDGKGP
IEAKERKVAD VKAPGIIPRK SVHEPMATGL KSVDAMIPVG RGQRELIIGD RQTGKTAIAL
DTILNQKSYN DAAGDDDSKK LYCVYVAVGQ KRSTVAQLVK KLEESGAMEY SIVVAATASD
PAPMQFLAPY AATAMAEYFR DSGKHALIIY DDLSKQAVAY RQMSLLLRRP PGREAYPGDV
FYLHSRLLER SAKLNEDFGA GSLTALPIIE TQGGDVSAFI PTNVISITDG QIFLETELFY
QGIRPAVNTG LSVSRVGSSA QTDAMSSVAG PVKLSLAQYR EMAAFAQFGS DLDAATQQLL
ARGARLTELM KQPQYSPLTN SEIVCIIFAG TNGYLDKVDV KEVGRYEAEL LTFLRSKKAD
FLQWITDEDP KFKKGEPVEK MKAVLDEFAA DFA
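
The protein sequence is the translation of the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal translator, not the tool the database used. Table 11 assigns the same amino acid to every codon as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, alternative start codons are not handled here. The name translate and the table constants are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to the first stop codon.

    Whitespace is ignored; codons containing letters other than A, C, G, T
    raise KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)
```

For example, translate("ATGGGTATCC AAGCAGCGGA AATTTCCGCG") returns "MGIQAAEISA", the first ten residues of the protein sequence, and translate("GACTTCGCGT AA") returns "DFA", its last three residues.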