Gene Tmz1t_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0213 
Symbol 
ID7084334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp246664 
End bp248208 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content64% 
IMG OID643697255 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_002353904 
Protein GI217968670 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTATGC AACTCAACCC CTCTGAAATC AGCGATCTGA TCAAGAGCCG GATCCAGAAC 
CTGCAGCTCG CCGCCACGTC GCGCAACGAG GGTACGGTGG TCTCCGTCAC CGACGGCATC
ACCCGCGTTC ATGGCCTGAC CGACGTCATG CAGGGCGAAA TGCTGGAGTT CCCCGGCAAC
ACCTTCGGCA TGGCGCTCAA CCTCGAGCGC GACTCCGTCG GCGCCGTGGT GCTCGGCGAA
TACGAGCACA TCACCGAAGG CGACACCGTC AAGGCCACCG GCCGCATTCT CGAAGTTCCG
GTCGGCCCCG AGCTGATCGG CCGCGTCGTG AACGCGCTCG GCCAGCCGAT CGACGGCAAG
GGCCCGATCA ACGCCAAGCT TACCGACAAG ATCGAAAAGG TCGCGCCGGG CGTCATCGCC
CGTCAGTCCG TTTCGCAGCC GGTGCAGACC GGTCTGAAGT CGGTTGACTC GATGGTGCCG
ATCGGTCGTG GCCAGCGCGA GCTGATCATC GGCGACCGCC AGACCGGCAA GACCGCCGTC
GCGGTCGACG CGATCATCAA CCAGAAGGGC CAGGACATGT ACTGCGTGTA CGTGGCCATC
GGCCAGAAGG CCTCGACTGT CGCGAACGTC GTGCGCAAGC TCGAAGAGAA CGGCGCGATG
GAATACACCA TCGTCGTCGC CGCCACCGCG TCGGAGTCGG CCGCCATGCA GTATCTGGCT
GCCTACGCCG GCTGCACCAT GGGCGAGTAC TTCCGCGACC GCGGCATGGA CGCGCTGATC
GTCTACGACG ACCTCACCAA GCAGGCCTGG GCGTATCGCC AAGTGTCGCT GCTGCTGCGC
CGTCCGCCTG GCCGTGAAGC CTACCCGGGC GATGTGTTCT ACCTGCACTC CCGTCTGCTC
GAGCGTGCCG CACGCGTCAA CGCCGACTAC GTCGAGAAGT TCACCAACGG CGAGGTCAAG
GGCAAGACCG GTTCGCTGAC TGCGCTGCCG GTCATCGAGA CCCAGGCCGG CGACGTGTCC
GCGTTCGTTC CGACCAACGT GATCTCGATT ACCGACGGCC AGATCTTCCT CGAGACCGAC
CTCTTCAACG CCGGTATCCG TCCCGCGATC AACGCCGGTA TCTCGGTGTC CCGCGTCGGT
GGTGCAGCCC AGACCAAGGT CGTCAAGAAG CTCTCCGGCG GTATCCGTAC CGACCTCGCA
CAGTATCGCG AACTCGCTGC GTTCGCCCAG TTCGCCTCCG ACCTGGACGA TGCCACGCGG
AAGCAGCTGG AGCGCGGCCG CCGCGTCACC GAGCTGATGA AGCAGGCCCA GTACTCGCCG
ATGTCGATCG CCGACATGGC CATCGTTCTG TACGCGGTGA ACAACGGCTA CTTCGACGAT
GTCGACGTGG GTCGCGTGCT GGCTTTCGAG TCCGCGATGA TCCAGTTCGT CAAGACCAAG
CAGGCCGCCC TGGTCTCCTC GATCCTGAGC AAGAAGGAAC TCGACGCCGA AGGCGAAAAG
TCCCTGGCTG CCGCGATCGC CGAGTTCAAG AAGAGCTGGG CTTAA
 
Protein sequence
MSMQLNPSEI SDLIKSRIQN LQLAATSRNE GTVVSVTDGI TRVHGLTDVM QGEMLEFPGN 
TFGMALNLER DSVGAVVLGE YEHITEGDTV KATGRILEVP VGPELIGRVV NALGQPIDGK
GPINAKLTDK IEKVAPGVIA RQSVSQPVQT GLKSVDSMVP IGRGQRELII GDRQTGKTAV
AVDAIINQKG QDMYCVYVAI GQKASTVANV VRKLEENGAM EYTIVVAATA SESAAMQYLA
AYAGCTMGEY FRDRGMDALI VYDDLTKQAW AYRQVSLLLR RPPGREAYPG DVFYLHSRLL
ERAARVNADY VEKFTNGEVK GKTGSLTALP VIETQAGDVS AFVPTNVISI TDGQIFLETD
LFNAGIRPAI NAGISVSRVG GAAQTKVVKK LSGGIRTDLA QYRELAAFAQ FASDLDDATR
KQLERGRRVT ELMKQAQYSP MSIADMAIVL YAVNNGYFDD VDVGRVLAFE SAMIQFVKTK
QAALVSSILS KKELDAEGEK SLAAAIAEFK KSWA