Gene Tmz1t_2568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2568 
Symbol 
ID7874007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2772290 
End bp2773558 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content66% 
IMG OID643699490 
Producthypothetical protein 
Protein accessionYP_002889547 
Protein GI237653233 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0567655 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCCGCA TCATCGATCG GCGCTTCGAC AGCAAGAACA AGAGCGCGGT GAACCGCCAG 
CGCTTCATGC GCCGCTTCAA GCAGCAGATC CGCAAGGCGG TATCCGAGGC GATCCACGGC
CGCTCCATCC GCGACCTGGA GAACGGCGAG CAGATCTCCA TTCCTGCGCG CGATCTCTCC
GAGCCCTCTC TGCACCACGG CAAGGGCGGC ATCTGGGAGC AGGTCTTCCC CGGCAACGAC
CAGTTCAGCA CCGGCGACCG CATCAAGCGC CCACTCGGTG GCGCTGGCGA CGGTGCCGGC
AAGGGCAAGG CGAGCCAGGA CGGCGAGCAC GAGGACGACT TCGTCTTCCA GCTCTCGCGC
GAGGAGTTCC TCGACCTCTT CTTCGAGGAC CTCGAGCTCC CCCGCCTGAT CCGCACCCAG
CTAGCCAAGG TCACCGACTA CAAGACGCGG CGCGCCGGCT TCACGTCCGA TGGCGTGCCC
GCGAACATCA ACATCGTGCG CTCGATGCGC GGAGCGCTCG GGCGCAGGCT CGCGCTCGGC
TCGCCCTGGG CTGCACGCAT CCGCGCGCTG CAGCAGGAAC TCGACGAAGC GCTCGCGCGC
GCCGGCGAAG ACAGCGAGGA AGTGCGCGAC CTGCGCGAGG AACTCGCCGC GCTGCGCGCG
CGCATCGAAC GCATCCCCTT CATCGACCGC TTCGACCTGC GCTACAACAA CCGCGTCAAG
GAGCCCCGCC CCACCACCCA GGCGGTGATG TTCTGCGTGA TGGACGTCTC CGGCTCGATG
GACGAGGAGC GCAAGTCCAT GGCCAAGCGG TTCTTCATGC TGCTCTACCT GTTCCTCACG
CGCAGCTACG AGCACATCGA GGTCGTCTTC ATCCGTCACC ACACCGTGGC CAAGGAAGTC
GACGAGGACG AATTCTTCCA CTCGCGCGAA TCCGGCGGCA CCGTGGTCTC CAGCGCGCTC
GAGCTGATGC GCAACATCCT GCGCGAGCGT TACGCCAACG GGCAGTGGAA CGTCTACGGC
GCCCAGGCGT CCGACGGCGA CAACTGGGAC AACGACTCGC CGGTGTGCGG ACGCCTGCTC
GGCAAGGAGA TCCTGCCCTG GTGCCAGTAC TTCGCCTACG TCGAGATCAC CGCCGGCGAG
CCGCAGAACC TGTGGCGCGA GTACGCCAAG CTCGAGGCCG CGCACGACAA CTTCGCGATG
CAGCGCATCG AGTCGCCGGC CGACATCTAC CCGGTGTTCC GCGAACTGTT CAAGAAGACG
ATCGCATGA
 
Protein sequence
MVRIIDRRFD SKNKSAVNRQ RFMRRFKQQI RKAVSEAIHG RSIRDLENGE QISIPARDLS 
EPSLHHGKGG IWEQVFPGND QFSTGDRIKR PLGGAGDGAG KGKASQDGEH EDDFVFQLSR
EEFLDLFFED LELPRLIRTQ LAKVTDYKTR RAGFTSDGVP ANINIVRSMR GALGRRLALG
SPWAARIRAL QQELDEALAR AGEDSEEVRD LREELAALRA RIERIPFIDR FDLRYNNRVK
EPRPTTQAVM FCVMDVSGSM DEERKSMAKR FFMLLYLFLT RSYEHIEVVF IRHHTVAKEV
DEDEFFHSRE SGGTVVSSAL ELMRNILRER YANGQWNVYG AQASDGDNWD NDSPVCGRLL
GKEILPWCQY FAYVEITAGE PQNLWREYAK LEAAHDNFAM QRIESPADIY PVFRELFKKT
IA