Gene Tmz1t_3897 details


Gene Information

Locus tag: Tmz1t_3897
Symbol:
ID: 7873545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4292407
End bp: 4294353
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 65%
IMG OID: 643700836
Product: hypothetical protein
Protein accession: YP_002890859
Protein GI: 237654545
COG category:
COG ID:
TIGRFAM ID:
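
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4292407
END_BP = 4294353


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming one terminal stop codon is excluded."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1947
print(protein_length(length_bp))  # 648
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.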


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCACTT CCACAACGAT AACGCGCTCA GCATGGCTTT TCCTCGGACT GCTCGCCATC 
GCCTGCGGAT TGCTGTACAT CCCCGGCCTG ACAGGCGCCC TCTACTACGA CGACATCCGC
CCTCTCTCCG GCCTCGCCAG CGTCGTCAAC CTCGACTCCG CGCTCTACTA CCTCTCCTCC
GAGATCTCCG GTCCACTCGG CCGTCCGGTC GCCATGCTCA GCTTCCTGGT GCACGTCGAG
GACTGGCCCG GAGCCGTCGA GAACATCTTC CTTTTCAACG TACTGCTGCA CCTGACCAAC
GGCACGCTGG TCGCCCTGCT CGTCCATCGG CTGCTCAGTC TGAGGGGGGT CGCAGGCAGT
GCGCCCGCCT GGATTGCAGC CAGCACAGCT GCAATCTGGA TGCTGATGCC GCTGCAGGTC
TCCTCCTCGC TGATCGCGGT GCAGCGCATG GCGACGCTGT CCGCCTTCTT CGTGCTCGCG
GGCCTGTTGA TCCATGTTCA GGGCATCGCG ATCGAGGACC GGCGTCGTGC ACTCGGCGCC
GCTCTGCAGG CGCTCGGCCT CGTGGGGTTC ACCCTGCTCG CGATGTTCAC CAAGGAAAAC
GGCATCCTGC TGCCCGTTTT CGCGCTCGTG ATCCAGGCGA CGCTGCTCGC GGACCACACT
TCCCCGGGCC GCCTTCGACT CCTGCGCACC GTCGCCGGCG GCGGGGCGCT CGCGATCATC
CTGGCCTACC TCGCATACAG CGCATTCCGT TCCGGTGGCG TGTTCGGGGG GCGTGAGTTC
GATCTGCTCG AACGCATCCA GACCCAGCCG CTGATCCTGC TCGAATACCT CCGCGAAGCC
TTCGCACCCC GTCCCTATGG CTTGCACCCC TTCCATGACG GCTACCCCAA GGTCAGCGCG
CTCGCAGAGC ACCCCGTGGC GCTCTTCGCC GCCATCCTGT GGCCGACGCT CGCGGTGCTC
GCGGTCCGCT TCCGCCGCCG CTATCCGGTC GCGGCCTTCG CCGTGCTGTG GTTTCTTGCT
GCCCACCTGC TCGAATCCAC CGTCCTCGGG CTTGAGCTGT ACTTCGAGCA TCGCAACTAC
CTCGCGCTGT TCGGACCCTG CCTTGCAATC GCCTGGGCGG TGGGGCGGAC ACCAGTTCCG
TACCGCCGAC TCGCCGTCGC GGGCTTCGCG GCCTATCTCG CAACGCTCGG GGCGATCCTG
TTCCACGTGA CCAGCCTCTG GGGCGACAAG CTCGACGCAG CGGAGACCTG GTTCGTTCAC
GCCTACAAGT CGCCGCGAGC TGCGGAGCAC CTCGCACTCC TGTACCTCGA GCAGGGCCGC
TTCAACGAGG CCTACCAGGT CATACGGATC CAGGTGGATG ATTGCCCTCA GTGCCTCGCC
TCGGTGACCC AGGCCGCGCT GCTCGCCTGC GCAGCGGGAG AGGCTCAGCG CACCCGAGAC
TACTTCGCTC AAGCGGAAGC GCTTGCCATC GAGGCCCGCA ACGTCAGCGG AGCGGCGACG
ACGCTGACTG CAATGCACAA CGCAATCGAA GACGGAAAAT GCTCCCTGGT CGACTACGAT
CAACTCGAAA CACTCAACCG CAGCCTCCTG CGCCACCAAA CGGGGGGGCT CGGGACACTC
AGCCGCAAGG CGATTCACAT GAACCTGGAA CGGATCGCAC TGGCCAAGGG TGACGCGAAT
TCCGCACTGG ACCACCTGAA ACAGGCCTGG GCGGTAGACC GGGACCGCGC GCTCGGACAT
GCGATCATTG ACGCACTACT CGAACGGCAT GAAATCGAGA ACGCGGAGGT ATTCCACCGC
AAAGTACTTT GCCGCGAGTT TCCCAAGCAC CCGGTGCTCG CCAACGTCGC ACGCAAGCAA
TGCGATGAAT CGATGCAGGC CATACTCGAG GCGGCAAGCA GTCATCCGCA ACGCACGGGT
GACGCAAAAA CCGCCGCAAC ACCATGA
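
A GC content figure like the one in this record can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted in the 10-base block layout used here. Only the first line of the gene sequence is included for brevity, so the value it yields applies to that line and not to the whole gene. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above; paste all lines for the full gene.
FIRST_LINE = "ATGTCCACTT CCACAACGAT AACGCGCTCA GCATGGCTTT TCCTCGGACT GCTCGCCATC"

print(len(clean_sequence(FIRST_LINE)))
print(round(100 * gc_fraction(FIRST_LINE)), "% GC")
```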
 
Protein sequence
MSTSTTITRS AWLFLGLLAI ACGLLYIPGL TGALYYDDIR PLSGLASVVN LDSALYYLSS 
EISGPLGRPV AMLSFLVHVE DWPGAVENIF LFNVLLHLTN GTLVALLVHR LLSLRGVAGS
APAWIAASTA AIWMLMPLQV SSSLIAVQRM ATLSAFFVLA GLLIHVQGIA IEDRRRALGA
ALQALGLVGF TLLAMFTKEN GILLPVFALV IQATLLADHT SPGRLRLLRT VAGGGALAII
LAYLAYSAFR SGGVFGGREF DLLERIQTQP LILLEYLREA FAPRPYGLHP FHDGYPKVSA
LAEHPVALFA AILWPTLAVL AVRFRRRYPV AAFAVLWFLA AHLLESTVLG LELYFEHRNY
LALFGPCLAI AWAVGRTPVP YRRLAVAGFA AYLATLGAIL FHVTSLWGDK LDAAETWFVH
AYKSPRAAEH LALLYLEQGR FNEAYQVIRI QVDDCPQCLA SVTQAALLAC AAGEAQRTRD
YFAQAEALAI EARNVSGAAT TLTAMHNAIE DGKCSLVDYD QLETLNRSLL RHQTGGLGTL
SRKAIHMNLE RIALAKGDAN SALDHLKQAW AVDRDRALGH AIIDALLERH EIENAEVFHR
KVLCREFPKH PVLANVARKQ CDESMQAILE AASSHPQRTG DAKTAATP
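
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons, which this sketch ignores). The function and variable names are assumptions for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCCACTT CCACAACGAT AACGCGCTCA"))  # MSTSTTITRS
```

The output matches the first ten residues of the protein sequence listed above.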