Gene Tmz1t_3876 details


Gene Information

Locus tag: Tmz1t_3876
Symbol:
ID: 7873527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4273343
End bp: 4274773
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 70%
IMG OID: 643700818
Product: protein of unknown function DUF404
Protein accession: YP_002890841
Protein GI: 237654527
COG category: [S] Function unknown
COG ID: [COG2308] Uncharacterized conserved protein
TIGRFAM ID:
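
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4273343
END_BP = 4274773
GENE_LENGTH_BP = 1431
PROTEIN_LENGTH_AA = 476


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 4274773 - 4273343 + 1 = 1431 bp, and 1431 / 3 - 1 = 476 aa.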


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCG CCAAGCCTTT CGATGAGATG CACGCCGCCG ACGGCACGAT CCGGACGCAC 
TACCAGCCCT ACAGCGAGTG GCTCGGGCAC ACCCCGCGCG AGCTGATCGC GCGCAAGCGC
AAGGAGGCCG ATCTCGCCTT CCACCGCGTC GGCATCACCT TCAACGTCTA TGGCGCCGAC
GGCGGCAAGG AGCGCCTGAT CCCCTTCGAC CTGCTGCCGC GCATCATCCC CGGCGACGAA
TGGCGCACGC TCGAGGCCGG CCTGCGCCAG CGCGTGCGCG CGCTCAACGC CTTCCTCGCC
GACATCTACC ACGGCCAGGA GATCCTGCGC GCCGGACGCA TCCCCGCCGA CCAGGTGCTG
GACAACGCCC AGTTCCGCCC CGAGATGAAG GGTGTGCACG TGCCCGGCGG GCTGTACGCG
ATGATCGCCG GCATCGACCT GGTGCGTGCC GCGGGTGCCG ACGGCAAGGG CGACTACTAC
GTGCTGGAGG ACAACCTGCG CGTGCCCTCG GGGGTGAGCT ACCTGCTGGA GAACCGCAAG
ATGATGATGC GGCTCTTCCC CGACCTGTTC TCGCGCTACG ACGTGCAGCC GGTCGAGCAC
TACCCCGACC TGCTGCTCGA GACCCTGCGC GCGGTGGCGC CGGCCGGCGT ACTCGACCCC
ACCGCGGTGC TGCTCACCCC GGGCGCCTTC AACAGCGCCT ACTTCGAGCA CAGCTTCCTC
GCCCAGCAGA TGGGCATCGA GCTCGTCGAG GGCCAGGACC TCTTCGTCGA GGACGACACC
GTGTTCATGC GCACCACCCA GGGGCCGCGG CGGGTGGATG TGATCTACCG GCGGCTGGAC
GACGACTTCC TCGACCCCGA GGTCTTCCGC GCCGACTCGA TGCTCGGTGT GCCGGGCCTG
ATGTCGGCCT ACCGCGCCGG CCGGGTGACG CTGGCCAATG CGGTCGGCAC GGGGGTGGCG
GACGACAAGT CGATCTACCC CTACGTGCCC GAGATGGTGC GCTTCTACCT CGGCGAGGAG
CCGATCCTCA ACAACGTCCC GACCTGGATG CTGCGCGAAC CCGACGACCT CGCCTACACG
CTCGCCCACC TGCCCGAGCT GGTGGTCAAG GAGGTCCATG GCGCCGGCGG CTACGGCATG
CTGGTGGGGC CGGCGGCCAC CAAGGCCGAG ATCGAGGAGT TCCGCCAGCG CATCCTCGCC
GCGCCGGAGA AGTACATCGC CCAGCCGACC TTGTCCTTGT CCACCTGCCC GACCTTCGTC
GACGCCGGCA TCGCGCCGCG CCACATCGAC CTGCGACCCT TCGTGCTCTC GGGCGGACGC
GAGATCCGCA TGGTGCCGGG CGGCCTCACC CGCGTCGCGC TCAAGGCCGG CTCGCTGGTG
GTCAACTCCT CGCAGGGCGG GGGCACCAAG GACACCTGGG TGGTGGGCTG A
 
Protein sequence
MNAAKPFDEM HAADGTIRTH YQPYSEWLGH TPRELIARKR KEADLAFHRV GITFNVYGAD 
GGKERLIPFD LLPRIIPGDE WRTLEAGLRQ RVRALNAFLA DIYHGQEILR AGRIPADQVL
DNAQFRPEMK GVHVPGGLYA MIAGIDLVRA AGADGKGDYY VLEDNLRVPS GVSYLLENRK
MMMRLFPDLF SRYDVQPVEH YPDLLLETLR AVAPAGVLDP TAVLLTPGAF NSAYFEHSFL
AQQMGIELVE GQDLFVEDDT VFMRTTQGPR RVDVIYRRLD DDFLDPEVFR ADSMLGVPGL
MSAYRAGRVT LANAVGTGVA DDKSIYPYVP EMVRFYLGEE PILNNVPTWM LREPDDLAYT
LAHLPELVVK EVHGAGGYGM LVGPAATKAE IEEFRQRILA APEKYIAQPT LSLSTCPTFV
DAGIAPRHID LRPFVLSGGR EIRMVPGGLT RVALKAGSLV VNSSQGGGTK DTWVVG
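
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned and compared follows. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical and are not part of any database tool.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate codon by codon; optionally stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence and of the protein sequence above.
    first_dna = clean(
        "ATGAACGCCG CCAAGCCTTT CGATGAGATG CACGCCGCCG ACGGCACGAT CCGGACGCAC"
    )
    first_protein = clean("MNAAKPFDEM HAADGTIRTH")
    print(translate(first_dna) == first_protein)
    print(f"GC fraction of first line: {gc_fraction(first_dna):.2f}")
```

Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.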