Gene Tmz1t_3878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3878 
Symbol 
ID7873529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4275906 
End bp4278512 
Gene Length2607 bp 
Protein Length868 aa 
Translation table11 
GC content75% 
IMG OID643700820 
Productprotein of unknown function DUF404 
Protein accessionYP_002890843 
Protein GI237654529 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGTGC AGCCGCACAG CCCGCCCGCG CTGCTCGCCC GCTACGTCGC GCGGCCCGAC 
CGCTACGACG AGCTCTGCCA ACGCATCGGC AGCGTCGTGG TGCTGCGGCC GCACTGGCGG
GACTTCTTCC ACGGCCTCGC CGCCATGCCC GCCGACGAGC TGTCGGCACG GCGGGCCGCG
CTCGGCCGGC AGATCCACGA GAACGGCATC ACCTACAACG TGTATGCCGA TCCGCGCGGC
TTCGCGCGCC CGTGGGAGCT CGACCTGCTG CCCTACATGC TGCCCGCGGC GGAGTGGGCG
CAGATCGAGG CCGCGGTGAT CCAGCGCGCC ACCCTGTTCG ACCGCATCCT CGCCGACCTC
TACGGCGAGC AGACCCTGCT CGAGCAGGGC CTGGTGCCGC CCGCGCTCAT CTATGGCCAT
AGCGGCTTTC TGCGCCCGCT CGTGGGCGCG CGGCCGGCGG GCGGGCGCTT CCTGCACCTG
TACGCGGTGG ACCTGGCGCG CTCGCCCGAC GGGCGCTGGT GGGTGGTGGC CGACCGCACC
CAGGCGCCCT CGGGTGCGGG CTACGCGCTG GAGAACCGCA GCGTGGTCGC GCGCGTGCTG
CCCGAGCTCT ACCGCGCCGC CGGCGTCGAG CCGCTGGTGC CCTTCTTCGA GGGCTTCCGC
GACGGCCTCG TCGAGCTCGC GCCGGCCGAC GGTGCGGCGG CCGGGGAAGA TCCCCTGATC
GTGGTGCTCA CGCCCGGGCC CTACAACGAG ACCTACTTCG AGCACGCCTT CCTCGCCCGC
GAGATGGGTT TCCCGCTCGT CGAGGGCCAG GACCTCACCG TGCGCGACGA CAAGGTCTGG
TTGCGCACGC TCGAGGGCCT GCGCCGGGTG CATGTGATCC TGCGCCGGGT GGACGACCTG
TGGTGCGATC CGCTCGAACT GCGCGAAGAC TCCGCGCTCG GCGTCGCCGG CCTGGTGGCG
GCGGTGCGTG CCGGCATGGT GACGGTGGCC AACGCGCTCG GCAGCGGCAT CCTCGAGACC
GGGGCGTTGC TCGGCTACCT GCCGCGGCTG TGCGAGCACC TGCTCGGCAG CAAGCTGCGC
ATGCCCTCGG TGGCGACCTG GTGGTGCGGC GAGCCGGCGG CCTGCGAATA CGCGCTGAAG
CACCTGCGCG AGCTGGTGAT CAAGCCGGCC TACCCCACCC TGGGTGCGCG GCCGGTGTTC
TGCGGCGACC TGCCGGTGGC AGAGCTCGAC GCGCTGGCGG CACGGATCGC GCTGCGGCCC
TTCGACTTCG TCGCGCAGGA GATGGTCAAC CTCTCGCAGG CCCCGGTGCT GGCGGACGAG
GCGGGCGGCG CCGGGAACAA GGCGCCCGCG CGCGACGCGC GCGCGCTGGT CGCGCGCAAC
ATCGGGCTGC GCGTGTTCGC GGCGGCGGGG GCGGAGGGGT TTCGCGTGCT GCCGGGCGGG
CTCGTCCGGG TGGCCTCGGA GGCCGACATG CGCATCGTGT CGATGCAGCA CGGCGGCGGC
AGCAAGGACG CCTGGGTGCT CGGCTCGCCG GCGCGGCCGC GTCCGGTGCC GCGCATCGTG
CCCGGCCAGC TCGCGCCCTT TGCCGGGGGG GCGCGCGTGC TCGGCCTGTC GAGCCGGGTG
GCGGAGAACT TCTTCTGGCT CGGACGCTAC AGCGAGCGCG CCGACGCTGC CGCCCGCCTC
GGCCGCGAGA CGCTGGCGCG CCTGGGCGAG GCCGGCGAGG CCTCGTGGCT GGGCGAGGAT
GGCAAGGTGA CCGCGGCCCT GCAGGCGCTG TGCCGGCAGC GCGGCCTGCT CGCCACCGTG
GCGGGGGAGG CCGACGCGGC ACCCACACCC CCCGCGCCGC TCGCTCAGGC GCTCTTCGAT
CCCGCCCAGC CCGGCAGCGT GGTGGCCAAC CTGCGCCAGG TGCTGCGCGT CGCAGCCCTG
GTGCGCGACC GCCTGTCGCC GGACAGCTGG CGCATCTACA ACCGCCTGTC GGAGTTTGCC
GCGGCGCCGG CTCAGGCCCC GCGCCTCGGC GAGGCGCTGC AGCGCCTGGA CGAGTCGTTG
CTGTCGCTGG TGACGCTGTC GGGTTTCGTC ATCGAGAGCA TGCCGCGCGA CGCCGGCTGG
CGCTTCCTGT CGATCGGGCG GCGCATCGAG CGCCTGCAGT TTCTCGCCGC CGCGCTGTCG
GCGCTGCTGC TCGAGCCCTC GCAAGGCGGG CTCAAGGTGC TGCTCGCGAT CACCGACGCC
GAGCTGCGCT ACCGCAGCCG CCACGCGCGC GGCCTGGCCC CGCAGCCGGT GGCCGAGCTG
GTCATGCTCG ACGGCGACAA CCCGCGCGCG CTGCGCTACC AGCTCGATTC GCTCGCCGAG
CACGTCGCCC TGCTGCCCGA CGGCGCCGCG CTCGCCGCGC CGCTGCGCGA ACGCCTGGCC
GCGCTCGACG CGCTCGCGCT GTGCGACTGC TTCGTGCACG AGCTGGCCGG CGCGCGTAGC
CGCGCGGCCG CACAGGCGCG CCACTGCGAG CCCCTGCGGC AGGTGCTGGA CGACATCTGG
CGCGGCGGCA ACGAGCTCGC CGATGCGCTG GCGCGGCGCT ACTTCACCCA CCTCGACGCC
CGCAGCCGGG CGACCGTGTC GCTGTAG
 
Protein sequence
MQVQPHSPPA LLARYVARPD RYDELCQRIG SVVVLRPHWR DFFHGLAAMP ADELSARRAA 
LGRQIHENGI TYNVYADPRG FARPWELDLL PYMLPAAEWA QIEAAVIQRA TLFDRILADL
YGEQTLLEQG LVPPALIYGH SGFLRPLVGA RPAGGRFLHL YAVDLARSPD GRWWVVADRT
QAPSGAGYAL ENRSVVARVL PELYRAAGVE PLVPFFEGFR DGLVELAPAD GAAAGEDPLI
VVLTPGPYNE TYFEHAFLAR EMGFPLVEGQ DLTVRDDKVW LRTLEGLRRV HVILRRVDDL
WCDPLELRED SALGVAGLVA AVRAGMVTVA NALGSGILET GALLGYLPRL CEHLLGSKLR
MPSVATWWCG EPAACEYALK HLRELVIKPA YPTLGARPVF CGDLPVAELD ALAARIALRP
FDFVAQEMVN LSQAPVLADE AGGAGNKAPA RDARALVARN IGLRVFAAAG AEGFRVLPGG
LVRVASEADM RIVSMQHGGG SKDAWVLGSP ARPRPVPRIV PGQLAPFAGG ARVLGLSSRV
AENFFWLGRY SERADAAARL GRETLARLGE AGEASWLGED GKVTAALQAL CRQRGLLATV
AGEADAAPTP PAPLAQALFD PAQPGSVVAN LRQVLRVAAL VRDRLSPDSW RIYNRLSEFA
AAPAQAPRLG EALQRLDESL LSLVTLSGFV IESMPRDAGW RFLSIGRRIE RLQFLAAALS
ALLLEPSQGG LKVLLAITDA ELRYRSRHAR GLAPQPVAEL VMLDGDNPRA LRYQLDSLAE
HVALLPDGAA LAAPLRERLA ALDALALCDC FVHELAGARS RAAAQARHCE PLRQVLDDIW
RGGNELADAL ARRYFTHLDA RSRATVSL