Gene Tmz1t_3488 details


Gene Information

Locus tag: Tmz1t_3488
Symbol:
ID: 7872994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3826354
End bp: 3827652
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 65%
IMG OID: 643700428
Product: protein of unknown function DUF21
Protein accession: YP_002890459
Protein GI: 237654145
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
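
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive coordinates on the replicon and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the listed values but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3826354
end_bp = 3827652
gene_length_bp = 1299
protein_length_aa = 432

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, remainder, coding_codons)  # prints: 1299 0 432
```

The span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.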


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTGTCA ACGAACTGCT CGTCATCCTG CTGCTGATCG CAGCGAGCGC CTTCTTTTCC 
ATGTCCGAGA TCTCCCTCGC GGCTGCGCGC AAGATCAAGC TGCGCGTCAT GGCCGAGGCA
GGGCATCTCA ACGCCCGGCG CGTGCTTGCG CTGCAGGACA GTCCGGGCCA CTTCTTCACC
GTGGTGCAGA TCGGCCTCAA CGCCGTCGCC ATTCTCGGCG GCGTAGTCGG CGAACAGGCG
CTTTCTCCCT ACGTCACCGA GCTGCTGCGC CGGGCCTACG CCGGCCCGAT GCTCGACACC
ATCGCCTTCG TCGTCTCCTT CGTTTTCGTC ACCTCGCTGT TCGTGCTTTT TGCCGACCTC
ATGCCCAAGC GCCTGGCCAT GGTCCAGCCC GAACGCATCG CGGTGGCGGT GGTGCAGCCG
ATGCAGGTGT GCATGTGGCT GTTCGCGCCG CTGGTGTGGG TCTTCAACGG CGCGGCCAAC
CTCATCTTCC GCTGGTTCAA GTTGCCGAGC GTGCGCATCG AGGACATCAC CGCCGACGAC
ATCATGGCCA TGGCTGACGC CGGTGCCCAG GCCGGCGCGC TGCTGCGCCA GGAGCAGCAC
CTGATCAGCA ACGTGTTCGA GCTCGACTCG CGCATCGTGC CCTCGGCGAT GACCTCGCGC
GAGAACATCG TCTTCCTCAC CCTGTCGGAG TCCGAGGAGA GCATCCGGCG CAAGATCGCC
GCCCACCCGC ACGGGAAGTT TCCGGTGTGC GAGGACGGCA TCGACAGCGT GATCGGCTAT
GTGGACTCGA AGGACATCCT GCCGCGCATC GTGCAGGGCC AGGATCTGTC CTTGCGCACC
CAGCCGATCG TGCGCAAGGT GCTGATGCTG CCCGACACGC TCACGCTCTT CGAGGCGCTC
GAGCGCTTCC GCGACGCCAA GGAGGATTTC GCGCTCATCC TCAACGAATA CGCGCTGGTG
GTGGGCCTGC TGTCGCTGCA GGACGTGATG AACACGGTGA TGGGCGATCT CGTCAGCCCC
TTCCAGGAAG AGCTCATCGT GCGCCGAGAC GACAACTCCT GGCTCATCGA CGGCGCCACG
CCGATCGAGG ACGTCATGCA GGCGCTCGAG ATCGAGGTCT TCGAGGGCTT CCAGAACTAC
GAGACCGTCG CCGGTTTCCT GATGTACCGC CTGCGCAAGG TCCCCAAGCG CACCGACTTC
GTGACCTACC TCGGCTACAA GTTCGAGGTG GTCGACATCG ACAACTACCG CATCGACCAA
GTGCTGGTCA CCCGCGAGAC CCCGGTCGGC GCCGTGTAA
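
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC fraction such as the one listed for this gene could be computed. The helper names `clean` and `gc_fraction` are hypothetical, and only the first printed line of the sequence is used as sample input.

```python
def clean(seq: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty input)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence, used here only as sample input.
first_line = "GTGCCTGTCA ACGAACTGCT CGTCATCCTG CTGCTGATCG CAGCGAGCGC CTTCTTTTCC"
print(len(clean(first_line)), round(100 * gc_fraction(first_line)))
```

Passing the whole gene sequence to `gc_fraction` instead of one line gives the value for the full CDS.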
 
Protein sequence
MPVNELLVIL LLIAASAFFS MSEISLAAAR KIKLRVMAEA GHLNARRVLA LQDSPGHFFT 
VVQIGLNAVA ILGGVVGEQA LSPYVTELLR RAYAGPMLDT IAFVVSFVFV TSLFVLFADL
MPKRLAMVQP ERIAVAVVQP MQVCMWLFAP LVWVFNGAAN LIFRWFKLPS VRIEDITADD
IMAMADAGAQ AGALLRQEQH LISNVFELDS RIVPSAMTSR ENIVFLTLSE SEESIRRKIA
AHPHGKFPVC EDGIDSVIGY VDSKDILPRI VQGQDLSLRT QPIVRKVLML PDTLTLFEAL
ERFRDAKEDF ALILNEYALV VGLLSLQDVM NTVMGDLVSP FQEELIVRRD DNSWLIDGAT
PIEDVMQALE IEVFEGFQNY ETVAGFLMYR LRKVPKRTDF VTYLGYKFEV VDIDNYRIDQ
VLVTRETPVG AV
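
The protein sequence begins with M although the gene sequence begins with GTG rather than ATG. The sketch below illustrates one way to translate such a CDS. It relies on NCBI translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code, and it assumes the input is a complete CDS, so the first codon is always read as methionine. The function name `translate_cds` is hypothetical, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Codons enumerated in TCAG order (first base varies slowest), paired with
# the amino acids of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the sequence starts at its initiation codon, so the first
    codon is emitted as 'M' whatever its usual meaning (e.g. GTG = Val).
    """
    seq = "".join(seq.split()).upper()
    if len(seq) < 3 or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCTGTCA ACGAACTGCT CGTCATCCTG"))  # prints: MPVNELLVIL
```

The output agrees with the first ten residues of the protein sequence listed above.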