Gene Tmz1t_0031 details


Gene Information

Locus tag: Tmz1t_0031
Symbol:
ID: 7083414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 37900
End bp: 39534
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 67%
IMG OID: 643697081
Product: TIR protein
Protein accession: YP_002353730
Protein GI: 217968496
COG category:
COG ID:
TIGRFAM ID:
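
The reported lengths can be cross-checked against the coordinates in the table. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table above.
START_BP = 37900
END_BP = 39534


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1635
print(protein_length(length_bp))  # 544
```

Both results agree with the Gene Length (1635 bp) and Protein Length (544 aa) fields.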


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACATCG CAGCGAAGAT CGCCCCGAAG GTGTTCGTGT CCTATGCCCG GCAGGACTGC 
AGCACGCTGG CCGAGGAACT GGTCACGGCG CTTGAGTTGC TTCAGTTCGA GGGCTACCTG
GACCGCAGCG ACATCGCGGC GGGGGAGGAC TGGGAGCATC GGCTGGATGC CCTGATCCGC
CAGGCGGATA CGGTGGTCTT TGTCCTCTCG CCACGGTCGG TGCAGTCCGA GCGTTGTGCA
TGGGAGGTGC AGCGCGCGTT GGCACTGTCC AAGCGCATCA TCCCGGTGGT GGGCATGGCG
GTCGATGACG CCTCGGTCCC GGCGCCGCTG CAACGGCTCA ACTACATCCA TTTCACTGCT
GGCCACTCAT TCGCGCGCTC CCTCGGTCAA CTGGCCGACG CGCTGCGGCT GGACATCGGC
TGGATACGTG AACACACGCG ATTGGGTGAG CTCGCTCTTC GATGGAATGA ACGGCAGCAG
CCTGATGCCT TGCTGCTGCG CGGCGACGAG TTGTCTGCCG GGCAGGCGTG GATGGCGAAC
TGGGCGCCGG AGTTTCCGCC CGTCACCGAG TTGCAGCGCA GCTTCATCGC CGCGAGCGCG
GACGCGCAGA CCCGGCGTGA GAGTCTGGAA CGGCAGCAGA ACGAGGCGAT CGCCAAAGCC
AACGCCGAAC GCGCCGAGGC CCTGACCCGC CGCGAGGAGG CGTTGTTCTC GTTGAAGCGC
CGCACCTTGC TGGGCGGCGT GCTCGCGGTG GCGCTCTCTC TCGGGCTCGG CGGCATGGCC
TGGTGGTCGC TGCAGCTTCG CCGCCGCGCC GAGGAGGCCG AACGGACAGC CATCGACGAG
CTTGTTCGCC GCGAGGCCAT GCGCACGGAC ATCAGCGGAC AGATCGTGGC CTACGCCACA
TCGCCCGGCC AGTGGGCGAT GGACAGTGGC GTGGATGGCC ACTCGCCCTA CACCGGCACC
TTGCTGCGGG AACTGCAGTC GCCCGACATC TCGTTGTGGG TGGCGCTATC CAGGACGACG
ACCCAGGTCG CGAAAGCCAC CAACGGCAGT CAGCGGCCGT TCATTTCGTC CGACATGAAC
GGTGACGTGT TCCTCGGCCA CCCCTCACCC ACGCGTCGTC TGCGCGCGCT CGTCATTGGC
GCAGGACGGT TCCAAATGGC GACCGACCTA TCCTTTGAGG GGGCGTACAA GGATGCCGAT
GCCTGGGGCG CGTTTCTGGC CGGGCGCGGA TTCAGCGTGC AAACGCTGCG CGATCCGACG
CGGGCGTCCG TGCTGGCGTC GATCGAGGCA CTTCGCGTTT CCGCTCTAGA CGAGGCGGAC
GCGTCGATTC GACGCGTGGG CATTGCTCTG CAGCCGGATG GCACGCAGGC CCAGCCGTCA
CTCCCCGTGC CTCGTCGAAC ACGGCCCGAC GCCGAGCCGG CGCACGATGC TCTGATCGTC
TTCTTTTACG CCGGCTACGG CTTCCGCGCG GGCGCCGAGC GTTTTCTCGC CGTGTCGGAC
ACGGCATTCG ACACCGCGAA AACCGGGCTG GTGACGGAGC CCGGTGCGAC GGCGGTGTCG
GTGGACGATC TCGAAAAAGT GCTGCGCGAA GCGGCCGCCG CTTCGGTGGT GATCCTGGAC
ACGAACTTTA TCTAG
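
The GC content field in the Gene Information section can, in principle, be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted as shown here (blocks of 10 bases separated by spaces and line breaks); `gc_content` is a hypothetical helper name, and the example input is only the first two blocks, not the full gene.

```python
def gc_content(blocked_sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    sequence = "".join(blocked_sequence.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in sequence)
    return gc_count / len(sequence)


# First two 10-base blocks of the gene sequence above, used as a small example.
example = "ATGGACATCG CAGCGAAGAT"
print(f"{gc_content(example):.0%}")  # 50%
```

Passing the complete gene sequence to the same function gives the value to compare with the reported percentage.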
 
Protein sequence
MDIAAKIAPK VFVSYARQDC STLAEELVTA LELLQFEGYL DRSDIAAGED WEHRLDALIR 
QADTVVFVLS PRSVQSERCA WEVQRALALS KRIIPVVGMA VDDASVPAPL QRLNYIHFTA
GHSFARSLGQ LADALRLDIG WIREHTRLGE LALRWNERQQ PDALLLRGDE LSAGQAWMAN
WAPEFPPVTE LQRSFIAASA DAQTRRESLE RQQNEAIAKA NAERAEALTR REEALFSLKR
RTLLGGVLAV ALSLGLGGMA WWSLQLRRRA EEAERTAIDE LVRREAMRTD ISGQIVAYAT
SPGQWAMDSG VDGHSPYTGT LLRELQSPDI SLWVALSRTT TQVAKATNGS QRPFISSDMN
GDVFLGHPSP TRRLRALVIG AGRFQMATDL SFEGAYKDAD AWGAFLAGRG FSVQTLRDPT
RASVLASIEA LRVSALDEAD ASIRRVGIAL QPDGTQAQPS LPVPRRTRPD AEPAHDALIV
FFYAGYGFRA GAERFLAVSD TAFDTAKTGL VTEPGATAVS VDDLEKVLRE AAAASVVILD
TNFI
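
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation in plain Python, assuming the amino acid assignments of NCBI table 11 (which match the standard code) and ignoring the special handling of alternative start codons, which does not matter for a gene that begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons in TCAG order paired with the amino acid assignments of table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    dna = "".join(blocked_sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence, as small examples.
print(translate("ATGGACATCG CAGCGAAGAT CGCCCCGAAG"))  # MDIAAKIAPK
print(translate("ACGAACTTTA TCTAG"))                  # TNFI
```

The two outputs match the first block and the final residues of the protein sequence listed above.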