Gene Tmz1t_3757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3757 
Symbol 
ID7873754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4126833 
End bp4128671 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content68% 
IMG OID643700701 
ProductTonB-dependent receptor plug 
Protein accessionYP_002890725 
Protein GI237654411 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID[TIGR01779] TonB-dependent vitamin B12 receptor 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCC GTGTTTCCGC GCTCGCCTCG GCATGCGCGC TCGCCTGTTT CCTCGCACCG 
GCCGCTCATG CGGCCGATGC CCAGCTCGGC CACGTCGTCG TGACCGCCAC TCGCCAGCCG
ATCAGTGCCG ATGCGGCCCT GGCCAGCGTG GATGTCATCG AGCGCGACGA AATCGCCCGC
GCGGGGCATT CGAGCCTGCT GGGCCTGCTC TCCTCGCGGC CGGGCGTGCA GATGGCACGC
AATGGCGGTC CTGGTTCGAG CGGCAGCATC TTCATCCGCG GGGCGAACTC CGGACACACG
CTGGTGCTGG TCGATGGGGT GCGCATGGGC TCCGCCACCA GCGGCTCTCC GGTGCTGGAG
ACGATCCCGC TCGAACTCAT CGAGCGTATC GAGATCCTGC GCGGCCCCGG AAGTGCGCTG
TACGGCGCGG ATGCGCTCGG TGGCGTGATC CAGGTGTTCA CCCGCAAGGG GCGCGAGGGC
TTCCAGCCCT CCGTGCGCGT CGGGGCCGGC ACGGACGGCG CGCGCGAGGC GAGCGCCACG
CTCGCGGGTG GAAACGAGCG TCTGCGCTAC AGCCTGACCG CCGGTCACGA GCGCAGCGAC
GGCTTCAACG CAAGGCCGAA TTCGGCGACC GCCGCGGATG CCGACGACGA CGGCTTCCGC
GAGGATTATC TGGGCGCGTC GCTGGTGCTG GACCTGGGCG GCAATGACGA GCTCGGCGCC
AACCTGCTGT ATTCCGACAT GCGCAACTGG TACGACGCGG CACAGCCTTT CGACTCCTAC
CTCGACAAGC GTGCCGAGAC CTTCGGCGCG TATCTGCGCA AGCAGCATTC CGCCGACTGG
GCCAGCACCC TGCGCTTCGG TCACGGGGTG GATGCCCTCG ACAACCAGTC GAACGCGAGC
ACGCGTTCCC GCTTCGATAC CACGCAGCGC CAGTTGAGCT GGCAGCACGA CGTCGCCGTA
GGTGGCGGCT CGCTGATGGC GGCGTACGAG TTTCTGCAGC AGCGCGTGAA GACCACCTCC
GACTTCGAGA AGACGCGTCG CCACATCAAC GCCTTCCTGC TCGGCTGGGG TGGTGAATTC
GACCGCCACA ACGTCCAGCT CAATGCGCGC CACGACCGCA ACTCGCAGTT CGGTGGCAAG
ACCACCGGCG CCGCGGCCTA CGGCTATCGG CTTGCGCCCG AGTGGCGCGC CCATGCCAGC
ATCGGCACGG CGTTCAAGGC GCCGACCTTC AACGATCTGT ACTTCCCGGT GGAGTGCTAC
GGTGCCTGGG GCTGCTTCGG CGGCAATCCG GACCTGGAGC CCGAGGAGGC GCTCAACCGT
GAGCTGGGCG TGGCCTGGGA GCGCAACGGG GTCGGTGTCG ACCTGACCTA TTTCAACAAC
CGCATCAAGA ACCTCATCGA CTGGAGCACC GGCATCGCCT CCAATGTGGG CAGGGCCGAC
ATCCAGGGGA TCGAGGCGGC GCTGTCCGCG ACCCTGGGCG ACTACCGGCT GCGCGCCAGC
GTCGACCTGC TCGACGCGGA GGATGACGAG ACCGGCGATC AGCTCGGCCG CCGCGCGCGC
GTCGGCGGGG CGCTCGCGCT GGAGCGTGCG GTGGGCGCGT GGACCTGGGG TGTGGAGTGG
AACGGCAAGG GGCGGCGCTA TGATCAGGTG CCGAACGCGG TGTCCAATCG CCTCGGCGGC
TACGGTCTGG TCGATGCCTA CGCGCACTAC GCCGTGGCGC GCGACTGGAG CGTCGAAGTT
CGTGCCAACA ATCTACTCGA CAAGGACCAT GAGCTTGCGA AGGGGTTCGC AACACAGGGC
AGGAGTGCCT TCGTCGCGCT GCGCTACGCG ATGCACTGA
 
Protein sequence
MKIRVSALAS ACALACFLAP AAHAADAQLG HVVVTATRQP ISADAALASV DVIERDEIAR 
AGHSSLLGLL SSRPGVQMAR NGGPGSSGSI FIRGANSGHT LVLVDGVRMG SATSGSPVLE
TIPLELIERI EILRGPGSAL YGADALGGVI QVFTRKGREG FQPSVRVGAG TDGAREASAT
LAGGNERLRY SLTAGHERSD GFNARPNSAT AADADDDGFR EDYLGASLVL DLGGNDELGA
NLLYSDMRNW YDAAQPFDSY LDKRAETFGA YLRKQHSADW ASTLRFGHGV DALDNQSNAS
TRSRFDTTQR QLSWQHDVAV GGGSLMAAYE FLQQRVKTTS DFEKTRRHIN AFLLGWGGEF
DRHNVQLNAR HDRNSQFGGK TTGAAAYGYR LAPEWRAHAS IGTAFKAPTF NDLYFPVECY
GAWGCFGGNP DLEPEEALNR ELGVAWERNG VGVDLTYFNN RIKNLIDWST GIASNVGRAD
IQGIEAALSA TLGDYRLRAS VDLLDAEDDE TGDQLGRRAR VGGALALERA VGAWTWGVEW
NGKGRRYDQV PNAVSNRLGG YGLVDAYAHY AVARDWSVEV RANNLLDKDH ELAKGFATQG
RSAFVALRYA MH