Gene Tmz1t_3850 details

Gene Information

Locus tag: Tmz1t_3850
Symbol:
ID: 7874092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4247254
End bp: 4249269
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 69%
IMG OID: 643700792
Product: TonB-dependent receptor
Protein accession: YP_002890816
Protein GI: 237654502
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
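
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a coding sequence whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4247254, 4249269)
print(length)                  # 2016
print(protein_length(length))  # 671
```

Both results agree with the Gene Length and Protein Length values listed in this record.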


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.302325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTGGG GAGCTGAAAA GGTCGTGGAT CCTCGTCGGT TGAAGCTGGC CGTGCTGGGC 
CTCGGCGTGC TGCTCGCGTT CCCGTCCGAG GCTGGCGCCG CACAGGACGA CCTGACCGCG
CTGCCCTTCG AGGAGTTGCT GCTGCGCGAT TTCGTGTCCG CGTCGCGGCT CGCACGCCAG
GTGAGCGACT CGCCTGCGGC GGTGGCGATC GTCACCGCCG ACGACATCCG CGCCTACGGC
TACCGCACGC TCGCCGACGT CATCAACGGC ATGCGCGGCC TGTACACCAC CGACGAGCGC
ACCTACCACT ACATGGGCGG GCGCAGCTTC GGCGACGTCG AGGACTACGC CGGCCGCGTG
ATGCTGCTGA TCGACGGCTA CGCGGTGCAG GACAACCTCT TCGACCAGGC CTACATCGAC
GAATCCGGCC TGATCGACCT GGAACTGGTC GATCGCGTCG AGTACGTGCC GGGAACCGGC
TCGGTGACCT ACGGCAACAA CGCGCTGCTC GGCATCCTCA ATGTCGTCAC CCGGCGCGGA
CGCGACTTCG ACGGTGCGCG CGTGTCGGCG GAGATCTCCA GCCGCGGCGC CAGCCGCCAG
CGCGCCACCT GGGGCAAGCA TTTCAACAAC GGCGCCGAGG TCCTGCTCTC GGCCTCGACG
CTCGACGTCG ATGGCCGCAA CCTGTACTTC CCCGCCTACG ACACGCCCGC GACCAACTTC
GGCGTGGCCG AGGGGCTCGA CGGCGAGCGT AACCAGCGCG TGTTCGGAAA GCTGTCGTGG
TCGGGCTGGA CCGTTCAGGC GGCGTGGGTG GAGCGCGAGA AGAGCGTACC CACGAACCCC
TCCGCATACA CCGCCTTCAA CACGCCGTTC CCGACACGCG ACGAGAGTGC CTTCCTCGGG
GTGCGCCACG AGACCGACCT CGGTCTGCAG CTGTATTCCT CGTCCAGTCT GATGTTGGGA
CGCTACGCCT ACTGGAACCA GCGCGAATAC GCCCTCGACG AGGACGGCGA GTACGACGAC
GGCGAGAAGT ACGGCGTGCG CGACTACCAC GGCGCGTGGT GGCGCTTCGA CCAGAAGTTC
GTCGGGCGCT GGTTCGTCGA CCACACGCTG GTATTCGGCG CGGAGCTGCG CGACGACCAC
CGCCAGTCCT TCCACCGCCG CTTCCTCTCG CCCGCTGGCG AAGTCACGGA TCGTGACGAC
GGCGAGCTTT CGCGCCGCAC CTTCAGTCTC TACGTCGCCG ACGACTACCG GCTGAACCAG
CAGTGGACGC TCAACCTGGG CGTGCGTCAC GACGACGCCG ACGATCTCGA CGGCAACCTC
AGCCCGCGTG CCGCGTTGAT CTGGCAGCAG GATCCGGCGA CGACCTGGAA GGCTTCGTAC
AGCGAGGCCT TCAAGATGCC CAACGCCAAC GACCGCTGGA CGTCCGACGA CACGGCCGTC
CCCGAGTACG TCGCCGCCAC CGAGCTCGTG CTGCAGCGCC AGCTCGCGCC GCACACGCGC
TTCACCGGCT CCCTGTACCG CTACCGGCGC AGCGACCTGC CGATCGAAAA CGCGGACGGG
GACGAGGTTC CCGAGGGAAG CAGCCGCGCG CGCGGCGTCG AGACCGAGAT CGAGCATGTC
TGGGAGCGCG GGGCGCGTGC GCGCGCCAGC GTGGCCTGGC AGCGCTCGCG CGATGTGTAC
GGGCGCGACG CGGTCAACTC GCCCGACCTG CTCGGCAAGC TCGCCTTCAC CTTCCTGCTG
CCGGGCGAGG CGCTGCGTGC CGGCCTCGAG ACGCAATATC TCGGCCCGCG CCTGACCCGC
GAGCGGCGCA TGCTGGGCGG GGTGACGCTT TCCAATCTGA CCCTGTCCAC CGAGCGCGAC
TGGCATGGCC TGTCGGCCTC GCTGAGCGTG CGCAACCTGT TCGATCGTGA CTACGAGACC
GTGTCGGGCT TCGACTGGCG GCCCGGTGAC GTGGCACAGG ACGGCCTGCG CATGGACGGG
CGCAGCGTCT GGCTGCAGGT CGGGTACGCG CTATGA
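
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is supplied as text with the same space and line-break layout used in this record; `gc_percent` is an illustrative name, not a function of any particular library.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(bases.count(b) for b in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence: six of them are G, none are C.
print(gc_percent("ATGGAGTGGG"))  # 60.0
```

Passing the full gene sequence to the same function would give the value to compare against the rounded figure in the GC content field.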
 
Protein sequence
MEWGAEKVVD PRRLKLAVLG LGVLLAFPSE AGAAQDDLTA LPFEELLLRD FVSASRLARQ 
VSDSPAAVAI VTADDIRAYG YRTLADVING MRGLYTTDER TYHYMGGRSF GDVEDYAGRV
MLLIDGYAVQ DNLFDQAYID ESGLIDLELV DRVEYVPGTG SVTYGNNALL GILNVVTRRG
RDFDGARVSA EISSRGASRQ RATWGKHFNN GAEVLLSAST LDVDGRNLYF PAYDTPATNF
GVAEGLDGER NQRVFGKLSW SGWTVQAAWV EREKSVPTNP SAYTAFNTPF PTRDESAFLG
VRHETDLGLQ LYSSSSLMLG RYAYWNQREY ALDEDGEYDD GEKYGVRDYH GAWWRFDQKF
VGRWFVDHTL VFGAELRDDH RQSFHRRFLS PAGEVTDRDD GELSRRTFSL YVADDYRLNQ
QWTLNLGVRH DDADDLDGNL SPRAALIWQQ DPATTWKASY SEAFKMPNAN DRWTSDDTAV
PEYVAATELV LQRQLAPHTR FTGSLYRYRR SDLPIENADG DEVPEGSSRA RGVETEIEHV
WERGARARAS VAWQRSRDVY GRDAVNSPDL LGKLAFTFLL PGEALRAGLE TQYLGPRLTR
ERRMLGGVTL SNLTLSTERD WHGLSASLSV RNLFDRDYET VSGFDWRPGD VAQDGLRMDG
RSVWLQVGYA L
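
The protein sequence corresponds to the gene sequence read under translation table 11. The sketch below assumes the NCBI codon assignments for that table, which give the same amino acids as the standard code and differ only in the permitted start codons; since this gene begins with ATG, no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino acid string shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases such as N are not handled
    and raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGAGTGGG GAGCTGAAAA GGTCGTGGAT"))  # MEWGAEKVVD
```

The output matches the first ten residues of the protein sequence listed in this record.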