Gene Information |
Locus tag | Tmz1t_3850 |
Symbol | |
ID | 7874092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4247254 |
End bp | 4249269 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700792 |
Product | TonB-dependent receptor |
Protein accession | YP_002890816 |
Protein GI | 237654502 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
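The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the Start bp and End bp fields of this record.
    length = gene_length(4247254, 4249269)
    print(length, protein_length(length))
```

With the values from this record, the check reproduces the listed Gene Length (2016 bp) and Protein Length (671 aa).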
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.302325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTGGG GAGCTGAAAA GGTCGTGGAT CCTCGTCGGT TGAAGCTGGC CGTGCTGGGC CTCGGCGTGC TGCTCGCGTT CCCGTCCGAG GCTGGCGCCG CACAGGACGA CCTGACCGCG CTGCCCTTCG AGGAGTTGCT GCTGCGCGAT TTCGTGTCCG CGTCGCGGCT CGCACGCCAG GTGAGCGACT CGCCTGCGGC GGTGGCGATC GTCACCGCCG ACGACATCCG CGCCTACGGC TACCGCACGC TCGCCGACGT CATCAACGGC ATGCGCGGCC TGTACACCAC CGACGAGCGC ACCTACCACT ACATGGGCGG GCGCAGCTTC GGCGACGTCG AGGACTACGC CGGCCGCGTG ATGCTGCTGA TCGACGGCTA CGCGGTGCAG GACAACCTCT TCGACCAGGC CTACATCGAC GAATCCGGCC TGATCGACCT GGAACTGGTC GATCGCGTCG AGTACGTGCC GGGAACCGGC TCGGTGACCT ACGGCAACAA CGCGCTGCTC GGCATCCTCA ATGTCGTCAC CCGGCGCGGA CGCGACTTCG ACGGTGCGCG CGTGTCGGCG GAGATCTCCA GCCGCGGCGC CAGCCGCCAG CGCGCCACCT GGGGCAAGCA TTTCAACAAC GGCGCCGAGG TCCTGCTCTC GGCCTCGACG CTCGACGTCG ATGGCCGCAA CCTGTACTTC CCCGCCTACG ACACGCCCGC GACCAACTTC GGCGTGGCCG AGGGGCTCGA CGGCGAGCGT AACCAGCGCG TGTTCGGAAA GCTGTCGTGG TCGGGCTGGA CCGTTCAGGC GGCGTGGGTG GAGCGCGAGA AGAGCGTACC CACGAACCCC TCCGCATACA CCGCCTTCAA CACGCCGTTC CCGACACGCG ACGAGAGTGC CTTCCTCGGG GTGCGCCACG AGACCGACCT CGGTCTGCAG CTGTATTCCT CGTCCAGTCT GATGTTGGGA CGCTACGCCT ACTGGAACCA GCGCGAATAC GCCCTCGACG AGGACGGCGA GTACGACGAC GGCGAGAAGT ACGGCGTGCG CGACTACCAC GGCGCGTGGT GGCGCTTCGA CCAGAAGTTC GTCGGGCGCT GGTTCGTCGA CCACACGCTG GTATTCGGCG CGGAGCTGCG CGACGACCAC CGCCAGTCCT TCCACCGCCG CTTCCTCTCG CCCGCTGGCG AAGTCACGGA TCGTGACGAC GGCGAGCTTT CGCGCCGCAC CTTCAGTCTC TACGTCGCCG ACGACTACCG GCTGAACCAG CAGTGGACGC TCAACCTGGG CGTGCGTCAC GACGACGCCG ACGATCTCGA CGGCAACCTC AGCCCGCGTG CCGCGTTGAT CTGGCAGCAG GATCCGGCGA CGACCTGGAA GGCTTCGTAC AGCGAGGCCT TCAAGATGCC CAACGCCAAC GACCGCTGGA CGTCCGACGA CACGGCCGTC CCCGAGTACG TCGCCGCCAC CGAGCTCGTG CTGCAGCGCC AGCTCGCGCC GCACACGCGC TTCACCGGCT CCCTGTACCG CTACCGGCGC AGCGACCTGC CGATCGAAAA CGCGGACGGG GACGAGGTTC CCGAGGGAAG CAGCCGCGCG CGCGGCGTCG AGACCGAGAT CGAGCATGTC TGGGAGCGCG GGGCGCGTGC GCGCGCCAGC GTGGCCTGGC AGCGCTCGCG CGATGTGTAC GGGCGCGACG CGGTCAACTC GCCCGACCTG CTCGGCAAGC TCGCCTTCAC CTTCCTGCTG CCGGGCGAGG CGCTGCGTGC CGGCCTCGAG ACGCAATATC TCGGCCCGCG CCTGACCCGC 
GAGCGGCGCA TGCTGGGCGG GGTGACGCTT TCCAATCTGA CCCTGTCCAC CGAGCGCGAC TGGCATGGCC TGTCGGCCTC GCTGAGCGTG CGCAACCTGT TCGATCGTGA CTACGAGACC GTGTCGGGCT TCGACTGGCG GCCCGGTGAC GTGGCACAGG ACGGCCTGCG CATGGACGGG CGCAGCGTCT GGCTGCAGGT CGGGTACGCG CTATGA
|
Protein sequence | MEWGAEKVVD PRRLKLAVLG LGVLLAFPSE AGAAQDDLTA LPFEELLLRD FVSASRLARQ VSDSPAAVAI VTADDIRAYG YRTLADVING MRGLYTTDER TYHYMGGRSF GDVEDYAGRV MLLIDGYAVQ DNLFDQAYID ESGLIDLELV DRVEYVPGTG SVTYGNNALL GILNVVTRRG RDFDGARVSA EISSRGASRQ RATWGKHFNN GAEVLLSAST LDVDGRNLYF PAYDTPATNF GVAEGLDGER NQRVFGKLSW SGWTVQAAWV EREKSVPTNP SAYTAFNTPF PTRDESAFLG VRHETDLGLQ LYSSSSLMLG RYAYWNQREY ALDEDGEYDD GEKYGVRDYH GAWWRFDQKF VGRWFVDHTL VFGAELRDDH RQSFHRRFLS PAGEVTDRDD GELSRRTFSL YVADDYRLNQ QWTLNLGVRH DDADDLDGNL SPRAALIWQQ DPATTWKASY SEAFKMPNAN DRWTSDDTAV PEYVAATELV LQRQLAPHTR FTGSLYRYRR SDLPIENADG DEVPEGSSRA RGVETEIEHV WERGARARAS VAWQRSRDVY GRDAVNSPDL LGKLAFTFLL PGEALRAGLE TQYLGPRLTR ERRMLGGVTL SNLTLSTERD WHGLSASLSV RNLFDRDYET VSGFDWRPGD VAQDGLRMDG RSVWLQVGYA L
|
| |
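The GC content and Translation table fields can be related to the sequences above with a short script. This is a sketch under stated assumptions: the sequence is pasted as shown, in space-separated blocks of 10; the helper names `clean`, `gc_content`, and `translate` are hypothetical; and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    fragment = "ATGGAGTGGG GAGCTGAAAA GGTCGTGGAT"
    print(translate(fragment))
    print(round(gc_content(fragment), 2))
```

Translating the first three blocks of the gene sequence gives MEWGAEKVVD, which matches the first block of the listed protein sequence. Running `gc_content` on the full gene sequence is the way to compare against the listed GC content value.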