Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3733 |
Symbol | |
ID | 7873732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4102869 |
End bp | 4103834 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643700679 |
Product | periplasmic binding protein |
Protein accession | YP_002890703 |
Protein GI | 237654389 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCTC TCCCCTGCCG TCTTTCCGAT GCAGCGGCCT CGGCCGCAGC GGCCACGCTG CTCGCCCTCG GCGGCCCCGC GTCCGCCGCG GGCGTCGAGC TGGTGGACGA CACCGGCCGC AAGCTCGCGC TGGCCGCGCC GGCGCAGCGC ATCGTCAGCC TCGCCCCGCA CGTCACCGAG ATGCTGTTCG CCGCCGGCGC GGGCGAGCGC GTGGTGGGCG CGGTGGACTA CAGCGACTAC CCCGAGGCGG CACAGCGCAT CGCGCGTGTG GGCGGCTACA CCCGGATCGA CCTCGAGGCG GTCGCGGCGC TGCGCCCGGA CCTGGTGATC GGCTGGCAGA GCGGCAACCG CGAAGGCGAT CTCGCCCGCC TGCAGGCGCT CGGCATCCCG GTCTACCTGA GCGAGCCGCG CAACCTGGAA GACGTGGCGC GCAACCTGGA GCGACTCGGG CAGCTCGCCG GCAGCGAACC CGCCGCCCAG GCCGCGGCGA GCGCCTTCCG CGCGCGCCGC GAGCACCTCG CCGCCACGTA TTCGGCACGC GACAAGGTGC GCGTTTTCTA CCAGATCTGG GATCGCCCGC TGATGACGGT GAACGACCAC CACCTGATCG CCGACGTGAT CCGCCTCTGT GGCGGCGCCA ACGTGTTCGG CGAGGTCGCC CACCTGACGC CGACGATCGG CGTCGAGGCG GTGCTCGCGG CCAACCCCGA GGTGATCGTG GCCTCGGGCA TGGGCGAGGC CCGTCCGGAG TGGCTCGACC AGTGGTCGCG CTGGCCGCAG CTCGAGGCCG CGCGCCGCGA CAACCTGTTC TTCATCCCGC CCGAGCTCAT CCAGCGCCAT ACGCCGCGCA TCCTCGACGG CGCGGCGCGC CTGTGCGGCC AGGTCGAGAC CGCACGCAAG CGCCGCGGCG GCGCCTCCGC GGCGCTCACG CCCCCTGCAG CACCCGCTCC GGCGCGCGCG GACTGA
|
Protein sequence | MSSLPCRLSD AAASAAAATL LALGGPASAA GVELVDDTGR KLALAAPAQR IVSLAPHVTE MLFAAGAGER VVGAVDYSDY PEAAQRIARV GGYTRIDLEA VAALRPDLVI GWQSGNREGD LARLQALGIP VYLSEPRNLE DVARNLERLG QLAGSEPAAQ AAASAFRARR EHLAATYSAR DKVRVFYQIW DRPLMTVNDH HLIADVIRLC GGANVFGEVA HLTPTIGVEA VLAANPEVIV ASGMGEARPE WLDQWSRWPQ LEAARRDNLF FIPPELIQRH TPRILDGAAR LCGQVETARK RRGGASAALT PPAAPAPARA D
|
| |