Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0603 |
Symbol | |
ID | 7084541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 680201 |
End bp | 681406 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697630 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_002354272 |
Protein GI | 217969038 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAAA GAACAGTCGC AGCAGTGGTC GGCGCCATGC TCGGCGCCGC GGTCGTCGCC GCACCGGCCC ACGCCCAGGA CGTCAAGCTC GGCCTGATGG CGGCCATCTC GGGCCCGATC GCGGCGCTCG CGCCGCCGAT GGCGGCGGCC TCCAAGCTGG CGGTCGCCCA CGTCAACGAA CAGGGCGGCA TCCTCAAGGG CGGCAAGCTC GAGGTCGTGC TCGGCGACAG CGCCTGCAAC CCGCAGAACG CCACCGACGT CGCCACCAAG GCGGTGAACA TCGACCGCGT GATCGCGGTC GTCGGTCCGG CCTGCTCGGG CGCGGTGCTC GCCTCGGCGA ACTCGGTGAC CATCCCCGCC GGCGTTCTGA TGATCACCCC TTCCGGCACC TCGCCCGAGA TCACCAAGCT CAAGGACAAG GACCTCGTCT ACCGCACGCT GCCGTCCGAC GACTACCAGG GCCGCGCCCT GGCGCGCACC CTCAAGGCGC GCGGCATCTC CAAGGTGGCG GTGGCCTACC TGAACAACGA TTACGGCAAG GGCCTGGCCG AATCCTTCAA GTCCGAGTTC GAGGCCAACG GCGGCACCAT CGCCGGCTAT TCGGGCCACG AGGAGGGCAA GGCCTCCTAC CGCTCCGAGC TGGCCACGCT CGCCCGCGGT GGCGCAGACA CCCTGGTCAT CTTCGATTAC GGCGATGGCA CCGGCCTGAG CATCCTGCGC CAGTCGCTCG AGAACAACTT CTTCAAGACC TTCGTCGGCG CCGACGGCAT GAAGTCCGAG GGTCTGGTCA AGGCGATCGG CGCGGCCAAC CTCGGTGGCT TCTTCGTGTC CGCCCCGGTG GGCGAGGCCT CGGCCTCGCT CGACAACTTC AACAAGGCCT TCAAGGCCGC GGGCGAGAAC ATCGACGCCG TGTTCGCCAC CACCTCCTAC GACGCCGCCT TCCTCGCCGC GCTGGCGATC GAGAAGGCGG GGGGCGACAA GACCAAGCTG GCCGAGTCGC TGCGCGCGGT GGCGACCGCG CCCGGCGAGC CGATCCTGGC GGGCGAGTGG GCCAAGGCCA AGAAGCTGAT CGCCGAAGGC AAGGACATCG ACTACAAGGG CGCCGGCGGC GACCACGAGT TCGATGCCGC CGGCGACGTG CCGGGCAACT ACGCCTTCTT CAAGGTCAGC GGCAACAGGT ACGAGTCGAT CGCCGACATG AAGTGA
|
Protein sequence | MKKRTVAAVV GAMLGAAVVA APAHAQDVKL GLMAAISGPI AALAPPMAAA SKLAVAHVNE QGGILKGGKL EVVLGDSACN PQNATDVATK AVNIDRVIAV VGPACSGAVL ASANSVTIPA GVLMITPSGT SPEITKLKDK DLVYRTLPSD DYQGRALART LKARGISKVA VAYLNNDYGK GLAESFKSEF EANGGTIAGY SGHEEGKASY RSELATLARG GADTLVIFDY GDGTGLSILR QSLENNFFKT FVGADGMKSE GLVKAIGAAN LGGFFVSAPV GEASASLDNF NKAFKAAGEN IDAVFATTSY DAAFLAALAI EKAGGDKTKL AESLRAVATA PGEPILAGEW AKAKKLIAEG KDIDYKGAGG DHEFDAAGDV PGNYAFFKVS GNRYESIADM K
|
| |