Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3194 |
Symbol | |
ID | 7874334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3476982 |
End bp | 3478229 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700123 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_002890166 |
Protein GI | 237653852 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.100454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCCT TCCTCGCGCT GCTCGGGCGC GCCTTCGTGA CGCTTCTGCT CCTGGCGCCG CAGTTGTCGG CACAGGCCGT CCCGGCAGAG GCCGTGGGCG CGGGCGCACG CCCGAGCGGC GGCGAGCGGG CACCGATCCG GGTGGGCGTG TCGGGGCCCT TCTCCGGTCC GTCGGCGCCG ATGGGGCTGT CGATGCGCGA GGGCATCCGC ATCGCCGCCG AGGAGCTCAA CGCCGGCGGC GGCCTGCTCG GTCGGCGCAT CGAGCTCGTC GAGCGCGACG ACGAGGCGAG CAACGAGCTT GGTGCGCAGA TCGTGCGCGA CTTCATCCAT CGCGAACGTG TCACCGCCGG ACTGGGCATC GTCAATACCG GTGTCGCGCT CGCCAGCCAG CGCCATTACC AGATGGCGCG CATCCCGGTG ATCACCTCGG TCGCCACCGG CTCGCTGATC ACCAAGCAGT TCCAGCCCCC CGATTTCACC GAGAACTACG TGTTCCGCGT CTCCGCCAGC GACACCTTGC AGGCCGCGGT GATCGTCGAG GAGGCTGTCG GCCGGCGCGG CCTGACCCGG CTCGCGATCC TGCACGACGC CACCAACTAC GGGGTGCTGG GAAGCCAGGA TCTCATCGCA GCCCTGGGCA CGCGCGGGCG GACCGCGGTG GTGGTCGAGC GCTTCCAATT GCGCGAGACC GACATGCGAC CGCAGCTCGA ACGCGCACGC GCCGCCGGTG CGCAAGCCGT GCTCACCTAT GGCATCGGCC CCGAGCTCGC CCACATCGCC AACTCGATGG CCCGCCTCGG ATGGCAGGTG CCGATCATCG GCAGCTGGAC GCTGGCCATG TCGAGCTTCA TCGAACTCGC CGGCCGCAAC GCCGAGGGCG CACGCATGCC ACAGACCTTC ATCGCCGAGG CACGCAGCCC GGCGCAGGCG GCCTTCCTCG CGGCGTGGGA GCGCGCCACC GGCAGCGTGC GTATCCCGGT GCCACCGGCG GCCGCACAGG GCTACGACTC GATGCAGCTG CTCGCCGCGG CGATCCGCCA GGCCGGCAGC CTCGACGGTC CGCGCATCCG CGAGGCGCTC GAAAACCTGG ATGCGGAGGT CGATGGCGTG ATCATGCGCT ATCGCCGCCC CTTCTCCCGC GACAACCACG AAACCCTGCG CAGCGCGCGC CAGATCCACC TCGGCGAGAT CCGCGACGGC GCCGTGGTGT TCGCCCACGA GGCGGCGCCG GAGAGGCCGG GACCATGA
|
Protein sequence | MKPFLALLGR AFVTLLLLAP QLSAQAVPAE AVGAGARPSG GERAPIRVGV SGPFSGPSAP MGLSMREGIR IAAEELNAGG GLLGRRIELV ERDDEASNEL GAQIVRDFIH RERVTAGLGI VNTGVALASQ RHYQMARIPV ITSVATGSLI TKQFQPPDFT ENYVFRVSAS DTLQAAVIVE EAVGRRGLTR LAILHDATNY GVLGSQDLIA ALGTRGRTAV VVERFQLRET DMRPQLERAR AAGAQAVLTY GIGPELAHIA NSMARLGWQV PIIGSWTLAM SSFIELAGRN AEGARMPQTF IAEARSPAQA AFLAAWERAT GSVRIPVPPA AAQGYDSMQL LAAAIRQAGS LDGPRIREAL ENLDAEVDGV IMRYRRPFSR DNHETLRSAR QIHLGEIRDG AVVFAHEAAP ERPGP
|
| |