Gene Information |
Locus tag | Tmz1t_3549 |
Symbol | |
ID | 7873055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3889534 |
End bp | 3890493 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700490 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_002890520 |
Protein GI | 237654206 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
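The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Tmz1t_3549.
length = gene_length(3889534, 3890493)
print(length)                  # 960
print(protein_length(length))  # 319
```

Both results match the reported Gene Length (960 bp) and Protein Length (319 aa).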
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG AATCCCCCAC GCCCACCACC GCTGCCGCAC CGCGCGGTGC GCTCGCACGC CTGCTCGACA GCGACCTCCT GCACGGCTTC CTCCGCTCGC CGATCACCGT GCTCTCGGCC TTGATCGTGC TGGCGATCCT GATCGCCGCG TTGCTCGCGC CGGTGATCGC GCCGCAGAAC CCCTTCGATC CCGCCACGCT GAACCTGATG GACGGCTTCA GCCGCCCCAT GCAGGCCAAC GCGTTCACCG GCAACGTGTA TTGGCTGGGC ACCGACGCGC AGGGGCGCGA CCTCTTCTCG GCCATCCTCT ACGGCTCGCG GGTGTCGCTG CTGGTCGGTT TCGCGGCGGT GGCCTTCGCC GCGGTGCTCG GCATCGCGCT CGGGCTGATC GCCGGCTATC GCGGCGGCTG GGTGGACAGC CTGATCATGC GCATCGCCGA CGTGCAGCTG AGCTTCCCGG CGATCCTGGT GGCGCTGCTG ATCTTCGGCG TGTTCAAGGG CGTGGTGCCG CCGGCGCTGC ACGACAAGGC GGCGATCTAC GTGCTGATCC TGGCGATCGG CCTGTCGGAC TGGGTGCAGT ACGCGCGCAC GGTGCGCAGC TCCACGCTGG CCGAGCGCAA CAAGGAATAC GTGCAGGCCG CGCGCGTGAT CGGCGTGCCG GCGCCCACCA TCCTGCTGCG CCACATCCTG CCCAACGTGA TGGGGCCGGT GCTGGTGATC GCCACCATCG GCCTCGCGCT GGCGATCATC CTGGAGTCGA CGCTGTCCTT CCTCGGCGTG GGCGTGCCGC CCACCCAGCC CAGCCTCGGC ACCCTGATCC GCGTCGGCCA GGACTACCTG TTCTCGGGCG AGTGGTGGAT CGTGTTCTTC CCCGGGCTGA CGCTGCTGCT GCTCGCGCTG TCGGTGAACC TGCTCGGCGA CTGGCTGCGC GACGCGCTCG ACCCGAGGCT GCGCCGATGA
|
Protein sequence | MSTESPTPTT AAAPRGALAR LLDSDLLHGF LRSPITVLSA LIVLAILIAA LLAPVIAPQN PFDPATLNLM DGFSRPMQAN AFTGNVYWLG TDAQGRDLFS AILYGSRVSL LVGFAAVAFA AVLGIALGLI AGYRGGWVDS LIMRIADVQL SFPAILVALL IFGVFKGVVP PALHDKAAIY VLILAIGLSD WVQYARTVRS STLAERNKEY VQAARVIGVP APTILLRHIL PNVMGPVLVI ATIGLALAII LESTLSFLGV GVPPTQPSLG TLIRVGQDYL FSGEWWIVFF PGLTLLLLAL SVNLLGDWLR DALDPRLRR
|
| |
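As a rough illustration of how the sequence fields relate to each other, the sketch below computes GC content and translates a nucleotide sequence. It rests on some assumptions: the gene sequence is given 5' to 3' on the coding strand, the spaces are only 10-base display grouping, and the codon assignments of translation table 11 are used (alternative start codons, which table 11 allows, are not handled because this gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon assignments for translation table 11, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence in the record.
print(translate("ATGAGCACCG AATCCCCCAC GCCCACCACC"))  # MSTESPTPTT
print(translate("GACCCGAGGC TGCGCCGATG A"))           # DPRLRR
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a figure that can be compared with the reported GC content.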