Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1981 |
Symbol | |
ID | 3831163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2064899 |
End bp | 2065906 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637829912 |
Product | ABC transporter, substrate-binding protein, aliphatic sulphonates |
Protein accession | YP_430822 |
Protein GI | 83590813 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0169392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAAA AGGCCATTGG TTTAACGTTA ATAGCCTTAT TGATGGTAAC CAGCCTGGCC GGGTGCGGTA ACGGTAAGGG GGCGACAGCC AGCAGGGACG GAGAACCTGC GGCCATTAAA GTAGGAACCA ACCGCGCCCT GGGGACTGTT GTCCCTTATA TCGCCAGAAC CCGGGGGATA ATCGCCGCAA AAGGGCTAAA GGTTGATATC GTGGACTTCC AGGACGGATC CACACTGATG GAGGCCTTCG CTTCCGGGCA ACTGGATATC GCCTTTACCG GCGTCGCTCC CGCAGCTATC TGGCAGGGTA AGGGAGTACC TTTAAAGGTA GTTGCCTCGG CCAATGGCGG CGGGCATGTC CTGCTGACCA GAGAAGATGC CGGGATTAAA GACCTCTCGG AGTTAAAGGG AAAGAAAATA GCCGAGCCCA GGACGGGGAC GGTTAGCGAC ACCCTCCTCC GTAGCCGCAT CTTACAAGAT GAGGCCAAGC TGGACCCGGA GAAGAACGTC CAGCTCTTAC CCGGCATGGC GCCGGCCGAT ATGCCTGCGG CCCTGACTGT GTCCAAAGAG GTGGATGCGG TCCTTACCTG GGAACCCTTC GCTTCCCGGG CCGAAAGGGA GTTTAAAGGG ATCAGGGTGC TCTACGATGC CGCGGCGGAA TGGAAGAAGC AAAAGTCCGG CGCGGCCTAT TATCCGGTCA ACGTGGTCGT CGCCCGCCAG TCTTTTATTG ACCGGCATCC GGATGAATTG AAGAAATTCC TGGCCGCTTA CAAGGAAACC GTTGATTTTA TAAACAACCG TCCAGATGAA GCCAACGCCT TGATTGCCAG GGAATTAAAT CTTGATAAGG AGATTGTGGC CAGCGCCCGC CAGAGAATCG ATTATACCTG GCAGCTTGAC ATCCCGGCCA CCCTGGAAAC CTTAAAATGG TCGCAAAAAC TGGGTTATTT GCAGGAAATC CCTTCTCCTG GCAAGCTGTT TGACAGTAGT TATTTACCCA GGGAATAA
|
Protein sequence | MIKKAIGLTL IALLMVTSLA GCGNGKGATA SRDGEPAAIK VGTNRALGTV VPYIARTRGI IAAKGLKVDI VDFQDGSTLM EAFASGQLDI AFTGVAPAAI WQGKGVPLKV VASANGGGHV LLTREDAGIK DLSELKGKKI AEPRTGTVSD TLLRSRILQD EAKLDPEKNV QLLPGMAPAD MPAALTVSKE VDAVLTWEPF ASRAEREFKG IRVLYDAAAE WKKQKSGAAY YPVNVVVARQ SFIDRHPDEL KKFLAAYKET VDFINNRPDE ANALIARELN LDKEIVASAR QRIDYTWQLD IPATLETLKW SQKLGYLQEI PSPGKLFDSS YLPRE
|
| |