Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0699 |
Symbol | |
ID | 3832700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 729145 |
End bp | 730221 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637828631 |
Product | ABC transporter substrate-binding protein |
Protein accession | YP_429561 |
Protein GI | 83589552 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00000602744 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000163268 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCATTT ATGAAAAAGG TATGAATCGG AGGGAGTTTA TTGTCAAAAG TCTGATGGCG AGTGGGTTGG TAGCAGGTAG CTCTCTGTTG CTGAGCGGGT GCTCATCGGG CTCAACGGGT ACTACTGCAA AAGAAGGCAA ACGCTTGAAG GCGGCATTCT CCAATGCTGG ACTCCAAGCC ACCTGGTGCG CCCAGGGCAA GGATACTGTG GAACGTTGGG GTAAATGGTT GGGAGTAGAT ATTACCTGGT ATGACGGCGC TCTTAGTGTA GACAAGCAGC GTGCGGCCGT AGAAGATATG GCAACTAAAG ACTGGGATTT TGTCGCTATC CAACCATTGG GTATCGGTAC TTTAAATGAG CCGGTCAAAA AGATGCTGGA ACGCGGCATC CCCGTAATCG ATATGGATAC CATGATCGCT CAACCGGGTG AATTGCCTAT CACCTGCTTT ATCGCTCCGG ATAATGTGTG GGGAGCTGAA CAGGTAACCG AAGCCCTGAT GCAGGCTATT GGTGGGAAAG GTAATGTGGT AATGACTCAA GGGTCTTTAG GGCATACAGG AGCACAAGGC CGCGCCCAAG GCTTCCATAA TGTCATAAAA CGTTATCCTG ATGTTAAAGT GATAGATGAA ACGCCAGCCG ACTTCGACGT CAATAAAGTT GCCCAGATTT GGGAAAATTT GCTTAATCGT TACGATAAAA TTGATGCCGC GTATTTTCAT AACGACGATA TGGCCCTGGC AGCCTACCAG GTTATTAAAA ATGCCGGCAG GGAAAAAGAA ATTAAAATTG GTGGTAATGA TGGTATGCAA CCGGCAGTAG AAGCCGTCCA AAAGGGTATT ATGGTTGCTA CGGCCCGCAA TTCGGCACCA CGTATTCACT GGGGTGCATT GATGATTGGG TATTATGCTG CTACTGAAAA AGATGCGAAT AAAAAAATTC CACCCTTTAT TCTGGCTGAT GGTCCAATTA TCACCTACAA TGTAGACCAG AGTAACAAGC AACCTTGGCT CAACAAGGGC TACGGCCAGT CTCTCGCTCC TGGCCTGATC TGGCAAGAAG ATCACTTTAT GGTCTAA
|
Protein sequence | MSIYEKGMNR REFIVKSLMA SGLVAGSSLL LSGCSSGSTG TTAKEGKRLK AAFSNAGLQA TWCAQGKDTV ERWGKWLGVD ITWYDGALSV DKQRAAVEDM ATKDWDFVAI QPLGIGTLNE PVKKMLERGI PVIDMDTMIA QPGELPITCF IAPDNVWGAE QVTEALMQAI GGKGNVVMTQ GSLGHTGAQG RAQGFHNVIK RYPDVKVIDE TPADFDVNKV AQIWENLLNR YDKIDAAYFH NDDMALAAYQ VIKNAGREKE IKIGGNDGMQ PAVEAVQKGI MVATARNSAP RIHWGALMIG YYAATEKDAN KKIPPFILAD GPIITYNVDQ SNKQPWLNKG YGQSLAPGLI WQEDHFMV
|
| |