Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1126 |
Symbol | |
ID | 3833259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1154642 |
End bp | 1155496 |
Gene Length | 855 bp |
Protein Length | 284 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637829055 |
Product | extracellular solute-binding protein |
Protein accession | YP_429983 |
Protein GI | 83589974 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGGTT GGCAAAAAAT AGGCGGCGTG TTGCTACTGA TGCTTTTACT GGCCGGCCTT ACAGGGTGCG GTGGTAGTAA TCCCCAGGGT AACGGTGCTG GAGCAGGAAC CCAGGCAAAC AGTTCCAACA CCTCTACCGG CAAGCCCAGG TATTCTGTAG CCCTGGAAGC CACTTTTGCG CCCTTTGAGT TCCGGGATAT GAAAACTGGC GAGTTTACCG GTTTTGATAT TGACTTGATT AAGGCCATTG GCCAGGTGGA AGGTTTTGAT GTTGATATTA AAGAAATGGG TTTCGACGGC ATTATTACAG CCCTCCAGAC CAACAACGTT GACCTGGCTA TCTCGGGGAT AAGCATTGAC GATGAACGTA AAAAGGCAGT CGACTTTTCC CTACCCTACT ACCAATCGGG TTTAGTGGTC GCGGTTAAAG CTGATAATAA TACTATCAAG GGGTTCGACG ATTTAAAAGG TAAAAAAATA GCCGTTCAGA TTAATACCAC CAGTGCCAAG GAAGCGAAAA AAATCCCCGG CGCCATGGTT ACCGAACTGG ACAAAGTGCC TGATATGTTC CTGGAACTGA AAAACGGTGG CGTCGATGCC GTGGTTAACG ACCTTCCGGT GACGGCCTAT TATATTAAAC AGGGTAATAA TGACGTTAAG ATTGTCGGTG ATATCCGCTC GGCCGAATAC TACGGCATCG CCGTTCCTAA AGGTAAAACC GAGATCCTGC AAAAGATTAA TGATGGTCTA AAGGCCATCA AGGCTAATGG CCAGTATGAG GAACTCTATA AGAAATGGTT CGGCCAGGAA CCGCCCGCTT TCTTGCCGGG GGAACCACCC CAGCAAAAGT CCTGA
|
Protein sequence | MRGWQKIGGV LLLMLLLAGL TGCGGSNPQG NGAGAGTQAN SSNTSTGKPR YSVALEATFA PFEFRDMKTG EFTGFDIDLI KAIGQVEGFD VDIKEMGFDG IITALQTNNV DLAISGISID DERKKAVDFS LPYYQSGLVV AVKADNNTIK GFDDLKGKKI AVQINTTSAK EAKKIPGAMV TELDKVPDMF LELKNGGVDA VVNDLPVTAY YIKQGNNDVK IVGDIRSAEY YGIAVPKGKT EILQKINDGL KAIKANGQYE ELYKKWFGQE PPAFLPGEPP QQKS
|
| |