Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0097 |
Symbol | |
ID | 3832668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 94586 |
End bp | 95572 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828029 |
Product | NLPA lipoprotein |
Protein accession | YP_428979 |
Protein GI | 83588970 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA GGGGACTGGC CCTTCTTTTA CTCCTACTCT TCCTGGTCCC TGTCGTATCT GGCTGCGGCA GCCCGGCTAG CAGCGGGAGC AGCCAGGAAG AGAGCCTGAA GCTGGGGTTA ATCCCGGTGG AGGATAACTT CCCCTTTTTT GTCGCCGAGA AGGAAGGCCT CTTTACCAAA GCCGGCTTGA AGGTTGAACT GGTACCCTTT AATAGTGCCC GGGATCGCGA TCTGGCCCTG CAGTCCGGGA GTATCGACGG CGAGGTGGCC GATATTGTCG CTACGGCCCT GCTACGTAAA GGCGGAACGC CGGTGAAGAT CGTCTCCCTG ACCATGGGGG CCACCCCGGC CGAGGGACGC TTCGCCCTCC TGGCCCGGCC CGGGGCCGAT ATCAGCTCCC CCGGCCAGCT CAAAGGCCGG ACCGTCGGCA TCTCGGAAAA CACCATCATC GAGTATGTCG CTGACGGCCT CCTGCGGGAA GGAGGGGTAG ACCCCGGCTC CGTCCAGAAA GTCGCCGTAC CCCAGATCCC GGAACGGCTC CAGCTCTTGC TGGGTGGTAA GTTGGACGCC GCCCTGCTGC CTGATCCCTT TGCTTCCCTG GCCGCCAGGA AAGGGGCCAG GGTGATCCTG GACGATACGA AAATTAACCG CAATCTCTCC CAGGTGGTAC TTATCTTCCG GGAGGAAGCC ATCAAACATA AGACACCGGC TATTAAGAAG CTACTCCAGG TATATGCCGG GGCCGCGAGC TTGATTGCCC GGAACCCCTC CGCCTACCGG GAGCTATTTA TTGAAAAGGC CAGGATACCG GCGGAACTCC GGGACACCTA CCTGGCGCCC CAATACTCCC CGCCGCAACT GCCCCGGCAG GAGGAGGTCG CGGCGGTGAT GGACTGGATG GTGGCCAAAA AACTCCTGGC CGCACCCTAT AAATACGAAG AGCTGGTTGA CCCGGATTTG GTTAACCCCG GTGGGAACAA CCGGTGA
|
Protein sequence | MKIRGLALLL LLLFLVPVVS GCGSPASSGS SQEESLKLGL IPVEDNFPFF VAEKEGLFTK AGLKVELVPF NSARDRDLAL QSGSIDGEVA DIVATALLRK GGTPVKIVSL TMGATPAEGR FALLARPGAD ISSPGQLKGR TVGISENTII EYVADGLLRE GGVDPGSVQK VAVPQIPERL QLLLGGKLDA ALLPDPFASL AARKGARVIL DDTKINRNLS QVVLIFREEA IKHKTPAIKK LLQVYAGAAS LIARNPSAYR ELFIEKARIP AELRDTYLAP QYSPPQLPRQ EEVAAVMDWM VAKKLLAAPY KYEELVDPDL VNPGGNNR
|
| |