Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1878 |
Symbol | |
ID | 3831222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1941224 |
End bp | 1942249 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637829810 |
Product | hypothetical protein |
Protein accession | YP_430721 |
Protein GI | 83590712 |
COG category | [S] Function unknown |
COG ID | [COG2855] Predicted membrane protein |
TIGRFAM ID | [TIGR00698] conserved hypothetical integral membrane protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000395194 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACAA GGAATCCAAC AATGACTGAA GGGAACCAGG GCTTGGGTCT TATCCAGGGG GTGGGCCTGA CTGTAATTCT TACCCTGGTT GCCAGGCAGC TTGCCATGTT ACCGATTCTA AAAATCATGG GCAGCATGGT TCTTGCTATC CTTCTGGGTG TTGCCTGGCG TTCCCTTATG GACATTCCAG CAACGGCAGA GGTGGGCATT AATTTCGCCA GCAAAAAAAT TCTCCGTTAT GGCATTATCC TCATGGGACT GCGTCTGGAT ATCCCTAAAA TTATTGCTGC CGGCCCGCAG GTAATTCTCC TTGACATCCT GGCCATCTTA GTGTCTATGG TAGTAATTAT TTTCCTGGGG CAGAGGATGG GGCTTAATAA AAAATTAGCC GCCCTCATAG CTGCCGGGAC GGGTATTTGC GGGGCAGCAG CCATTGCCGC CATAGCCCCG ATAGTCAGGT CCCGGGATGA TGAAACTGCC GTGGCGGTGG CTATTGTCGC CCTGCTGGGA ACCCTATTTA CGATTCTTTA CACCCTGCTT TACCCGGTAC TTAATTTAAC TTCCTTCCAG TATGGTTTAT TATCCGGCAG CAGCCTACAT GAACTGGCCC ATGTGATTGC GGCAGCCCAG GCCGGGGGCA GCGCCAGCGC TGATATCGCT ATCCTGGTAA AACTAGGGCG AGTGGCCTTC CTGGTGCCGG TAGCTCTTGT GCTAGGATTA ATTTTTGCTC GTCAAAATGA AACTGGCGCC GGCTGGCATT GGCGCCAGCT CCAGGTGCCA TGGTTCATTT TGGGTTTCCT GGTTTTTAGC GGTATCAACA CCATGGCTAT TCTGTCAACT CCCCTTATCG CATTTTTGAT CCAGGTCGGT GTTTTTCTCC TGACCGTGGC TATGGCCGGC CTGGGCCTTA ACGTAAGCCT GGAAATGATC AAAAAGGTCG GCAGCCGGGG CCTGGTAACC GGTTTGCTGG GTTCCGTTGT CCTTAGCTTA ACTATCTTTC TGGTTATTGC CAGTTTGATT AATTAA
|
Protein sequence | MSTRNPTMTE GNQGLGLIQG VGLTVILTLV ARQLAMLPIL KIMGSMVLAI LLGVAWRSLM DIPATAEVGI NFASKKILRY GIILMGLRLD IPKIIAAGPQ VILLDILAIL VSMVVIIFLG QRMGLNKKLA ALIAAGTGIC GAAAIAAIAP IVRSRDDETA VAVAIVALLG TLFTILYTLL YPVLNLTSFQ YGLLSGSSLH ELAHVIAAAQ AGGSASADIA ILVKLGRVAF LVPVALVLGL IFARQNETGA GWHWRQLQVP WFILGFLVFS GINTMAILST PLIAFLIQVG VFLLTVAMAG LGLNVSLEMI KKVGSRGLVT GLLGSVVLSL TIFLVIASLI N
|
| |