Gene Information |
Locus tag | Moth_0449 |
Symbol | |
ID | 3830877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 449331 |
End bp | 450515 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637828384 |
Product | hypothetical protein |
Protein accession | YP_429323 |
Protein GI | 83589314 |
COG category | |
COG ID | |
TIGRFAM ID | |
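
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon, which matches the values in this record. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
START_BP = 449331
END_BP = 450515


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1185 394
```

Both results agree with the Gene Length and Protein Length rows of the record.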
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGCCCCGG ATACGCAACT GGCACATATT AACTTGAAGC CCAAATCCCA GGAGCCCTAC GCCCTGATAC TGGCCCTGGC GGCGGCTGCT ATATTCTGGC GGCTCCAGGG GCAGGGAATC GGCCTGGGTA CGATGTGGCT TTTTGGCCTG GCCCTGGGCT ATGTCCTCCA GCGCAGCCGC TTCTGCTTTG TAGCCTGTTT CCGTGACCCC TTTATCACCG GCAACACCTC CCTGAGCCGG GCCGTAGTCC TGGCTCTGGC GGTGGCTACC GCCGGGATGG CCCTGCTGGT CCTGGCCGGC GGTTCTCCGG TGGAAGTCTA CCCGGCCGGC TGGCATAACC TGGCCGGCGG TCTCCTCTTT GGAACAGGTA TGGTCCTGGC CGGGGGTTGT GCCAGCGGTA CCCTGATGCG GGCAGGAGAG GGACACCTGC TCCAGTGGCT GGCCCTGGCC GCCTTCATAA TGGGCTCCCT CTGGGGAGCC CATGACTTCG GTTGGTGGCA GCAGGTCAGC CTCAGTTGCT CCCCGGTGCT CTTTTTACCC CGGCTGGTTG GCTGGGGGCC GGCCCTGGTA CTCCAGTTGC TGATCCTGGG TCTGATTTAC CACGGCCTGT GGCGGATAGA AAAACAGGCC TTTCCAGATT TCAGCCCTCC AGGGAAGGGA CGGTATGCCT TCAAACTGCG CCACCTCTGG TCGCGGCCAT GGCCCTATTG GGCCGGCGGG GTCGTCCTGG CCGTCCTGGA CGTGGCCCTG GCCTGGTGCA CCGGCCGGCC CTGGGGGATT ACCACTGCTT TCAGTTACTG GGGGGCCTGG CTCTGGCAGG CCTTTACCGG CCACCCGCCC GGCTGGTACT ACTACTCTTT GCCGGAACAC ACCCGGGCCC TGGGCCTCGG TTTTCTGGCC GAACCCGGGA CCATCCTCAA CCTGGGGACC ATCTGGGGGG CCGGGTTGTC GGCCCTGGCA GCTTCTGAGT TTCGCCTCCA CCTGCCCCGG CGCTGGCAGG TAGTTCCCGC CGCCCTGGCC GGCGGCATAA TGATGGGTTA CGGCGCCAGG ATTGCCATGG GATGCAATAT CGGTGCCTTT TTCAATGGTA TAGCCTCCCT CTCTCTCCAC GGGTGGCTCT TTGGCCTGGG GCTGGCGGGG GGAGCCTACC TGGGGGGAAA ACTGCTGCTC CGCTTTCTCG TCTGA
Protein sequence | MAPDTQLAHI NLKPKSQEPY ALILALAAAA IFWRLQGQGI GLGTMWLFGL ALGYVLQRSR FCFVACFRDP FITGNTSLSR AVVLALAVAT AGMALLVLAG GSPVEVYPAG WHNLAGGLLF GTGMVLAGGC ASGTLMRAGE GHLLQWLALA AFIMGSLWGA HDFGWWQQVS LSCSPVLFLP RLVGWGPALV LQLLILGLIY HGLWRIEKQA FPDFSPPGKG RYAFKLRHLW SRPWPYWAGG VVLAVLDVAL AWCTGRPWGI TTAFSYWGAW LWQAFTGHPP GWYYYSLPEH TRALGLGFLA EPGTILNLGT IWGAGLSALA ASEFRLHLPR RWQVVPAALA GGIMMGYGAR IAMGCNIGAF FNGIASLSLH GWLFGLGLAG GAYLGGKLLL RFLV