Gene Information |
Locus tag | Moth_0759 |
Symbol | |
ID | 3831472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 795503 |
End bp | 796705 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637828690 |
Product | hypothetical protein |
Protein accession | YP_429620 |
Protein GI | 83589611 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
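The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 795503
end_bp = 796705

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1203 bp) and Protein Length (400 aa).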
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.547594 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGACATA TTGTTATTTA TAAAGGTCAG TCGCAGTATG ATGTACTTCG TGTCTTTGTA GATCAGTTAG GTGAAGCTTT TAAATCCTTA GGTAAAGATG TTTATATCGT AGACCTTCTG GCCAGCAATG CCGGTCAGCA ATTACAGGAA GCTTTTAGCC AGCCCTGCGA GTTTGTTTTT GCTTTTAATG CCATGGGCAT AGATCTGAAA ATAGGCAGCA AATCTTTATA TGATTCCCTG GGCATACCTT TTATTGCTGC TTTAGTAGAC GATCCAGTTT ATCACTTGCA GCGTTTGGAA TATCCGGTAG AGAATTTGTT AATCGGATGT GTGGATCGTT CGCATATTAA TTTTGTTAAC AGTTATTATG GTAATCAGCG GACCTGCTTT TTCTTTCCTC ATGGAGGATG TAAAGCCAAG GATGTTATTG ATGGTAGAAG CGTTCAACAG GATGGCATTC GGGGAATAGA TATTTTGTTC GGGGGCTCAT ATCAAGACCC GGATAGTATA CGTAATATCT GGGTTAATCT AAATACCACT ATAGCAAGGC TTTTGGATGA AATAGTGGAT TATATCCTGG GTAAAGATTA TATAAACCTG GCGATTGCGG CTGAAAATGT TTTTGCTTCC AGAGGTATTT ATTTAAATAA CGAACTATCG AATAAACTTA TCTACCTTTT GCCTTTTGTC GATAAATATG TCCGCGCTTA TCGCCGGCGC CAGTGCTTGC AAATGCTTGC CGATTCCGCC TTAGAGGTCC ATGTGTATGG TGCCAACTGG GAAAATGCCC GGATTAATGC AAAAAACAAT ATCTTAATCC ATCAGCCAGT GGGCTTCTTA GAAATGCTCG GCTTAATGGA ACAGGCAAAG ATGGTGCTTA ATATTGAACC TAGTTTTGCA AATGGCGGCC ACGAGCGAGT GTTTTCGGCC ATGATAAATG GGGCAGTTAC ACTTTCCAAC ACTAATAGTT TTTACTCACA GGAATTCATG GATGGCGAAG ATATTATTCT TTATTCGTGG AGCAAACTCC ATGAATTACC GTCAAAAATT TACGGGCTCC TTGAAAACCC TGATAAAATG GAGGCCCTGA GATTGGCGGG TAAACGCATA GCGGAAGAAA GACATACCTG GGTTGTCAGG GCGAAAAGAA TCCTGGATGT TATCGAGACA TATAAGTCGT TAAAGAATCT TAGGGTATCA TAA
|
Protein sequence | MGHIVIYKGQ SQYDVLRVFV DQLGEAFKSL GKDVYIVDLL ASNAGQQLQE AFSQPCEFVF AFNAMGIDLK IGSKSLYDSL GIPFIAALVD DPVYHLQRLE YPVENLLIGC VDRSHINFVN SYYGNQRTCF FFPHGGCKAK DVIDGRSVQQ DGIRGIDILF GGSYQDPDSI RNIWVNLNTT IARLLDEIVD YILGKDYINL AIAAENVFAS RGIYLNNELS NKLIYLLPFV DKYVRAYRRR QCLQMLADSA LEVHVYGANW ENARINAKNN ILIHQPVGFL EMLGLMEQAK MVLNIEPSFA NGGHERVFSA MINGAVTLSN TNSFYSQEFM DGEDIILYSW SKLHELPSKI YGLLENPDKM EALRLAGKRI AEERHTWVVR AKRILDVIET YKSLKNLRVS
|
| |
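The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that check on the first and last codons of the record. The helper `translate` and the partial `START_CODONS` set are hypothetical names introduced for illustration, not part of any database tool.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code for
# elongation; it differs in which codons may initiate translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a partial list of table 11 start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = False) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the gene sequence in the record.
head = translate("TTGGGACATA TTGTTATTTA TAAAGGTCAG", is_cds_start=True)
tail = translate("AATCTTAGGGTATCATAA")
print(head, tail)
```

The two fragments should match the start (MGHIVIYKGQ) and the end (NLRVS) of the listed protein sequence.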