Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0938 |
Symbol | |
ID | 3832939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 971736 |
End bp | 972959 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828869 |
Product | nucleoside recognition |
Protein accession | YP_429798 |
Protein GI | 83589789 |
COG category | [S] Function unknown |
COG ID | [COG3314] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02871] sporulation integral membrane protein YlbJ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000955347 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCAAC CTGTATTCAT AATTAGCCGG GGTATAACCC CCTTCCTAAC AGCAGTGGCC GTCGTTATCC TGGCCCTGGC TATCGTCCTT TTCCCCCAGC CCGTTTTTCA GGCTGCCCTG CGCGGTCTCC GGGCCTGGTG GGAAATCGTC GTCCCGGCCC TTTTACCTTT TTTTATTATT TCTCAATTAT TTATGGGCCT GGGTATCGTC CACTTTCTGG GCGTGCTCCT GGAGCCGGTT ATGCGGCCTC TCTTCAATGT CCCGGGCAGC GGCGCCTTTG TCATGGCCAT GGGGTACACT TCTGGCGCCC CCATCAGCGC CATCCTTACC TCCCAGTTAC GCCAGCAGCA GCTGGTAACC AGGGTTGAAG GGGAACGCTT AATCTGTTTC ACCAACAACG CCAGCCCCCT TTTTATGCTG GGGGCAGTAG CCGTGGGTAT GCTCCATAAC CCGGCCCTGG GCCCGGCCCT GGCGGGAGCC CATTATGGAG CCAACCTCTT CCTGGGAGTC CTATTTCGCT TCTACGGTCG ACGGGCACCG GCTTCACCGC CGGGGAACCA CCCCCTCCTA TCCCTGCCAC GGAGAGCTTG GCGGGCCATG ATTCAGGCCC AACAAAGGGA TGGCCGTTCC CTCGGCCAGC TCCTTGGGGA TGCCGTGAGC CACTCCTTCC AGACCCTGAT TACCATCGGC GGCTTTATAA CCCTCTTCAG CGTCATTATC CAGGTAGCCG GTATGCTGGG TATCCTGGAC CTCCTGGCCA GGTTGCTGCT TTATGCCGGC CATCCCCTGG GTTTAACCCC GGCAACAGCC GGGGCCCTGG CCAGCGGTAT CTTTGAAATG ACCATGGGGA CCAAGTTTGC CAGTGAAGCT CCCGTACCCC TTGGGGAGCA GCTCACTGCT ATTAGTATCA TCATGGGCTG GGCCGGGCTC TCCGTCCTGG GCCAGGTGGC TGCCATGACC AGCAAAACGG ATCTCCGCCT GGGTCCCTTT ATCCTGGCCC GCCTTCTCCA TGGTTTCCTG GCGGCCTTCA TGGTCCAACT CTTCCGGGGA CCAGCCCGGC CAGTCCTTGG TTGGCTGACA GGTAGCCATT TCCTGTCGCC CCCGGTATCA TGGCTTTCCC TGGGGGTCCA CTATACAGGG TTTACCCTCA CCCTGGCGGC CTTGTTATTG TTCCTGACCG TGCTGGGATT GTTCGCCCGC CTGACCCTTT ACCGGCGGTT TTGA
|
Protein sequence | MRQPVFIISR GITPFLTAVA VVILALAIVL FPQPVFQAAL RGLRAWWEIV VPALLPFFII SQLFMGLGIV HFLGVLLEPV MRPLFNVPGS GAFVMAMGYT SGAPISAILT SQLRQQQLVT RVEGERLICF TNNASPLFML GAVAVGMLHN PALGPALAGA HYGANLFLGV LFRFYGRRAP ASPPGNHPLL SLPRRAWRAM IQAQQRDGRS LGQLLGDAVS HSFQTLITIG GFITLFSVII QVAGMLGILD LLARLLLYAG HPLGLTPATA GALASGIFEM TMGTKFASEA PVPLGEQLTA ISIIMGWAGL SVLGQVAAMT SKTDLRLGPF ILARLLHGFL AAFMVQLFRG PARPVLGWLT GSHFLSPPVS WLSLGVHYTG FTLTLAALLL FLTVLGLFAR LTLYRRF
|
| |