Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0886 |
Symbol | |
ID | 3831524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 917207 |
End bp | 918961 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637828816 |
Product | hypothetical protein |
Protein accession | YP_429746 |
Protein GI | 83589737 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTTG ATGGCCTTTT CCTGGCCGCC ATCAGCGCAG AACTATCCGG CCTGACAGGC AGCCGGGTGG ACCGCATCTT TCAACCGGAA AAGGAGACCG TTATCCTCCA CCTGCGTAAA GGTCGCGACA CCAGGAAGCT GCTTCTTTGC AGCCTTTCCG ACCAGGCCCG GGTCCACCTG ACGACGGCCA GTTTTACCAA TCCCCCCACC CCGCCCCTTT TCTGCATGGT CCTGCGCAAG CACCTGGAAG GGGGTATTTT GACGGCCGTC GAGCAGCCGG GCCTGGAACG GGTGCTGAAA CTCCACTTTA ACACCACCGA CGAGCTGGGA CGGCAGGCCC CCCGCTTGCT GTTAATTGAA ATCATGGGTA AGCACTCCAA CATCATCCTG CTCAACCCGG AAGGTAGCAT CATCGACGCC GCCCGCCGCT ACACCCATGC CGTCAGCCGC CACCGGGAGG TCCTGCCCGG CCGGCCCTAC GTCCCGCCAC CGGCCCAGGA CAAGGCCGAT CCCCGAAAAC TCGACGACGA GGCCTTCACT CGCCTCCTCT ATGAGGGTAA CTGGGGCGAT CCCCTGGAGC GTCTGCTGGT AAACAGGCTG GCCGGCGTGG GGCCGGAAAC GGCCCGGGAG ATTATCCACC GCGCCGGCCT GCCGGCCGGG ACGACCCTGG AGGGCTGCGG CGCGTATGAA GTGAACCGCC TCTACCAGGC CCTGGGGGAG GTGCTGGCGG CCACCGGCCC CGCCGCCTGG AAGCCGGAGG TCATCCTCCG GCCGGAGGGG GAACCCCTGG CCTTCGCCTC CTTTGAGCTC CACCAGTACC AGGGTTTGCC CCGGGAGCAC CCGGCCACTC CGGGCGCCGC CTGCGATTAC TTCTACTCCC TCCGCCGGGA GCACCAGCTC CTGGAAGGTA CCCGGCGAAG CCTGGAGCAT GTCCTGGAAA AGGAGTTAAA GCGCTGCCGC AAAAAGGAGG GCCTCCAGGC CGCCACCGTA GCCGAAGCCG CCGGGGCGGA GGAGTTCCGC CTGGCCGGGG AGCTCATCAC CGCCAATATC TACCGCATTA AAAAGGGCCA GGCCAGCCTG ACGGCGGCCA ATTTCTACGA CCCGGACGGC GAGCCCGTTA CCATCGAGCT CGACCCTTCC CGTACCCCGG CGGAAAACGC CCAGTGGTAC TTCAACCGCT ACAACAAGGC CAAGCACGCC GCCCGCCTGG CAGCCGCCCA GCTGGAACAA ACCAAGGCTG AAATAGCCTA CCTGGAGAGC ATCGCCCAGG CCGTCAGCAT GGCCGCCACC AGGGACGACC TGGAAGAGAT CCGCCGGGAA TTGCGCCAGG CCGGTTACCT GCCTGAGGAA AGGGACAAAC AAAAACCCGG TAAAAAGGCC GCTAAACCAG AAGCGCATCA GCCCTCCCGG CCCCTGGAGT TTACTTCCCC GGATGGTTTC AAGATCCTGG TGGGCAAAAA CAACCGCCAG AACGACTGGC TGACCCTGAA ACAGGCCGCG GATGGCGACC TCTGGCTCCA CGCCAAGGAT ATCCCAGGTT CCCACGTAAT TATCCGCACC GGGGGCCGGG AGGTGCCCGC TACCACCCTG GAAACGGCCG CCCGCCTGGC GGCCCGCTAC AGCCGCGCCG GCCAGTCCAG CCGGGTGCCG GTAGATTACA CCCTGGTGAA ACACGTCCGG AAGCCCCCCG GCGCCAGACC GGGAATGGTC ATCTACGACC ACCAGCGGAC GGTTTATGTC ACGCCGGCGG AGTAG
|
Protein sequence | MAFDGLFLAA ISAELSGLTG SRVDRIFQPE KETVILHLRK GRDTRKLLLC SLSDQARVHL TTASFTNPPT PPLFCMVLRK HLEGGILTAV EQPGLERVLK LHFNTTDELG RQAPRLLLIE IMGKHSNIIL LNPEGSIIDA ARRYTHAVSR HREVLPGRPY VPPPAQDKAD PRKLDDEAFT RLLYEGNWGD PLERLLVNRL AGVGPETARE IIHRAGLPAG TTLEGCGAYE VNRLYQALGE VLAATGPAAW KPEVILRPEG EPLAFASFEL HQYQGLPREH PATPGAACDY FYSLRREHQL LEGTRRSLEH VLEKELKRCR KKEGLQAATV AEAAGAEEFR LAGELITANI YRIKKGQASL TAANFYDPDG EPVTIELDPS RTPAENAQWY FNRYNKAKHA ARLAAAQLEQ TKAEIAYLES IAQAVSMAAT RDDLEEIRRE LRQAGYLPEE RDKQKPGKKA AKPEAHQPSR PLEFTSPDGF KILVGKNNRQ NDWLTLKQAA DGDLWLHAKD IPGSHVIIRT GGREVPATTL ETAARLAARY SRAGQSSRVP VDYTLVKHVR KPPGARPGMV IYDHQRTVYV TPAE
|
| |