Gene Information |
Locus tag | Moth_1601 |
Symbol | |
ID | 3832747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1636385 |
End bp | 1637803 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829530 |
Product | sulfite reductase, dissimilatory-type alpha subunit |
Protein accession | YP_430450 |
Protein GI | 83590441 |
COG category | [C] Energy production and conversion |
COG ID | [COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits |
TIGRFAM ID | [TIGR02064] sulfite reductase, dissimilatory-type alpha subunit |
|
|
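The length fields in the table above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 1636385
END_BP = 1637803
GENE_LENGTH_BP = 1419
PROTEIN_LENGTH_AA = 472


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions both reported lengths agree with the coordinates: 1637803 - 1636385 + 1 = 1419 bp, and 1419 / 3 - 1 = 472 aa.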
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000022809 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTTA AGCCCAAGCG GCCGCAGAAG GACTTAAAGT ACGACGAATT GCGCATTTAT ACCGACGAAG AATTGCATAA CTACTCGGAA GAAGAACTAA AAAACTTTAA GCTCAAACAC GACATTCCCG ACCTGGACGA ACTGGAAAAG GGACCGTGGC CCAGCTTTGT CGCCGATGCC AAGCGGGAAG CCCTGCATCG CAAGAAGCTC GCCGATGACC GGCTTATGAT CGACAAGGAC GTAGTTGACG ATTTACTCGG ACAGCTGCAA TTATCCTTTG ACGACGGAGA AACCCACTGG AAGCACGGCG GTATCGTCGG CGTCTTCGGT TACGGCGGCG GCGTCATCGG CCGGTACTCG GACGTTCCCG AAAAATTCCC TTCGGTGGCC CAATTCCATA CCCTGCGGGT CAACCAACCG GCCAGCAAGT TCTATAAGAC TGATTTCTTG CGGGCCCTGG CCGACCTGTG GGAGTACCGC GGCAGCGGTA TGTTCAATCT GCATGGCTCC ACCGGGGACA TCATTCTCCT GGGAACCTCT ACCGAACAGC TGGAACCTAT CTTTTATGAT CTGACCCACG AGCTGGATCA GGACCTTGGC GGGTCCGGTT CCAACCTGCG GACACCTTCC TGCTGCATCG GCAAGGCCAG GTGCGAGTTT GCCTGCATCG ACACCCAGGA TTTATGTTAT GAGATAACCA CCCACTACCA GGATGAGCTG CACCGCCCGG CCTTCCCCTA CAAGTTTAAG ATTAAGGTCG ACGGTTGCCC CAACGGTTGC GTAGCTTCCA TTGCCCGTTC TGACATGTCC CTCATTGGCA CCTGGCGGGA CGATATCCGC ATTGACCAGG AGGCTGTACG GGCCTACATG GCCGGCGATA TTGAACCCAA CGGGGGCGCC CATAAGGGCC GCGATTGGGG CAAATTTGAT ATCCAGAAAG AGGTTATTGA TCTCTGCCCG ACTGGCTGTA TGGCCCTGGA AGACGGCCAG CTGAAAATCA ATAATAAAGA ATGCAACCGC TGCATGCACT GCATCAACGT CATGCCGCGA GCCCTGAAAC CGGGAAGGGA TACCGGCGTC AGCGTCCTCT TCGGGGCCAA GGCACCCATC CTGGAGGGCG CCCAGCTGGC GGTATTAACA ATACCCTTCA TGAAGGCCGA AGCGCCCTAC GATAATATTA AAGAGCTGGT TGAAAAGGTC TGGGATTGGT GGATGGAAGA GGGCAAAAAC CGTGAGCGCC TGGGCGAACT GATCCAGCGC AAGGGTTTAC CCAAGTTCCT GGAGGTTATC GGCGTACCGG CCGCACCCCA AATGGTTCGC CATCCCCGGA CCAATCCTTA TATCTTCTGG AAGGAAGAAG ACGTACCCGG CGGCTGGAAA CGCGATATCA ACGAATACCG GCAGCGGCAC AAGAGATAG
|
Protein sequence | MEFKPKRPQK DLKYDELRIY TDEELHNYSE EELKNFKLKH DIPDLDELEK GPWPSFVADA KREALHRKKL ADDRLMIDKD VVDDLLGQLQ LSFDDGETHW KHGGIVGVFG YGGGVIGRYS DVPEKFPSVA QFHTLRVNQP ASKFYKTDFL RALADLWEYR GSGMFNLHGS TGDIILLGTS TEQLEPIFYD LTHELDQDLG GSGSNLRTPS CCIGKARCEF ACIDTQDLCY EITTHYQDEL HRPAFPYKFK IKVDGCPNGC VASIARSDMS LIGTWRDDIR IDQEAVRAYM AGDIEPNGGA HKGRDWGKFD IQKEVIDLCP TGCMALEDGQ LKINNKECNR CMHCINVMPR ALKPGRDTGV SVLFGAKAPI LEGAQLAVLT IPFMKAEAPY DNIKELVEKV WDWWMEEGKN RERLGELIQR KGLPKFLEVI GVPAAPQMVR HPRTNPYIFW KEEDVPGGWK RDINEYRQRH KR
|
| |
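The relationship between the gene sequence and the protein sequence above can be sketched in code. This is an illustrative sketch only, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it assumes the sequence is given 5' to 3' in coding orientation, as the record shows it. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it allows; because this gene begins with ATG, a plain standard-code lookup is used here.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGAGTTTA AGCCCAAGCG GCCGCAGAAG"
print(translate(first_blocks))  # prints: MEFKPKRPQK
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after removing spaces) gives a quick consistency check, and `gc_percent` on the same input can be compared with the reported GC content.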