Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1652 |
Symbol | |
ID | 3830940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1686589 |
End bp | 1687767 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829577 |
Product | aminotransferase, class V |
Protein accession | YP_430497 |
Protein GI | 83590488 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.143138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00767612 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCAGGG TTTACCTTGA TCATAGCGCC ACAACTCCGG TAAGGCCCGA AGTCCTGGAG GCCATGTTAC CCTTTTTGAA GGATGAGGCC TTTGGTAATC CTTCCACCGT TTACAGCTAC GGCCGGGAAG CGAAAAAGGC CCTGGAGGAA GCCCGGGAAA AGGTGGCCAA CCTCATCGGC GCCCGGCCGG AGGAGATCTT CTTTACCAGC GGCGGCACGG AAGCCGACAA CCTGGCCCTT ATCGGTACGG CTGCGGCCAA TGAAAAGAAG GGCCGTCACA TTATTACCTC CAGCATCGAA CACCATGCCG TCCTGCACAC GGCCCAGTAC CTCCTGCGCC ACGGCTTTAA GGTAACCTTC CTGCCGGTGA CCCCGGAGGG CCTGGTGCGG GTGGAGGACG TCGAAAAGGC CATTACCGAT GAAACCATCC TCATCAGCGT CATGCATGTT AACAACGAAG TGGGTACCAT CCAACCCATC AAAGAAATAG GGAAACTGGC CCGGGAACGG GGGATCATCT TCCATACCGA CGCCGTCCAG AGCGTTGGCA AGCTCCCCGT TAATGTCGAC GAGCTGGGGG TGGACCTGCT GTCGGCCTCC GGGCACAAGA TTTATGGCCC CAAGGGCATC GGCTGCCTTT ATATCCGCAA GGGGACGAAG ATCAACCCCA TCCTTTACGG CGGTGCCCAG GAGCGTAAAC GTCGGCCTGG GACGGAGAAC ATGCCCGGTA TTGTCGGCTT TGGCCGGGCA GCCGAACTGG CCGGCCAGGA ACTGGAGAGC GAAATGGAGC GCCTGCAGGC CCTGCGGGAC AAGCTAATTG ACGGTATCTT GACACGTATT GAAGACGTCC AGCTGAACGG TGATCCGCGG CAGCGGGTGG CCACCAATGC CAACTTCAGC TTCCGCTATT GTGAGGGCGA ATCCATACTC CTGAGCCTGG ACATGAAGGG TATCTGCGCT TCCAGTGGTT CGGCCTGTAC CTCCGGTTCC CTGGACCCGT CCCACGTCCT CCTGGCCATG GGTATCCCCC ACGAAGTAGC CCATGGTTCG GTACGTATGA CCCTGGGCCG CGAAAATACA GAAGAAGATA TTGACTACGT CCTGGAAGTC ATGCCGGAGA TAATAGCCCG GTTGCGTTCC ATGTCACCCC TCTATGAGGA GGCCGCAGGG AAGAGGTAG
|
Protein sequence | MRRVYLDHSA TTPVRPEVLE AMLPFLKDEA FGNPSTVYSY GREAKKALEE AREKVANLIG ARPEEIFFTS GGTEADNLAL IGTAAANEKK GRHIITSSIE HHAVLHTAQY LLRHGFKVTF LPVTPEGLVR VEDVEKAITD ETILISVMHV NNEVGTIQPI KEIGKLARER GIIFHTDAVQ SVGKLPVNVD ELGVDLLSAS GHKIYGPKGI GCLYIRKGTK INPILYGGAQ ERKRRPGTEN MPGIVGFGRA AELAGQELES EMERLQALRD KLIDGILTRI EDVQLNGDPR QRVATNANFS FRYCEGESIL LSLDMKGICA SSGSACTSGS LDPSHVLLAM GIPHEVAHGS VRMTLGRENT EEDIDYVLEV MPEIIARLRS MSPLYEEAAG KR
|
| |