Gene Information |
Locus tag | Moth_1929 |
Symbol | |
ID | 3830853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2001953 |
End bp | 2003881 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829861 |
Product | hypothetical protein |
Protein accession | YP_430771 |
Protein GI | 83590762 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
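The coordinates and lengths listed above appear mutually consistent if one assumes 1-based inclusive coordinates and a gene length that counts the terminal stop codon. A minimal Python sketch of that check follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table (Start bp, End bp).
length = gene_length_bp(2001953, 2003881)
print(length)                     # 1929
print(protein_length_aa(length))  # 642
```

Strand does not affect this arithmetic: for a gene on the minus strand, Start bp and End bp still bound the same interval on the replicon.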
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.943643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0112851 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCTGGT TTCTTCTGGG CACACTGGCC CTGCTCACCG GGTCCTGGCT CATGGCGGCC CGGATGGCGT TGAAAAAAAC TCTCGACCGC TGGCTGGCCG TAGGGGTATT GACCATGGCC GGGCAGGTAC TGATCCTCCT GCTCGCCGGT CAAGCCCTGC GGCATCTTAA CCGCGGCGTG GTATTGCTCC TGGCCCTGGG CTGGGCGGGG CTCGGGTTGC TGATCCGGAA CAGGCAGGGG AGTAGATCCG GTGAGATTTT CTCCTTTGAA AGCGGGGCGG CCGGGGATAT AGGAGAACCC GGGAACGCCG AGATTTTTTC CAGTTCCCTC ATCGCAGTAA CCGGTGTCAT TCTGGCCTTT AGCCTGGCAG CCCTGGTCCT GGGGTGGATT TACCTGCCGC CCTTCGCCTG GGATGAGATC TGGTATCACC TTACTCCTAT GGCCGCCTGG TTCAAACAGG GGGCCATTAC CCGGCTGCCG GAGGCCCTAC TCTGGCAGCA CTACGATCCC ACCCGGGTAA CCTCCAGAAA GCTCGCCCTG GATTTAAGCG TGGCCTTTAA TTGGGCCAAT GTTTACCCCC TTAACGCCGA ATTAAATGCC CTATGGATCA TGGTCCTGAC GGGTAACGAC CTCCTGGTGG ATGCCACCCA GTTCCCTTAT GTTGTAATCG GCGCCCTGGC CACCTTCGGC CTGAGCAGGT CAGGGGGTGC CGGACGGAAT GCATCGATTC TGGCTTCTAT GCTCTTCCTT TTGACCCCCA TGGTTCTGAT CCACCTGCGG GTAGCCTATG TCGACGCCGC CTTCGGCTCT ATGGTAGCCG CCGCCCTTTA TCTTTTTTTA AGGTGGCAGG AGGAGCGCAA TCTGGAGTAT GCCCTGATCC TGGGCCTGGC CATTGGGCTG ATGATGGGGA TTAAAGCTAC GGGCATCGCC TTTGCCGGTG TCTTCGTCCT GGCTATACTG GCCTCAGGCT TCTGGCAGTA CCGGCAAGGG GTATCCGGCA ACAGAGCTCT CTGGCTCCAG ATAACAGTGA TTTTATTGTC CCTGATGGCC ACGGGCACTT TTTGGTACTT GCGAACCTGG TGGTTTTACG GTAATCCCGT TTACCCCGTA GAGATTCAAG TGCTGGGATG GAAGCTTCCC GGCATGGGCA GTGTCTCCAG GTTATTTATG GCCGCGAATA CACCGGAAGC CTACCGGGGC CGCAACATCA TCCTCAATAT CCTTACTTCC TGGTTGGAGC TTGGTAACGA ATCCTACAAT TATTATTCCC GCACCCGCGG CCTGGGACCG GCCTGGGCGG CCCTGGCCCT CCCCGCTATC CTACCCTTCG CATTTTCAGC CTGGCGCCGG CGCCGGGCCC CGGTACTCTG GATGGTGGGC CTGACCCTTA TTTTTTTAAT CCTCCAGCCG GCTGCCTGGT GGCCGCGCTA TGTCCTTTAT GTAGTGCCGG TGGGACTGGC GGCTATGGCC TGGGTTTATG ACCGCCTCGA CCCCAGGTTG AAAATGGCTG TGGCCGGAAT CATGATTCTG AACCTGGTGG TGGCCACCGG TTTGACTCTG GTAGAAACCC TGGATAAGCT ACCTGCAGCC ATGCGCCTGG ACCCAGCTCA CCGGACCTTT GGGGAACTGT ACTTCAGTGA CTATGCCTGG GTGGACCAGC TGCCTCCCAG CCGGATTGGT TATACGCCTA TGGCCTGGAT CTACCCCCTG TATGGCGGCC TGCGTAACCA GGTGGAGCTG GTCGATGGAA CCACGGCGGA GTCCTGGCGG GAGGCCATCA TCTGTCAGGG CCTGGATTTT 
GTTGTTACCA ACACCCAGTA TGGCGACTAC GATAGATGGG CCCTTTCACT GCCTGATTTA TTAACACCCT ACAGGAAAGG GGAGATGATT AATGTCTACC GGGTCAGGCC CGGCCGGGGA CGACAATGA
|
Protein sequence | MLWFLLGTLA LLTGSWLMAA RMALKKTLDR WLAVGVLTMA GQVLILLLAG QALRHLNRGV VLLLALGWAG LGLLIRNRQG SRSGEIFSFE SGAAGDIGEP GNAEIFSSSL IAVTGVILAF SLAALVLGWI YLPPFAWDEI WYHLTPMAAW FKQGAITRLP EALLWQHYDP TRVTSRKLAL DLSVAFNWAN VYPLNAELNA LWIMVLTGND LLVDATQFPY VVIGALATFG LSRSGGAGRN ASILASMLFL LTPMVLIHLR VAYVDAAFGS MVAAALYLFL RWQEERNLEY ALILGLAIGL MMGIKATGIA FAGVFVLAIL ASGFWQYRQG VSGNRALWLQ ITVILLSLMA TGTFWYLRTW WFYGNPVYPV EIQVLGWKLP GMGSVSRLFM AANTPEAYRG RNIILNILTS WLELGNESYN YYSRTRGLGP AWAALALPAI LPFAFSAWRR RRAPVLWMVG LTLIFLILQP AAWWPRYVLY VVPVGLAAMA WVYDRLDPRL KMAVAGIMIL NLVVATGLTL VETLDKLPAA MRLDPAHRTF GELYFSDYAW VDQLPPSRIG YTPMAWIYPL YGGLRNQVEL VDGTTAESWR EAIICQGLDF VVTNTQYGDY DRWALSLPDL LTPYRKGEMI NVYRVRPGRG RQ
|
| |