Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1765 |
Symbol | |
ID | 3831057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1820654 |
End bp | 1822066 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829690 |
Product | hypothetical protein |
Protein accession | YP_430609 |
Protein GI | 83590600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA TCCTGCCGGA ATTTTTACGC TGGCAGCCGG GGGCGGAGCA GTGGGCCGTC CCGGTCAGCG CCTGGGATAC CGGCCCCACC AGCTACCTGA GGTTGCGGCG GGTATTGCTG GACGGCCGCC CGGTCGCCGG GCGTAATATT ATCTTTCCTG GGGTAATGCC GGTTTACCTC CTGCCTGCCG GGGCCCGGAC CTCCGACCCG GCGGCTTACT TAGCCACCCT GGGAAGAGAA GATAAGCGCT ACGCCTGTTC GTGGTTGATC CTGAAGACGG CTGGTCTGTC CGCTCTGCAG GTTACCGGCG GTGAACACTC CCTGGAACTG GAGTTTATCA CCTTTGCAGG TATGACCTGC CGCGCCGCCA CCACCATCCT GCTGGCACCC CTGCCAGCCC ATCCCCACTG GCAACCGGTG GAGCTGCAAA TGCATAACTC CAGCCACGAC GACGGCCACT GGTCGCCGGC AGCAGTAGTA AAGGAACTTG TTGGGCGAGG CTACCGGGCC CTGTATTTCA CTCCCCACGC TGATTTGATC GCCGGTTTCT GGGAAGAGTT CGCCGGCCTC TGCCGGGATT TATCAGGCAC CATTGCCGTC TTCCCGGGCC TGGAACTAGC TACCAGGAAT AGCGCCGGGC ACCTCCTTAT TTATGGCTTG ACGGCCCTAG AAGGCTGGCA GAATGCCAGG AACCCCGGCC AGGTGATTAT TGACCGGGTT AACCGCTTAC CTGGCCATAC CGCCTCGGTA ACCGTATCCC ACCCCTTCGG TCGGCCTCCC TGGCCCTGGG AGGACGAGCC GGTGGTGGAT TACAGTGGTC TGGAGGTCTT TTCCGGCCTG CAGTGGTACT TCGACCTGGA GTCGCGACCC CTGCAACTGT GGCGCAGGGA GGTGGCGCGC CTGTCCGGCA GGGTTTTCTT AACTGGCTAT TTGCCTTCGG CCCGAGCCGG TAGCGACTGG CATCAAGTAC TCCCCTATCA GGGCTATGTA ACTTACGTCT ACCTACCCGA CAGCTGGGCG GGCCTACCCT GGCAGGAACA GAAATACTTC CTGGACCTGG CCCTGCGGCG GGGGTATACT GTGGCCAGCC GCCGCGGGGG TCTGGCTTAT TTTTTAATTA ACGGCCAGCC GCCGGGGACT TCAGTCACCC TGCCGCCGGG TGCTATTTTG GAGATCAAAA TCTACTGGCA GGGGGTAGTA GAGGGCGATT ACCAGTTTTT GCTTTTCCAG GGCTATAAAA ATATGGGAAA AGCCATCTGG CAGGCGGAAA CCAGAGGCGC TGGAGGGGGG CGCAGGCCTG CTTGGAAAGT TGAACTGGCG GCACCGGGAG AAACATCCTA TTACTGGCTC TATGTGTCCG GGCCGGATCA GGTCCTAACC TCACCGGTTT TTTTAAGACC GGCGAGGCGT TAG
|
Protein sequence | MKIILPEFLR WQPGAEQWAV PVSAWDTGPT SYLRLRRVLL DGRPVAGRNI IFPGVMPVYL LPAGARTSDP AAYLATLGRE DKRYACSWLI LKTAGLSALQ VTGGEHSLEL EFITFAGMTC RAATTILLAP LPAHPHWQPV ELQMHNSSHD DGHWSPAAVV KELVGRGYRA LYFTPHADLI AGFWEEFAGL CRDLSGTIAV FPGLELATRN SAGHLLIYGL TALEGWQNAR NPGQVIIDRV NRLPGHTASV TVSHPFGRPP WPWEDEPVVD YSGLEVFSGL QWYFDLESRP LQLWRREVAR LSGRVFLTGY LPSARAGSDW HQVLPYQGYV TYVYLPDSWA GLPWQEQKYF LDLALRRGYT VASRRGGLAY FLINGQPPGT SVTLPPGAIL EIKIYWQGVV EGDYQFLLFQ GYKNMGKAIW QAETRGAGGG RRPAWKVELA APGETSYYWL YVSGPDQVLT SPVFLRPARR
|
| |