Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1064 |
Symbol | |
ID | 3833329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1094371 |
End bp | 1095264 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828992 |
Product | dipicolinate synthase subunit A |
Protein accession | YP_429921 |
Protein GI | 83589912 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0373] Glutamyl-tRNA reductase |
TIGRFAM ID | [TIGR02853] dipicolinic acid synthetase, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.189136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00191942 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTGGGA TGCTAACAGG CATTAAAGTG GCAATGCTGG GGGGCGATGC CCGGGAAGCA ATCCTCCTGG AGGAATTGCT GCGGCAGGGA GCTGAGGTCC AGGCCTGCGG CCTACCAGAA GTACCAGGAG GGGGCGGTTA TGCCTGTTAT GATGATCCCC GCGCGACCGT CCGGGGAGTA CGGGTAGTCA TCCTGCCGGT ACCGGGGGTC AACTCCGAGG GCCGGATTCA CGCCCCCGGG AGCCAGCAGC CCCTTTATTT CAACCAGGAA CTGGCGGAAG CCATACCAGC AGGGACCCTG GTCCTGGTGG GAGTCGCCCG CCCCCTGCTC AAGGAAATGG CAATTACTGG GGGCTGGCAA CTAGTTGAAA CGGCAGATAG GGATGAAATG GCGATTTTGA ACTCTATTCC TACTGCCGAA GGAGCCTTGA TGCTGGCCAT GCAGGAGTTG CCTATTACCC TCCACGGTAG CCGGGCCTTT GTCCTGGGGC TGGGGCGGAC TGGTTTTACC CTGGCCCGCA TGTTGGCCGG AGTAGGGGCC CTGGTAACGG TGGTCGACCG GGGCGCGGCC GACCGGGCGC GGGCCTATGC CGAAGGGTGG CGGGCGGTAG CCTTTACTGA CCTGGCAGGA GTAATCGGGG AGGCAGATGT GATTTTTAAT ACTGTGCCCG CCCAGGTCCT GACAGCGTCC GTTTTGGCTG CCACCAGCCC GGGGGTCCTG ATTATTGACC TGGCATCGGC TCCCGGTGGG ATAGATTTTG CCGCTGCTAC CGCCATGAAA CGCCGGGCCA TGCTGGCACC CGGGCTACCG GGGAAGGTCG CTCCCCGGAC GGCGGGGCTT ATCCTGGCCC GCCTCTATCC TACCCTGATC TTAGAGTGCC TGGAACGGCT GTAA
|
Protein sequence | MGGMLTGIKV AMLGGDAREA ILLEELLRQG AEVQACGLPE VPGGGGYACY DDPRATVRGV RVVILPVPGV NSEGRIHAPG SQQPLYFNQE LAEAIPAGTL VLVGVARPLL KEMAITGGWQ LVETADRDEM AILNSIPTAE GALMLAMQEL PITLHGSRAF VLGLGRTGFT LARMLAGVGA LVTVVDRGAA DRARAYAEGW RAVAFTDLAG VIGEADVIFN TVPAQVLTAS VLAATSPGVL IIDLASAPGG IDFAAATAMK RRAMLAPGLP GKVAPRTAGL ILARLYPTLI LECLERL
|
| |