Gene Information |
Locus tag | Moth_1332 |
Symbol | |
ID | 3831042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1378015 |
End bp | 1379322 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829268 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_430188 |
Protein GI | 83590179 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
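The coordinate, length, and protein-length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length counts the terminal stop codon; both assumptions match the numbers in this record but are not stated by the source. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final codon is the stop and encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the record for Moth_1332.
length_bp = gene_length(1378015, 1379322)
print(length_bp)                  # 1308
print(protein_length(length_bp))  # 435
```

Strand does not affect this calculation: a gene on the minus strand spans the same coordinates and only the reading direction changes.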
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000320048 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.085361 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCTTA TTATTCAACC CCCGCCATCC CTGGAAGGGA ATATCCAGCC GCCGGGGGAT AAATCCATAT CCCACCGGGC GGCCATCATC GGCGCCCTGG CCGAAGGTAC AACTGTCATC GACGGCTTCC TGGCGGGGGC TGACTGCCTC AGTACCTTGA ATTGCCTGCG CGCCCTGGGG GTAGACATTG CAGGACCGGA CGGGGGCCGG GTAGTGGTGC GGGGCGGGGG GTTAGATTCC CTTAAGGAGC CGGAGACGGT TCTCGATGCC GGCAATTCCG GGACGACCAT GCGCCTTCTC CTGGGTGTCC TGGCCGGGCA GCCCTTTTAC AGTGTAATTA ATGGCGATGA ATCCTTACGC CGACGACCCA TGGGCCGGGT TACGGGACCA CTGAGGTCGA TGGGGGCCGC AATCTGGGGC CGGCAGAATG GGGAACTGGC ACCCTTAAGT GTCCGGGGCG GCAAGTTAGA ACCCCTGGCG TACACCTTGC CTGTAGCCAG CGCCCAGGTT AAATCGGCCC TCCTCCTGGC CGGCCTTTTC ACCTCCGGGG AAACCATCGT TACCGAGCCC TGCCGCTCAC GGGACCACAG CGAAAGGATG CTCCGGGCCG CCGGGGCCGA CCTCCGGGTT GAGGGCTTGC GGGTGAGGAT CAGGGGCCGG AAACCCCTTA AACCTCTGAC CATTAACGTG CCCGGTGATA TCTCGGCAGC CGCCTTTTTC CTGGTGGCCG GTTGCCTGCA CCCCCGGGCC GCCCTTACAC TGGAAAAAGT AAACTTAAAT CCCACCAGGA CGGGCATTAT CGACGTCCTG ACGGCAATGG GGGCGCCACT TGAAATAATC CCCGGGGAAG AAACAGCTGG GGAACCTGCC GGGTCAATCC GGGTGACCTC CGGCAGACTT CACGGCCTGG AAGTGGGCGG GGCCATGATC CCCCGGTTAA TCGATGAAAT ACCGGTCCTG GCGGTGGCCG CCGCCCTGGC CGAAGGGGAA ACCCTTATCC GTGGCGCCGC CGAGCTAAAA GTGAAGGAAA GCGACCGCAT TGCCATGGTG GCCGGGGAAC TGGCCCGCAT GGGAGCGGAG ATTTATGCCC GGCCCGATGG TTTCCGGATC AAGGGGGTAC GCCGCCTTCG CGGGGCGGTG GTTGACAGTC ACGGGGATCA TCGCCTGGCC ATGGCCCTGG CGATGGCGGG CCTGGTGGCT GAAGGTACAA CAGAGGTCAG GGGAGCGGAA TGTATCGCTA TATCATATCC CGGCTTTACT GCCGATCTGG CCAGGCTGGG GGTTATGGTG AAGGAAGAAG ATGATTGA
|
Protein sequence | MRLIIQPPPS LEGNIQPPGD KSISHRAAII GALAEGTTVI DGFLAGADCL STLNCLRALG VDIAGPDGGR VVVRGGGLDS LKEPETVLDA GNSGTTMRLL LGVLAGQPFY SVINGDESLR RRPMGRVTGP LRSMGAAIWG RQNGELAPLS VRGGKLEPLA YTLPVASAQV KSALLLAGLF TSGETIVTEP CRSRDHSERM LRAAGADLRV EGLRVRIRGR KPLKPLTINV PGDISAAAFF LVAGCLHPRA ALTLEKVNLN PTRTGIIDVL TAMGAPLEII PGEETAGEPA GSIRVTSGRL HGLEVGGAMI PRLIDEIPVL AVAAALAEGE TLIRGAAELK VKESDRIAMV AGELARMGAE IYARPDGFRI KGVRRLRGAV VDSHGDHRLA MALAMAGLVA EGTTEVRGAE CIAISYPGFT ADLARLGVMV KEEDD
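The gene sequence is listed in coding orientation (it begins with ATG and ends with the stop codon TGA), so it can be translated directly to recover the protein sequence. The following is a minimal sketch under stated assumptions: it uses the standard amino-acid assignments, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits (not needed here, since this gene starts with ATG). The names `translate` and `gc_fraction` are hypothetical helpers, and the whitespace stripping accounts for the 10-base grouping used in the listing.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three groups and the final in-frame bases of the listed gene sequence.
print(translate("ATGCGGCTTA TTATTCAACC CCCGCCATCC"))  # MRLIIQPPPS
print(translate("AAGGAAGAAG ATGATTGA"))               # KEEDD
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared against the GC content field.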