Gene Moth_1332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1332 
Symbol 
ID3831042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1378015 
End bp1379322 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content63% 
IMG OID637829268 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_430188 
Protein GI83590179 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000320048 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.085361 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTTA TTATTCAACC CCCGCCATCC CTGGAAGGGA ATATCCAGCC GCCGGGGGAT 
AAATCCATAT CCCACCGGGC GGCCATCATC GGCGCCCTGG CCGAAGGTAC AACTGTCATC
GACGGCTTCC TGGCGGGGGC TGACTGCCTC AGTACCTTGA ATTGCCTGCG CGCCCTGGGG
GTAGACATTG CAGGACCGGA CGGGGGCCGG GTAGTGGTGC GGGGCGGGGG GTTAGATTCC
CTTAAGGAGC CGGAGACGGT TCTCGATGCC GGCAATTCCG GGACGACCAT GCGCCTTCTC
CTGGGTGTCC TGGCCGGGCA GCCCTTTTAC AGTGTAATTA ATGGCGATGA ATCCTTACGC
CGACGACCCA TGGGCCGGGT TACGGGACCA CTGAGGTCGA TGGGGGCCGC AATCTGGGGC
CGGCAGAATG GGGAACTGGC ACCCTTAAGT GTCCGGGGCG GCAAGTTAGA ACCCCTGGCG
TACACCTTGC CTGTAGCCAG CGCCCAGGTT AAATCGGCCC TCCTCCTGGC CGGCCTTTTC
ACCTCCGGGG AAACCATCGT TACCGAGCCC TGCCGCTCAC GGGACCACAG CGAAAGGATG
CTCCGGGCCG CCGGGGCCGA CCTCCGGGTT GAGGGCTTGC GGGTGAGGAT CAGGGGCCGG
AAACCCCTTA AACCTCTGAC CATTAACGTG CCCGGTGATA TCTCGGCAGC CGCCTTTTTC
CTGGTGGCCG GTTGCCTGCA CCCCCGGGCC GCCCTTACAC TGGAAAAAGT AAACTTAAAT
CCCACCAGGA CGGGCATTAT CGACGTCCTG ACGGCAATGG GGGCGCCACT TGAAATAATC
CCCGGGGAAG AAACAGCTGG GGAACCTGCC GGGTCAATCC GGGTGACCTC CGGCAGACTT
CACGGCCTGG AAGTGGGCGG GGCCATGATC CCCCGGTTAA TCGATGAAAT ACCGGTCCTG
GCGGTGGCCG CCGCCCTGGC CGAAGGGGAA ACCCTTATCC GTGGCGCCGC CGAGCTAAAA
GTGAAGGAAA GCGACCGCAT TGCCATGGTG GCCGGGGAAC TGGCCCGCAT GGGAGCGGAG
ATTTATGCCC GGCCCGATGG TTTCCGGATC AAGGGGGTAC GCCGCCTTCG CGGGGCGGTG
GTTGACAGTC ACGGGGATCA TCGCCTGGCC ATGGCCCTGG CGATGGCGGG CCTGGTGGCT
GAAGGTACAA CAGAGGTCAG GGGAGCGGAA TGTATCGCTA TATCATATCC CGGCTTTACT
GCCGATCTGG CCAGGCTGGG GGTTATGGTG AAGGAAGAAG ATGATTGA
 
Protein sequence
MRLIIQPPPS LEGNIQPPGD KSISHRAAII GALAEGTTVI DGFLAGADCL STLNCLRALG 
VDIAGPDGGR VVVRGGGLDS LKEPETVLDA GNSGTTMRLL LGVLAGQPFY SVINGDESLR
RRPMGRVTGP LRSMGAAIWG RQNGELAPLS VRGGKLEPLA YTLPVASAQV KSALLLAGLF
TSGETIVTEP CRSRDHSERM LRAAGADLRV EGLRVRIRGR KPLKPLTINV PGDISAAAFF
LVAGCLHPRA ALTLEKVNLN PTRTGIIDVL TAMGAPLEII PGEETAGEPA GSIRVTSGRL
HGLEVGGAMI PRLIDEIPVL AVAAALAEGE TLIRGAAELK VKESDRIAMV AGELARMGAE
IYARPDGFRI KGVRRLRGAV VDSHGDHRLA MALAMAGLVA EGTTEVRGAE CIAISYPGFT
ADLARLGVMV KEEDD