Gene Athe_0843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0843 
Symbol 
ID7407418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp934389 
End bp935690 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content36% 
IMG OID643715221 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_002572731 
Protein GI222528849 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0143924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATGTTA AGATTGATGG AAGAAGAAAA ATAAACTCAA ATGTAATTGT TCCGCCTGAC 
AAGTCAATAT CCCACAGAAG TATCATGATT GGAAGTTTGG CAAATGGTGT GACAGAAATA
GAAAACTTTC TTTTTTCAGA CGATTGCTTG GCGACCATCA ATTGTTTCAA AAATCTTAGC
ACTGACATAG AAATTCGAAA TGACAAAATA ATTGTTAAAG GAAATGGCTT TGCTCTTAGT
GCTCCAAAAC AGATACTGGA CTGTCAAAAT TCAGGAACAA CCACAAGACT TCTCCTTGGT
ATTTTGTCAA CCCAGGAATT TGAATCTATT TTGACAGGTG ACAGCTCCCT TAAAAAAAGA
CCTATGAAAA GGGTCACAGT ACCTCTTTCT CAAATGGGAG CTGAGTTTGA GTTTTTAGAA
AAAGAAGATT TCTTGCCTAT TAAGGTAAAA GGCAGCAAAA AATTAAAACC GATTGAATAT
ACCTTACCTA TTCCAAGTGC GCAGGTAAAA TCTGCATTGA TTTTTGCGTC TTTAAAAGCT
GAAGGCAAAA GTGTCATAAA AGAAAGTCCT AAGTCAAGAG ATCACACAGA GCTTATGTTA
AAGCATGCAG GAGCAAATAT AAAAAGCTGG GAAAAAGATG GGGTATATAC AGTAGAGATA
CTGCCGAGTC AAATTTCCAG TATAAAGATA AAAATTCCAT CAGATATATC ATCTGCAGCA
TTTTTTATTG TTCTTGCACT GATATGTGAA GGTAGCTCAG TGGTAATTGA AAACTGCATT
TTAAACCCAA CAAGAACAGG TATAATTGAT GTTCTAAAAC AAATGGGTGC TGAGATTAAA
ATTGAAGATG TGGAAAATAG AAATGGAGAG CTTGTGGGAA AAATAGTTGC AAGAAGCAGC
AACCTAAGAG GTGTAAAGGT TGAAAAAAAC GATATTCCGC GCATCATAGA CGAAATACCT
ATTTTGGCAG TTGCAGCGGC ATTTGCCGAA GGTAAAACCA TAATTGACCA TGCTTCAGAG
CTAAGAGTAA AAGAGAGTGA TAGAATAAAG ACAACAGTTG AGATGCTGAA AAGTTTTGGA
GCTGAGTGCT ATGAACTTGA AAACGGACTC GAAATAATAG GTTCAAGAGA AAAACTCAAA
AGTGCAGTTG TAAATTCATA TAAAGATCAC AGAATAGCAA TGGCAGCATC TATCATGGCA
TGTGCAGTGG AGGGTGAAAG TACCATTTTG GATGCAGACT GCGTATCAAT CTCTTTTCCA
AACTTTTACG ACATTCTTTT TTCCTCAACA AAAAAGATAT AA
 
Protein sequence
MNVKIDGRRK INSNVIVPPD KSISHRSIMI GSLANGVTEI ENFLFSDDCL ATINCFKNLS 
TDIEIRNDKI IVKGNGFALS APKQILDCQN SGTTTRLLLG ILSTQEFESI LTGDSSLKKR
PMKRVTVPLS QMGAEFEFLE KEDFLPIKVK GSKKLKPIEY TLPIPSAQVK SALIFASLKA
EGKSVIKESP KSRDHTELML KHAGANIKSW EKDGVYTVEI LPSQISSIKI KIPSDISSAA
FFIVLALICE GSSVVIENCI LNPTRTGIID VLKQMGAEIK IEDVENRNGE LVGKIVARSS
NLRGVKVEKN DIPRIIDEIP ILAVAAAFAE GKTIIDHASE LRVKESDRIK TTVEMLKSFG
AECYELENGL EIIGSREKLK SAVVNSYKDH RIAMAASIMA CAVEGESTIL DADCVSISFP
NFYDILFSST KKI