Gene Cphy_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2540 
Symbol 
ID5741818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3110347 
End bp3111657 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content39% 
IMG OID641293630 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001559640 
Protein GI160880672 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.341654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTC ATAAAGTGAA ACAGATAAAT GGTACCCTCA CCGTACCAGG TGATAAGTCC 
ATCTCACACC GTGCAGTTAT GTTTGGAGCT ATAGCAGAAG GAACTACAGA AGTTTATAAT
TTCTTAAAAG GTGCTGACTG TCTCTCAACC ATACAGTGTT TTAGACAGCT GGGAATCAAT
ATAGAAGAAG ATACTAAACA GCAAGTGATT CGAATTCACG GAAAAGGACT TCATGGATTA
ACTCCACCTT CTACTATTCT TGATGTAGGT AATAGTGGAA CGACGCTCCG TCTTATTTCT
GGAATATTAA GTGGTCAACC ATTTGAAAGT AACATTACCG GTGATAGTTC TATACAAAAA
CGGCCAATGA ATAGAGTTAT TACACCTCTA AGCCTAATGA ATGCTGATAT TAAAAGTGTT
CTAGGAAACG GTTGTGCACC ACTCTGCATT AATGGATCCT ATCAAAACGG CGCAAAGTCT
GCCTTAAAGA GTATTCATTA TAATTCTCCT ATTGCCTCTG CACAAGTTAA ATCTTCTATT
CTTTTAGCAG GTCTATATGC AGAAGGTGAA ACTTCAGTAA CTGAGCCATA CGTTTCGAGG
AATCATACCG AACTTATGTT ACAAAAATTC GGTGCAAATC TTAGCGTAAA CGACAAAACA
GTAACTATTC AACCTGAACC AAGGTTAATG GCACAAAAAG TTCATGTACC AGGAGACATC
TCTTCTGCCG CTTATTTCCT TGCTGCTGCT TGTATACTCC CTAATTCTGA ACTTGTTATA
AATAATGTAG GTGTAAATCC TACACGTGAT GGAATCATCG ATGTCTTGCT TGCGATGGGT
GCTGACATTA CGAAAGAAGA TTTAAAGAAT CAAGAAGGTG AAGCAGTATG CAATCTAAGG
GTTAGAAGCA GTAAACTTCA TGGCACTGTG ATTGAAGGAA GTATCATCCC TCGTCTTATT
GATGAGATAC CTGTTATCGC TGTTGTTGCA TGTTTTGCAG AAGGCGATAC AATCATCAAA
GATGCAGCCG AATTAAAGGT GAAAGAGTCC AATCGTATTG ATGTAATGGT ACAACAACTG
AAACATATGG GCGCTAATCT TACTGCAACC GAAGATGGTA TGATTATTCA CGGAGGCCAA
AAGCTATCTG GTACTGTCAT CGAAAGTAAA GAAGATCATC GTATTGCAAT GTCTTTCGCT
ATTGCAAGCC TAATGGCCGA AGGCGAAACG ACTATTCAAG GTGCAGAATG TGTTAACATC
TCCTATCCAG AATTTTATCA AGATTTGTAT AGACTAACCT GCGATAATTA G
 
Protein sequence
MKFHKVKQIN GTLTVPGDKS ISHRAVMFGA IAEGTTEVYN FLKGADCLST IQCFRQLGIN 
IEEDTKQQVI RIHGKGLHGL TPPSTILDVG NSGTTLRLIS GILSGQPFES NITGDSSIQK
RPMNRVITPL SLMNADIKSV LGNGCAPLCI NGSYQNGAKS ALKSIHYNSP IASAQVKSSI
LLAGLYAEGE TSVTEPYVSR NHTELMLQKF GANLSVNDKT VTIQPEPRLM AQKVHVPGDI
SSAAYFLAAA CILPNSELVI NNVGVNPTRD GIIDVLLAMG ADITKEDLKN QEGEAVCNLR
VRSSKLHGTV IEGSIIPRLI DEIPVIAVVA CFAEGDTIIK DAAELKVKES NRIDVMVQQL
KHMGANLTAT EDGMIIHGGQ KLSGTVIESK EDHRIAMSFA IASLMAEGET TIQGAECVNI
SYPEFYQDLY RLTCDN