Gene ECD_00287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_00287 
SymbolprpC 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp319641 
End bp320810 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content56% 
IMG OID 
Product2-methylcitrate synthase 
Protein accessionACT42186 
Protein GI253976516 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACA CAACGATCCT GCAAAACAGT ACCCATGTCA TTAAACCGAA AAAATCTGTG 
GCACTTTCTG GCGTTCCGGC GGGCAATACG GCGCTCTGCA CCGTGGGTAA AAGCGGCAAT
GACCTGCATT ACCGCGGCTA CGATATTCTT GATCTGGCGG AACATTGCGA ATTTGAAGAA
GTGGCGCATC TGCTGATCCA CGGCAAACTG CCGACCCGTG ACGAACTCGC CGCTTACAAA
ACGAAACTGA AAGCCCTGCG CGGTTTACCG GCTAACGTGC GTACCGTGCT GGAAGCCTTA
CCGGCGGCGT CACACCCGAT GGATGTTATG CGCACCGGCG TTTCCGCGCT CGGCTGCACG
CTGCCAGAAA AAGAGGGACA TACCGTCTCT GGCGCGCGGG ATATTGCCGA CAAACTGCTG
GCGTCGCTTA GCTCGATTCT CCTTTATTGG TATCACTACA GCCACAACGG CGAACGCATC
CAGCCGGAAA CCGATGACGA CTCCATCGGC GGTCACTTCC TGCATCTGCT GCACGGCGAA
AAGCCATCGC AAAGCTGGGA AAAGGCGATG CATATCTCGC TGGTGCTGTA CGCCGAACAC
GAGTTTAACG CCTCCACCTT TACCAGTCGG GTGATTGCGG GTACTGGCTC TGATATGTAT
TCCGCGATTA TTGGCGCGAT TGGCGCACTG CGCGGGCCGA AACACGGCGG GGCGAATGAA
GTGTCGCTGG AGATCCAGCA ACGCTACGAA ACGCCGGACG AAGCCGAAGC CGATATCCGC
AAGCGGGTGG AAAACAAAGA AGTAGTCATT GGTTTTGGTC ATCCGGTTTA TACCATCGCC
GACCCGCGTC ATCAGGTGAT CAAACGCGTG GCGAAGCAGC TCTCGCAGGA AGGCGGCTCG
CTGAAGATGT ACAACATCGC CGATCGCCTG GAAACGGTGA TGTGGGAGAG CAAAAAGATG
TTCCCCAATC TCGACTGGTT CTCTGCTGTT TCCTACAACA TGATGGGCGT TCCCACCGAG
ATGTTCACAC CACTGTTTGT TATCGCCCGC GTCACCGGCT GGGCGGCGCA CATTATCGAA
CAACGTCAGG ACAACAAAAT TATCCGTCCT TCCGCCAATT ATGTTGGACC GGAAGACCGC
CCGTTTGTCG CGCTGGATAA GCGCCAGTAA
 
Protein sequence
MSDTTILQNS THVIKPKKSV ALSGVPAGNT ALCTVGKSGN DLHYRGYDIL DLAEHCEFEE 
VAHLLIHGKL PTRDELAAYK TKLKALRGLP ANVRTVLEAL PAASHPMDVM RTGVSALGCT
LPEKEGHTVS GARDIADKLL ASLSSILLYW YHYSHNGERI QPETDDDSIG GHFLHLLHGE
KPSQSWEKAM HISLVLYAEH EFNASTFTSR VIAGTGSDMY SAIIGAIGAL RGPKHGGANE
VSLEIQQRYE TPDEAEADIR KRVENKEVVI GFGHPVYTIA DPRHQVIKRV AKQLSQEGGS
LKMYNIADRL ETVMWESKKM FPNLDWFSAV SYNMMGVPTE MFTPLFVIAR VTGWAAHIIE
QRQDNKIIRP SANYVGPEDR PFVALDKRQ