Gene CPR_2029 details

Gene Information

Locus tag: CPR_2029
Symbol:
ID: 4204019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2237164
End bp: 2238564
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 30%
IMG OID: 642566579
Product: glutamate decarboxylase
Protein accession: YP_699338
Protein GI: 110801514
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0076] Glutamate decarboxylase and related PLP-dependent proteins
TIGRFAM ID: [TIGR01788] glutamate decarboxylase
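
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the gene is a stop codon; neither convention is stated in the record itself. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2237164
END_BP = 2238564
GENE_LENGTH_BP = 1401
PROTEIN_LENGTH_AA = 466


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1401
print(coding_codons(GENE_LENGTH_BP))   # 466
```

Under those two assumptions the reported gene length (1401 bp) and protein length (466 aa) agree with the coordinates.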


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTTTTG GAAAAAATGA ATTAGGTAGT CAATTTGAAA CACCTATATT TGGAACAGTT 
GAAAGTAATG AACCTATTCC AAAATATAAA TTAGCAAAAA AATCAATTGC ACCACAAGTT
GCATATAGAT TAATAAAAGA TGATTTACTA GATGAAGGTA ATGCACGTCA AAACTTAGCA
ACATTTTGTC AAACATATAT GGATGATGAA GCAGTAAAAT TAATGTCAGA GACATTAGAG
AAAAATGCTA TAGATAAATC AGAATATCCA CAAACAACAG ATATGGAAAA CCGTTGTGTA
AATATATTAG CTGATTTATG GCATGCTCCA AAAGAATTAA ATTATATGGG AACTTCAACA
GTTGGATCAT CAGAAGCTTG TATGCTTGGT GGTATGGCTA TGAAATTTAG ATGGAGAAAT
AGAGCAAAAG CTTTAGGTAT GGATGTTACA TCTAAAAAAC CAAACTTAGT AATCTCTTCA
GGATACCAAG TATGTTGGGA AAAATTCTGT GTTTATTGGG ATATAGAAAT GAGATTAGTT
CCAATGGATG AACAACATAT GAGTATAAAT GTAGATAAAG TTTTAGACTA TGTTGATGAT
TATACAATAG GTGTAGTTGG AATACTAGGA ATTACTTACA CTGGTAAATA TGACGATATA
AAAGCTTTAG ATAAAAAATT AGAAGAATAC AATAAAACAG CAAAAATAAC AGTTCCAATA
CATGTTGATG GTGCTTCAGG AGCTATGTTT GCTCCATTTA TAGAACCTGA TTTAAAATGG
GACTTCCAAT TAAAAAATGT TGTTTCAATT AGTACATCAG GTCATAAATA TGGATTAGTT
TATCCAGGTA TAGGTTGGGT ATTATGGAAA GATAAAGAAT ACTTACCTCA AGAATTAGTA
TTTGAAGTAA GCTATTTAGG TGGAAAAATG CCTACAATGG CAATAAACTT CTCAAGATCA
GCTAGCCAAA TTCTTGGTCA ATACTATAAC TTCTTAAGAT ACGGATTTGA AGGTTATAGA
CAAATACATC AAAGAACAAA AGATGTAGCT ATGTACTTAT CAAGTGAACT TGAAAAAACT
GGTTTATTTG AAATATATAA TAATGGTGAA AACTTACCAA TAGTTTGCTA CAAATTAAAA
GATGATGTTA AAGTAAACTG GACATTATAT GATTTAGCAG ATAGATTATT AATGAAAGGA
TGGCAAGTAC CTGCTTATCC ATTACCAGAA GATTTACAAA ATGTAATAAT TCAAAGATTT
GTATGTCGTG CAGATTTAAG TAGAAACTTA GCTGATTTAT TAATGAGAGA TTTAAAAGCA
GCTATAGAAG ATTTAAATAA TGCAAGAATT TTAAGCAAAA CTCAAGTAGA TAATAAGGGA
GTACAAGGAT TTACTCACTA G
 
Protein sequence
MLFGKNELGS QFETPIFGTV ESNEPIPKYK LAKKSIAPQV AYRLIKDDLL DEGNARQNLA 
TFCQTYMDDE AVKLMSETLE KNAIDKSEYP QTTDMENRCV NILADLWHAP KELNYMGTST
VGSSEACMLG GMAMKFRWRN RAKALGMDVT SKKPNLVISS GYQVCWEKFC VYWDIEMRLV
PMDEQHMSIN VDKVLDYVDD YTIGVVGILG ITYTGKYDDI KALDKKLEEY NKTAKITVPI
HVDGASGAMF APFIEPDLKW DFQLKNVVSI STSGHKYGLV YPGIGWVLWK DKEYLPQELV
FEVSYLGGKM PTMAINFSRS ASQILGQYYN FLRYGFEGYR QIHQRTKDVA MYLSSELEKT
GLFEIYNNGE NLPIVCYKLK DDVKVNWTLY DLADRLLMKG WQVPAYPLPE DLQNVIIQRF
VCRADLSRNL ADLLMRDLKA AIEDLNNARI LSKTQVDNKG VQGFTH
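
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch only, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted), so the standard 64-codon layout is used. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names introduced here, and only the first and last blocks of the gene sequence are copied in to keep the example short.

```python
from itertools import product

# Standard codon layout (bases ordered T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 21 nucleotides of the gene sequence above.
FIRST_30_NT = "ATGCTTTTTG GAAAAAATGA ATTAGGTAGT"
LAST_21_NT = "GTACAAGGAT TTACTCACTA G"

print(translate(FIRST_30_NT))  # MLFGKNELGS
print(translate(LAST_21_NT))   # VQGFTH
```

The two outputs match the first ten and last six residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports in rounded form.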