Gene GWCH70_2242 details


Gene Information

Locus tag: GWCH70_2242
Symbol:
ID: 7978409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2296312
End bp: 2297631
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 45%
IMG OID: 644799056
Product: diaminopimelate decarboxylase
Protein accession: YP_002950216
Protein GI: 239827592
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID: [TIGR01048] diaminopimelate decarboxylase
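
The start, end, and length values in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon
    that is not counted as part of the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2296312, 2297631)
print(length_bp)                  # 1320
print(protein_length(length_bp))  # 439
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.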


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0241935
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTTTTC ATGGAACAAG CCGTGTCAAT GACAAGGGGC ATTTAGAAAT TGGCGGTGTC 
GATACCGTTG ATTTAGCGAA AGAATATGGA ACACCATTGT ATATTTATGA TGTGGCACTC
ATTCGTGAGC GTGCGCGGGG GTTTAAAGAA GCGTTTCAAA AACATGGCGT CAAAGCACAA
GTTGCCTATG CAAGCAAGGC ATTTTCTTCC ATTGCGATGG TTCAGTTAGC AGAAGAAGAA
GGATTATCGT TGGATGTAGT GTCAGGAGGA GAGCTGTATA CAGCATTGCA AGCGGGATTT
CCGCCAGAAC GCATCCATTT TCACGGAAAT AATAAAAGCC GTGACGAATT GATGATGGCT
CTTGAGAATG GAGTCGGATG CATCGTTGTC GATAACTTTT ATGAACTGGA ATTGTTGGAA
CAACTAAGCA AGCAGTACGG AAAAAAAACG GCGATTTTAC TAAGAGTCAC CCCTGGTGTG
GAGGCGCATA CGCACGATTA TATTTTAACC GGTCAGGAAG ATTCAAAATT TGGGTTTGAT
TTGAACAATG GCCAAGCAGA TGAAGCGCTG CAAAAAGCGC TGTCTTCCCT GTCTTTTTCC
GTATTGGGAA TTCATTGCCA TATTGGTTCG CAAATTTTCG AGACGACTGG ATTTGTGTTA
GCGGCACAAA AAATTTTCCA AAAAATCGCT CACTGGAAGG AAACGTACGG ATTTATTCCA
ACGGTTGTCA ATCTTGGCGG TGGATTTGGC ATTCGCTATA CGAGTGATGA CGACCCGATT
CCTGTTTCAG AATATGTCGA CCAAATCGTA GAGGAAGTAA AAAAACAAGC AAGCGAAAAA
AATATTCCAA TGCCAGAAAT CTGGATTGAG CCGGGACGCT CCCTTGTCGG CGATGCGGGA
ACGACGATTT ATTCGATCGG TTCGCGCAAA GATGTGCCGA ATGTCCGTCA TTACGTTGCG
GTAGACGGAG GGATGAGCGA CAATATCCGC CCGGCGCTAT ATGATGCGAA GTATGAAGCG
GTGTTAGCGA ATCGAGTGTT AGACGAAAAA AATGAGATTG TCGCCATCGC CGGGAAGTGT
TGTGAATCAG GAGATATGCT TATTTGGGAT TTGCCGCTGC CAAAAGCATC CCCTGGCGAT
TATTTAGCCG TTTTTTGCAC TGGGGCGTAT GGCTATTCGA TGGCGAACAA CTATAACCGC
ATTCCACGTC CAGCTGTCGT GTTTGTGGAA AATGGAGAAG CACAACTCGT TGTTAAACGG
GAAACGTATG AAGATTTAGT GCGGCTTGAT GTACCGCTTA AAACAAAAGT GAATAAGTAA
 
Protein sequence
MFFHGTSRVN DKGHLEIGGV DTVDLAKEYG TPLYIYDVAL IRERARGFKE AFQKHGVKAQ 
VAYASKAFSS IAMVQLAEEE GLSLDVVSGG ELYTALQAGF PPERIHFHGN NKSRDELMMA
LENGVGCIVV DNFYELELLE QLSKQYGKKT AILLRVTPGV EAHTHDYILT GQEDSKFGFD
LNNGQADEAL QKALSSLSFS VLGIHCHIGS QIFETTGFVL AAQKIFQKIA HWKETYGFIP
TVVNLGGGFG IRYTSDDDPI PVSEYVDQIV EEVKKQASEK NIPMPEIWIE PGRSLVGDAG
TTIYSIGSRK DVPNVRHYVA VDGGMSDNIR PALYDAKYEA VLANRVLDEK NEIVAIAGKC
CESGDMLIWD LPLPKASPGD YLAVFCTGAY GYSMANNYNR IPRPAVVFVE NGEAQLVVKR
ETYEDLVRLD VPLKTKVNK
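
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be joined, translated, and checked for GC content. It is not the database's own pipeline. It assumes translation table 11 assigns the same amino acids to codons as the standard code (table 11 differs mainly in which codons may act as start codons), and all function names are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering of the standard
# genetic code; translation table 11 is assumed to share these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon and stop at the first stop codon.
    Codons containing characters other than A, C, G, T raise KeyError."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown in this record.
print(translate("ATGTTTTTTC ATGGAACAAG CCGTGTCAAT"))  # MFFHGTSRVN
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.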