Gene CPR_1835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1835 
Symbol 
ID4204461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2025699 
End bp2026796 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content32% 
IMG OID642566385 
ProductGTP-dependent nucleic acid-binding protein EngD 
Protein accessionYP_699149 
Protein GI110801535 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0012] Predicted GTPase, probable translation factor 
TIGRFAM ID[TIGR00092] GTP-binding protein YchF 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.259258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAG GAATGGTTGG ATTACCAAAT GTTGGTAAAA GTACATTATT CAATGCTATT 
ACAAAAGCAG GAGCTGAATC AGCAAACTAT CCTTTCTGTA CAATAGAACC AAACGTAGGT
GTTGTAAGTG TACCAGATAA GAGATTAGAT GTTTTAGAAA AAATATATAA TACAAAGAAA
AAAGTATATA CTGCAATAGA GTTTTATGAT ATAGCAGGAT TAGTTAAAGG TGCTAGCAAA
GGTGAAGGAT TAGGAAATAA ATTCTTATCA CACATAAGAG AAGTTGCAGC TATAGTTCAT
GTTGTAAGAT GTTTTGACGA TGAGAATGTT GTTCACGTAG AAGGTTCTGT AGATCCAATA
AGAGATATAG AAACTATAAA CTTAGAGCTT ATCTTTGCAG ACTTAGATGT TCTTGAAAGA
AGAATGGAAA AGACTATGAA GTTAGTAAGA TCAGGTGATA AAACTGCTAA GTTTGAGTAT
GATGTAATGG AAAGATTAAA AGCTCACTTA GAAGCAAATA AACCAGCTAG AACTTTAGAA
GCTACTGAAG ATGAAGAAGC TTTCGTAAAA AGTTTATTCT TAATAACTTC AAAACCAGTT
TTATATGCTT GTAACATATC AGAAGATGAT ATGATGGAAG GAAACTTAGA TAATGAGTAT
GTTAAAAAAG TTAGAGCATA TGCTGAAACT GAGAATTCAG GAATCATGGT TGTTTGTGCT
AAACTTGAAG AAGAATTATC AGGATTAGAT GAAGAAGAAA AAGCTGAAAT GTTATCTGAG
TATGGATTAG AAGAATCAGG TCTTGATAAA CTTATACAAG CAAGTTATAA ATTATTAGGA
TTAATGAGTT ACTTAACTGC AGGTGTACAA GAAGTTAGAG CTTGGACAAT AAAACAAGGA
ACTAAAGCTC CACAAGCAGC AGGTAAAATT CATTCTGATA TAGAAAGAGG GTTCATAAGA
GCAGAGGTAG TTTCTTATGA TGATTTAGTA GAATGTGGTT CAGAAGCAGC TGCTAAAGAA
AAAGGTGTTT ACAGATTAGA AGGTAAAGAA TACGTAATGA AAGATGGAGA CATAGTTAAC
TTCAGATTCA ACGTATAA
 
Protein sequence
MKLGMVGLPN VGKSTLFNAI TKAGAESANY PFCTIEPNVG VVSVPDKRLD VLEKIYNTKK 
KVYTAIEFYD IAGLVKGASK GEGLGNKFLS HIREVAAIVH VVRCFDDENV VHVEGSVDPI
RDIETINLEL IFADLDVLER RMEKTMKLVR SGDKTAKFEY DVMERLKAHL EANKPARTLE
ATEDEEAFVK SLFLITSKPV LYACNISEDD MMEGNLDNEY VKKVRAYAET ENSGIMVVCA
KLEEELSGLD EEEKAEMLSE YGLEESGLDK LIQASYKLLG LMSYLTAGVQ EVRAWTIKQG
TKAPQAAGKI HSDIERGFIR AEVVSYDDLV ECGSEAAAKE KGVYRLEGKE YVMKDGDIVN
FRFNV