Gene CPR_0868 details

Gene Information

Locus tag: CPR_0868
Symbol:
ID: 4204559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1005492
End bp: 1007129
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 23%
IMG OID: 642565427
Product: EAL domain-containing protein
Protein accession: YP_698193
Protein GI: 110803278
COG category: [E] Amino acid transport and metabolism
              [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
        [COG2200] FOG: EAL domain
TIGRFAM ID:
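
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1005492
end_bp = 1007129

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop,
# which is assumed not to be counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1638 0 545
```

Both results agree with the listed Gene Length (1638 bp) and Protein Length (545 aa).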


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 5.58008e-05
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GGGTGTCTTG GGTAAGTATA GTATTTTTAA TAATTATAAT TGGTTTAGCA 
ATTACAATTA AATTTGATTG TAAGAAAGTT AATTGTAACA GAAAAACAAT TAAAGTAGGA
TTTTATGAAT ATTATCCTTA TTATTATCTT AATAAAAATT CTATGCCAGA TGGCTATTAT
AATGAAATAT TAGAATTAAT ATGTAATAAG ATAAATTTAA ATTATGAGTA TGTAGATTGT
AATATAACAA ATGCTTTAGA AAAGCTTAAA TCTGGAGAAG TAGATTTAGT TTTTGGAATA
AGTAAGACCC CTGATAGAGA AAAGGATTAT GAATTTACTG ACCATTACAT AAATAATGAT
AACTTTGCCA TATATACTAA TAAAAATATA AAAAATGGTG ATTTAAAAGC TTTAAATGGA
TTAAAAATGG GGTTTTTAAA AGGAGAAGAA AATAATGAGT GGATTTTAAG GTTTTTAAAA
GATAAAGGCA TAAATGTGAA GCTTATAGAT GTATATAATT ATCCTGAAGA TGAAGAATAT
TTACATGATA ACAAAGTTGA TTTTATAATA GAAAACAAAA GAAGTAATAT AGATTATGAA
AATAAAAATA TTAAAAAAAT TTTTGAGTTT TCTTCTGGAC CAGTTTATAT AGTTAGTAGA
AAAGGAAATA AAAAATTAAT TGAAAGAATA AATTCCGCTC TTGGAGAGAT TGAAGATGAT
GAAGAGAATG ATACTAATTT ATACTTTAAT GATTTTGATG AATATTTAAA TAAATTAGAG
ATTAAAAAAT TATTTATTGG AATTTTTTTG ATTATAATAA CATTATTTAT TTATAAAAAA
AGAAAAAATA AAATATTCGC TATAAGAACT AAAAGAAAAA TTAGAAACTA TATGAAAAAT
GATAAATATA TATTATATTA TCAACCTATA GTAGATCCAA AGACAAATGG AGTAAAGGGC
TTTGAATCTC TTTTGAGATT AAATAAGAAT GGAAAAATTT TAACACCTGA TAGCTTTATA
AGGGAAATTG AAGACAATAA TATGTCTTTA GAGGTTTCTT TATGGCTTTT AAAGAAAGTT
ATTGCAGACT ACAGAATAAT CAAAGATTAT AGGATGGTCA AAGGAAGTAA TTTTTATATA
TCATTAAATG TTTCATTTAA AGAAATAGAA AATCCTAAGT TTTTAAGATC CTTAATGAAA
ATTGCAAAAG ATTATAAGAT TGATGATTGT AATATTTGTT TAGAGATAGT AGAAAAATTT
GGTATGGAGG ATATAGGAAG AATACAGAGT GCAATAAGAA TAATAAAGGA ATATGGATTT
TTAATAGCTA TAGATGATTT TGGTGTGGAG TATTCTAATT TAGATTTATT AAATAAAATT
GATTCTGATA TAGTAAAGCT AGACAAGTAC TTTGCTGATA ATTTAGATAA GTCTATTATA
AATAAAAAAA CAGTGGAATT TATATCAGAA ATATGTAATA TATCAAATAG AACCATAGTA
TTTGAAGGAG TAGAGGGAAA ATATCAGGTT AATATTATTA AGGCATTTCC ATATGAAAAA
ATATATATTC AGGGATATTT CTATTCAAAG CCAGTGGATA TTGAGACTTT AAAGGATTTT
GAAGTTAAGG ATAGTTAA
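
A GC content figure like the one in the Gene Information section can be recomputed from a sequence given in this 10-base block layout. This is a minimal sketch: `gc_content` is a hypothetical helper, not a function of the source database, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks of the block layout) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input, not the full gene sequence.
print(gc_content("ATGC ATGC"))  # 50.0
```

To check the record, paste the full gene sequence into the call and compare the rounded result with the listed GC content.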
 
Protein sequence
MKKRVSWVSI VFLIIIIGLA ITIKFDCKKV NCNRKTIKVG FYEYYPYYYL NKNSMPDGYY 
NEILELICNK INLNYEYVDC NITNALEKLK SGEVDLVFGI SKTPDREKDY EFTDHYINND
NFAIYTNKNI KNGDLKALNG LKMGFLKGEE NNEWILRFLK DKGINVKLID VYNYPEDEEY
LHDNKVDFII ENKRSNIDYE NKNIKKIFEF SSGPVYIVSR KGNKKLIERI NSALGEIEDD
EENDTNLYFN DFDEYLNKLE IKKLFIGIFL IIITLFIYKK RKNKIFAIRT KRKIRNYMKN
DKYILYYQPI VDPKTNGVKG FESLLRLNKN GKILTPDSFI REIEDNNMSL EVSLWLLKKV
IADYRIIKDY RMVKGSNFYI SLNVSFKEIE NPKFLRSLMK IAKDYKIDDC NICLEIVEKF
GMEDIGRIQS AIRIIKEYGF LIAIDDFGVE YSNLDLLNKI DSDIVKLDKY FADNLDKSII
NKKTVEFISE ICNISNRTIV FEGVEGKYQV NIIKAFPYEK IYIQGYFYSK PVDIETLKDF
EVKDS
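
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI amino acid string, which for elongation codons is the same in translation table 11 as in the standard code. `translate` is a hypothetical helper. It does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-layout whitespace
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAAAAAAA GGGTGTCTTG GGTAAGTATA"))  # MKKRVSWVSI
```

The output matches the first block of the protein sequence. The final gene line, `GAAGTTAAGG ATAGTTAA`, translates to `EVKDS` followed by a stop codon.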