Gene CPR_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1838 
Symbol 
ID4204650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2028598 
End bp2029884 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content26% 
IMG OID642566388 
Producthypothetical protein 
Protein accessionYP_699152 
Protein GI110802341 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000272519 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTT TATTTATAGC TTGTTATTCT CCTATGATAA ATAATTCAGC ATCAATTGAA 
ACTCTTATGT ACTTAAATAA TTTATGTAAT ATAGAGAATA ATTGTGTGCA TCTTTTAACT
GTAGACTTTC CTAAAAACTC TATATACTAT GATGAGGAAA TATTAAAGCT TTTAGATAGT
AAAGTAAAGG TTCATGCTAT TGAAGGTGGA AAATTATTTA ATAAGATTAT GCCAAAAAAA
TCTATAGGGG CAAAGGAAGA TGAGAAGTCT TCTAACACAA AATCTAGTAG TAAAATTAAA
CTTATGAGGA AGATTAAAAA TAAAATAATT TTTCCTGATA TGTATTATAA CTGGAGCTTT
AAAGCTTCAA AGTATAGCAT AGAACTTATG AATAAAGAAA AGTTTGATGT TATATTTTCT
ATGCATGAGC CACCATCTAG TCACCTTTGT GCTTTAAGAA TAAAAAAGCA CTTTAAAGAG
ATTCCTTGGG TTTTATATTG GAGCGATCCT TGGCTTAAGG ATCCATCAAG AGAGAATATT
GGTTTTATAA GAAAATTCAT AGAAGGTAGA CAAGAAAAAT CAGTAGTATT AAATGGGGAT
AGACATATAT TTGTAACTGA AGAGAATAAA AAAGATTTTA TGGAAAAATA TAATGTAAAA
GAAGATAAAA TGTTTATCGT AACTAGGGGA TACAATAAAG CCATATATGA AGAAATTGAA
AGGGCAGAAA AGCCAGAACT TTTAAAGGAT AATAAGATAA ACTTAATTTA TGCTGGAGAA
ATTTTCAGTA AAATTAGGGA TTTAAAACCT TTTATAAAAG CTTTAAAAGA ATTAGAGAAA
AGAGATCAGG AGCTATTTAA TAGATTAAAC ATAATATTTT TTGGAAACAT AGATGATGAA
AATATTAAAG AAGAATTAAA AAAGTTTTCT AACGTTAGTG TTAATGGAAG AATTGACTAT
AAGGAAGCTT TAAGATATAT GATACATGGA GATGTTCTTC TTGTTTTAGG AAACAAAAAT
TCTAAGCAAA TACCTGCTAA AATATATGAC TATTTAGGAA CAAAGAATCT TATTATAGTT
ATATTAGGAG ATGAAAATGA TCCTATTAAG AATGTTGCAC TTAATAAAGA AAAGTGTATA
GTTAGTGAAA ATAATTATGA GGCTATAATA GATGACTTAA ATAAATGTAG AGATTTAATA
GATTCAGGGA AGAAATTTAA GGCAAATGAA GAATATGAAT GGAGTAGTAT AGGTAAGAGG
CTAAATAATA TACTAAAATT AAAATAG
 
Protein sequence
MKILFIACYS PMINNSASIE TLMYLNNLCN IENNCVHLLT VDFPKNSIYY DEEILKLLDS 
KVKVHAIEGG KLFNKIMPKK SIGAKEDEKS SNTKSSSKIK LMRKIKNKII FPDMYYNWSF
KASKYSIELM NKEKFDVIFS MHEPPSSHLC ALRIKKHFKE IPWVLYWSDP WLKDPSRENI
GFIRKFIEGR QEKSVVLNGD RHIFVTEENK KDFMEKYNVK EDKMFIVTRG YNKAIYEEIE
RAEKPELLKD NKINLIYAGE IFSKIRDLKP FIKALKELEK RDQELFNRLN IIFFGNIDDE
NIKEELKKFS NVSVNGRIDY KEALRYMIHG DVLLVLGNKN SKQIPAKIYD YLGTKNLIIV
ILGDENDPIK NVALNKEKCI VSENNYEAII DDLNKCRDLI DSGKKFKANE EYEWSSIGKR
LNNILKLK