Gene CPR_1271 details


Gene Information

Locus tag: CPR_1271
Symbol:
ID: 4204225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1430864
End bp: 1432627
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 31%
IMG OID: 642565827
Product: hypothetical protein
Protein accession: YP_698593
Protein GI: 110803314
COG category:
COG ID:
TIGRFAM ID:
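
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1430864
end_bp = 1432627

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1764 587
```

Under these assumptions the result agrees with the listed 1764 bp and 587 aa.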


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG ATACTATTCT TGGTGCTTCT ATTGGCTCTA CTGACTTTCA TTATCTTCAA 
AAGGATTATG ATGAAATAAA AAAATTAAAC TTAAATACTT TGACTGAGGT AGCTTGGATA
GGAGATGAAC TTAATTCTAA AATTGTTATG TGGACAAACT CCTCTCCTGT TAATAATGTT
ACCCTCTCTT CAAGTGACTT TATAAATGAA AATGGGGATT TAATCTCTTC AAATAATATT
AAGATTTCTT GGCTTAAAGA AACCTTAGCT AATATAGGAC GTAGTAATCC TTCTGCTCCC
CTTGAACCTT TCCCAGATAT TATTCATAAT TATATTTCAC TAAATATAGA AAAAAATAAA
ATAGCCTCTG CATGGATTAA TATAAAGATT CCTAGGAATG CAAAACCTGG AATTTATAAT
GGTTCTATTG AGGTCACTGC TGATGAATTA GAAAAAGCCT ATACTTTTGA TTATTCCTTT
GAAGTTTTAA ACTTAGTACA ACCTCTTCCA AGGGAAACAA ATACTCAAAT TGAGTTTTGG
CAACATCCTT ACACCATAGC AAGGTATTAT AAAATATCCA AAGAAGATTT ATTTACAGAA
AAGCATTTTA AATATTTAAG AGAGAATCTT AAAGAATATA GAGATATGGG AGGATGTGGT
GTTATAGCCA CTATAGTTCA TGAAGCTTGG AATCATCAAT CTTATGATAG TGACCCTTCA
ATGATTAAGT GGAGAAAAAA CTCCTATGGC ACCTTTGAAT TTGACTACTC TCACTTTGAT
AAGTGGATTC AACTTAATAT AGACTTAGGA ATTCTAGATC CTGAAAAGAG CTTTGGCCAA
ATAAAGTGTT ATAGTATTGT CCCTTGGGAT AATAGAATTC AGTACTTTAA TGAAGCTACT
AATAAAGAAG AAGCCATAAA TCCAAACCCT GGTAGTGATC TTTGGATAAA CATTTGGACA
CAATTTTTAA CTTCATTTAT GTCTCACCTT GAAGAAAAAG GTTGGTTTAA CATAACTTAT
ATTTCAATGG ATGAAAGAAG TATAGATGAT TTAAAAGCTT GTGTTGATTT AATTGAAAGT
ATAACAAATA ACTCTTATGA GCATTTTAAA ATCTCTTCTG CCATGAATTA TGAAAGTGGA
AATGACTACT CTCTCTTAGA TAGAATAGAT GATATATCAA TTGGATTATC CCATATAAAT
CATAATTCTG ATGATATGAA AAATATGGCT AAACATAGAC AAGAACTTGG ATTATTAACT
ACAATATACA CCTGTACTGG AGATTATCCA AGTAGTTTCA CAATAAGTGA CCCTTCAGAA
GGTGCCTTTA CTATGTGGTA TTCCCTATGC CAAAACACTA ATGGATTTAT GCGTTGGTCA
TGGGATGGTT GGGTTGAAAA TCCTTTAAAA AATGTTTCTT ATAAATATTG GGAACCTGGA
GATCCTTTTC TTATATACCC ATCAGAAAAG GATAGCATAG GTAAAACCTT TTACTCTACT
CCTAGATTAG AAAAATTAAA AGAAGGTATA AGAGATATAA ACAAAGCCAA ATACCTTATG
GAAAAGGATC CAAACTTAAA AAAATCTATA GAAAATTTAA TCTACTCTCT AAAAAGACCT
AATAAAGGAG AAAATGCCTA TGGCTCTTCA GTAGCAGCTT CTAAGGAGGA TAGAGATTTA
ACTATCTCAG AAGCAAATAG AATGAAAAAT GGCATAAATA ACTTTGCAAG AGAATTTATT
TCATTAACTA TGAAAACCTT GTAG
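
A GC content figure such as the one in the gene information can be recomputed from a sequence printed in 10-base blocks like the one above. This is a minimal sketch, not the method used by the source database; the function name `gc_percent` is hypothetical, and the example runs only on the first 60-base line of the gene sequence.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Return the G+C percentage of a sequence printed in whitespace-separated blocks."""
    # Remove the spaces and line breaks between the 10-base blocks.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above (60 bases, 19 of them G or C).
first_line = "ATGAAAAAAG ATACTATTCT TGGTGCTTCT ATTGGCTCTA CTGACTTTCA TTATCTTCAA"
print(round(gc_percent(first_line), 1))
```

The same function could be applied to the full 1764 bp sequence to compare against the listed GC content.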
 
Protein sequence
MKKDTILGAS IGSTDFHYLQ KDYDEIKKLN LNTLTEVAWI GDELNSKIVM WTNSSPVNNV 
TLSSSDFINE NGDLISSNNI KISWLKETLA NIGRSNPSAP LEPFPDIIHN YISLNIEKNK
IASAWINIKI PRNAKPGIYN GSIEVTADEL EKAYTFDYSF EVLNLVQPLP RETNTQIEFW
QHPYTIARYY KISKEDLFTE KHFKYLRENL KEYRDMGGCG VIATIVHEAW NHQSYDSDPS
MIKWRKNSYG TFEFDYSHFD KWIQLNIDLG ILDPEKSFGQ IKCYSIVPWD NRIQYFNEAT
NKEEAINPNP GSDLWINIWT QFLTSFMSHL EEKGWFNITY ISMDERSIDD LKACVDLIES
ITNNSYEHFK ISSAMNYESG NDYSLLDRID DISIGLSHIN HNSDDMKNMA KHRQELGLLT
TIYTCTGDYP SSFTISDPSE GAFTMWYSLC QNTNGFMRWS WDGWVENPLK NVSYKYWEPG
DPFLIYPSEK DSIGKTFYST PRLEKLKEGI RDINKAKYLM EKDPNLKKSI ENLIYSLKRP
NKGENAYGSS VAASKEDRDL TISEANRMKN GINNFAREFI SLTMKTL
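
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The sketch below is an assumption-laden illustration rather than the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs only in its permitted start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; the amino acid string follows the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a blocked DNA sequence, stopping at the first stop codon."""
    bases = "".join(blocked_sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("ATGAAAAAAG ATACTATTCT TGGTGCTTCT"))  # MKKDTILGAS
print(translate("TCATTAACTA TGAAAACCTT GTAG"))        # SLTMKTL
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed above.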