Gene CPR_1551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1551 
Symbol 
ID4205729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1738678 
End bp1740213 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content31% 
IMG OID642566103 
Productsugar ABC transporter ATP-binding protein 
Protein accessionYP_698868 
Protein GI110802771 
COG category[R] General function prediction only 
COG ID[COG3845] ABC-type uncharacterized transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.048612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATATG TAGTAGAAAT GCTTAATATC CGTAAAGAAT TTCCTGGTAT AGTAGCAAAT 
GATGATATAA CTTTGCAACT TAAAAAAGGA GAGATACATG CTTTACTTGG AGAAAATGGA
GCTGGTAAAT CTACTTTAAT GGGAATTCTT TTTGGAATGA ACCAACCAGA TAAAGGGGTT
ATAAAGGTTA GAGGTAAAGA AGTAAAAATT ACTAATCCAA ATGTTGCAAA TGATCTTGGA
ATAGGAATGG TACACCAACA TTTTAAATTA GTTGAAAATT TTACAGTAAC TCAAAATATA
GTTTTAGGAT GCGAGCCTAA GATTCTTTTA GGATTAGGAA TGGATTTAAA CAAGGCAGCT
AAAAGAATTG AAAAATTGTC AAATCAATAT GGATTAAATG TTGACCCAAA TGCAAAGATT
GAAAATATAT CTGTTGGTAT GCAACAAAGG GTTGAAATAT TAAAAATGCT TTATAGAGAT
GCTGATGTTC TTATATTAGA TGAGCCTACT GCAGTTTTAA CCCCTCAAGA AATTGATGAA
CTTATAAAAA TAATGAAAAA TCTTATAAAT GAGGGAAAAT CAATAATAAT TATAACTCAT
AAACTTAAGG AGATAAAAGC TGCTGCAGAT AGATGTACAG TTATAAGAAG AGGTAGATAC
ATTGGTACTG TAGATGTAAA AACTACTAGC GAAGCTGAAA TGGCTAAAAT GATGGTAGGA
AGAGAAGTAT CATTTAAAGT TAATAAAAAG CCTGCTAAGC CTGGGGAGAT AGTATTAGAT
ATTAAAAATC TTTCAGTTAA GAATAATAAG AAAGTATTAG GATTAAAGGA CTTTTCTATT
GATGTTAGAG CAGGAGAAAT AGTAGGTATA GCAGGGGTAG AAGGTAATGG TCAAAGTGAA
CTTATTGAGG CTATTACAGG ACTTAGAAAA AGTGAAAGTG GAACTATAAA CTTCAAAAAC
AAGGATATAA ATAGAGAATC TATAAGAAAT AGAATAAACT CAGGGATTGC ACATATTCCA
GAAGATAGAC ACAAGAGAGG ACTAGTTTTA GATTATACAA TTGAAGAGAA TATGGTTCTA
GAAGTGTATG ATAAGAAACC TTTTTCAAAT AAAGGTTTAT TAAACAAAAA AGAAATAAAA
AAATATGCAG AAAAAATAAT AGATGAATTT GATGTAAGAT CTGGAGAAGG GGCTGAATCA
ATAGCAAGAT CTCTTTCAGG AGGAAATCAG CAAAAAGCAA TTATAGGTCG TGAAATAGAA
TTAAATCCAG AACTTTTAAT AGCAGCACAA CCTACTAGAG GACTTGATGT AGGATCTATA
GAGTATATTC ATAAAAGGCT TGTCGAGCAA AGAGATAGAG GAAAAGCTGT GCTTTTAGTT
TCCCTTGAAT TAGATGAAAT ATTAAATGTC TCAGATAGAA TTGCCATAAT AAATAACGGA
GAACTTATAG GTATTGTAAA TGCAGATGAA ACTAATGAAA ATGAGGTAGG TCTTATGATG
GCTGGTATAA GAGGAGGAGA AAAGCATGAA GTTTAA
 
Protein sequence
MEYVVEMLNI RKEFPGIVAN DDITLQLKKG EIHALLGENG AGKSTLMGIL FGMNQPDKGV 
IKVRGKEVKI TNPNVANDLG IGMVHQHFKL VENFTVTQNI VLGCEPKILL GLGMDLNKAA
KRIEKLSNQY GLNVDPNAKI ENISVGMQQR VEILKMLYRD ADVLILDEPT AVLTPQEIDE
LIKIMKNLIN EGKSIIIITH KLKEIKAAAD RCTVIRRGRY IGTVDVKTTS EAEMAKMMVG
REVSFKVNKK PAKPGEIVLD IKNLSVKNNK KVLGLKDFSI DVRAGEIVGI AGVEGNGQSE
LIEAITGLRK SESGTINFKN KDINRESIRN RINSGIAHIP EDRHKRGLVL DYTIEENMVL
EVYDKKPFSN KGLLNKKEIK KYAEKIIDEF DVRSGEGAES IARSLSGGNQ QKAIIGREIE
LNPELLIAAQ PTRGLDVGSI EYIHKRLVEQ RDRGKAVLLV SLELDEILNV SDRIAIINNG
ELIGIVNADE TNENEVGLMM AGIRGGEKHE V