Gene CPR_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1547 
Symbol 
ID4203993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1734526 
End bp1735815 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content31% 
IMG OID642566099 
Productpermease 
Protein accessionYP_698864 
Protein GI110801721 
COG category[R] General function prediction only 
COG ID[COG2252] Permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0198208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGT TCTTTAAGCT TAAAGAAAAT AACACTGATG CAAAAACAGA ATTTATTGCT 
GGATTAACTA CTTTTATGAC TATGGCTTAT ATACTTATAG TAAATCCATC AATATTATCA
GCAACAGGAA TGGATCAAGG AGCTGTATTT ACAGCAACGG CTTTATCAGC AGTAATAGCA
ACTTTAATAA TGGGACTTTA TGCTAAGTTA CCATTTGCAC AAGCTCCAGG AATGGGACTA
AATGCATTTT TTGCTTATAC AATAGTTATT CAAATGGGAT ATTCATTTGA ATTTGCTTTA
ACTGCAGTTT TATTAGAAGG AATAATATTT ATACTTTTAA CTATATTTAA CGTACGTGAA
GCAATAGTAG ACTCAATACC AAGGGGAATA AAAAATGCTA TATCAGTAGG TATAGGATTA
CTTATTTCTT TAATAGGATT AGAGGGAGCA GGAATCGTAG TACATACAGA TGGTGGAACT
ATAGTTTCTT TAGGAAATAT AGTTTCAGGA TCAGGACTTT TAGCAATAAT AGGTCTTTTA
ATAACAAGTG TTTTAATAGC TAAAAACGTT AAGGGAGCAT TATTTATAGG TATGATTATT
ACAGCAATAA TAGGAATACC TATGGGAATA ACTCCTATGC CAAGCAAGAT TATTAGTACG
CCACCTTCAA TAGCACCTAC TTTCTTCAAG TTCGATTTTC ATAACATATT CTCTTTAGAC
ATGGTAATAG CATTATTTAC ATTATTATTC ATGGATATGT TTGATACAAT AGGAACTTTA
GTTGGTGTTG CAACTAAGGC TAAAATGTTA GATAAGGATG GAAAAGTACC TAACATAAAG
AAAGCTTTAT TTTCTGACGC AGTAGGTACA ACATTAGGAG CTTTTTTAGG AACAAGTACA
GTAAGTACTT TTGTAGAGAG TGCATCAGGG GTTGCAGAAG GAGGAAGAAC TGGATTAACA
GCAGTTTCAA CTGCGTTTAT GTTTTTCTTA GCTTTATTCT TTGCTCCATT ATTTGCAATT
ATAACTCCAG CAGTTACAGC GTCAGCTTTA GTTTTAGTTG GATTATTTAT GATAGAACCA
ATAAAAGAAA TAGACTTACA TGATTTTACA GAAGCTATAC CAGCTTTCTT AACAATAATC
ATGATGCCAT TTGCTTACTC AATATCAGAT GGTATAGTAT TTGGAGTTAT ATCATACATA
ATATTAAAAT TATTCACTGG AAAAAGAAAA GAGATAAGTT TAACTACTGT TATCTTAGGA
TTAGTATTTT TACTTAAGTT TTTAATATAA
 
Protein sequence
MEKFFKLKEN NTDAKTEFIA GLTTFMTMAY ILIVNPSILS ATGMDQGAVF TATALSAVIA 
TLIMGLYAKL PFAQAPGMGL NAFFAYTIVI QMGYSFEFAL TAVLLEGIIF ILLTIFNVRE
AIVDSIPRGI KNAISVGIGL LISLIGLEGA GIVVHTDGGT IVSLGNIVSG SGLLAIIGLL
ITSVLIAKNV KGALFIGMII TAIIGIPMGI TPMPSKIIST PPSIAPTFFK FDFHNIFSLD
MVIALFTLLF MDMFDTIGTL VGVATKAKML DKDGKVPNIK KALFSDAVGT TLGAFLGTST
VSTFVESASG VAEGGRTGLT AVSTAFMFFL ALFFAPLFAI ITPAVTASAL VLVGLFMIEP
IKEIDLHDFT EAIPAFLTII MMPFAYSISD GIVFGVISYI ILKLFTGKRK EISLTTVILG
LVFLLKFLI