Gene CPR_2069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2069 
Symbol 
ID4204939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2291000 
End bp2292421 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content28% 
IMG OID642566619 
Productmajor facilitator transporter 
Protein accessionYP_699378 
Protein GI110801969 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCTG GTCTAGTTAA TCTAATTACA AGCAAAGATG AAAACTGGGA CAAAAGATGG 
AAGTATAACT GTGCAATGTT TATATTATGT TATATTTTCA TGGGGGCTGT CACAGGAATA
ACTAATGACT CTTATTTATC TTATTTAAAC ATCACTGTTC CTAATGTAGT TAAAGCACTT
CCTACCTATG CATCAATTGG AACCTTCATA ATAGCAATGC TTTTACTATT AACTCATAAA
GTAGGATTTA AAAAATTAAT ATTACTTGCA CCTATTTTTT CAATTCTTGG TTTATTGGTT
TGTATATATA GTAAAGATGG TTTAATCATA ACTTTTGCAT ATATAGTTGT AAATATTGGT
TTGGGAATGT TTGATTGTAT TTATCCTGTG ATGTTTACAT CATATACACC TAGAAAAGAA
AGAACTAAAA TGTTCTCTAG GGTTATGTAC TGCAATCTTA TTAGCCAATC AATTTTAACA
TTTTTAAATG GTAAAATTGT TGTTTGGAAA TTTGCTAAAT CTTTAGGTAT CTCTTATAAT
AGAGCTTCTG TTTTATCAGG AAATCAAGAT GCCTTAAGTT CTGTTCAATT AATGGCATAT
TCAGATTCAT ATAAATTTGT ACTTTGGATT GCAATAGCCT TAACAGTAAT AGCTTCAGTC
TTTCTATTAT TCTTAAAAGA AAAAAGTGAA GATTATAGAG AAACTCCTGA AGAAATTCAA
GCTAGAAAAG GAGAAAAAGT CTTTGATTTC AAATTATTAG CTAATAAATA TGTTATCTTA
TGGATTCTTA TATTTGGAAT AGTAAGATTT GGTGCTTTAT TGGTAACACC ATTCTTTCCA
ATTTATTTAA ATAATTTCTT ACATATAAGC AGAGGTACTG TTTCATCTAT CATAACATTC
CAAACAATAT CTATGGTAAT TGGTTATTTC TGCACCCCTT ACCTAGAAAA AAAATTTGGT
TCAATAGTAA CCATATCTGT AACTACTATA CTTTGTGTAC CACTTATGAT ATTAATGGCT
AATGGTGCTA TATTTGGAAG TAATGTTGCT ATGATAGTAG GTATAATACT ATTTTTAAGA
TCTGGTGTAG CAAATGTTTC TGCTCCAATA CAAAGTTCTT TACCATTAAC CTTTGTTCCT
AAAAATTTAG TTTCAGCTTA TAATTCACTT ATATTAGTTG TTAATTCACT TATAGGAATT
GTAGCTGGTA TATATACAAG ATATTCTTTA TTAAAGACAG AATCAGGTTA TGGAAAAGCT
TACTATATAG CAGGTGCTCT TTACTTAATA GCTAGCATAT TTCTTCTTAT AATATTTACC
AAAAAATATA ATAGAACTAA TAATGATTCT GAGGTAATTG AAATTAATGC AGAAACCGCA
ATATCCTCTG TAGATGATAG TATAAAAGAA ACTTTAAAAT AG
 
Protein sequence
MEAGLVNLIT SKDENWDKRW KYNCAMFILC YIFMGAVTGI TNDSYLSYLN ITVPNVVKAL 
PTYASIGTFI IAMLLLLTHK VGFKKLILLA PIFSILGLLV CIYSKDGLII TFAYIVVNIG
LGMFDCIYPV MFTSYTPRKE RTKMFSRVMY CNLISQSILT FLNGKIVVWK FAKSLGISYN
RASVLSGNQD ALSSVQLMAY SDSYKFVLWI AIALTVIASV FLLFLKEKSE DYRETPEEIQ
ARKGEKVFDF KLLANKYVIL WILIFGIVRF GALLVTPFFP IYLNNFLHIS RGTVSSIITF
QTISMVIGYF CTPYLEKKFG SIVTISVTTI LCVPLMILMA NGAIFGSNVA MIVGIILFLR
SGVANVSAPI QSSLPLTFVP KNLVSAYNSL ILVVNSLIGI VAGIYTRYSL LKTESGYGKA
YYIAGALYLI ASIFLLIIFT KKYNRTNNDS EVIEINAETA ISSVDDSIKE TLK