Gene CPR_1212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1212 
Symbol 
ID4204168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1358495 
End bp1359991 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content31% 
IMG OID642565768 
Productamino acid permease yshA 
Protein accessionYP_698534 
Protein GI110803526 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0367716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCAG AACATAAAAG TAAATTGTCT TTAACAGCGC TTGTATTAAT GATTTTCACA 
TCAGTATATG GTTTTGCTAA TATGCCAAAG GCATTTTATT TAATGGGATA TAGTGCTATT
CCATGGTACA TAATATCAGC ATTAGCTTTC TCAATTCCAT ATGCTTTTAT GATGGCAGAA
TATGGTGCAG CATTTAGAAA GGAAAAAGGA GGAATATACT CATGGATGGC TAAATCTGTA
GGCCCTAAAT ATGCCTTCAT AGTTACATTT ATGTGGTATT CATCAAATTT AATATGGTTA
GTTAGTAACT CATCATCAAT ATGGATTCCT TTCTCAAATG TTATATATGG AAAAGATACT
ACAGGAACAT GGTCTTTACT TGGATTAAGC GTTCCACAAA CAATGGCTGT ATTAGGATCA
ATACTTATAA TAGTTATAAC TTATGCAGCA TCTAAAGGAT TAAAGAGTAT AACAAAGGTT
ACTTCAATTG GAGGAACTGT TGTTGCATTA TCAAACATAG TACTTATTTT AGGAGCAATT
ATTGTTTTTG TAAGCAATGG ATTTAAACCA GCTCAGATAA TAAATGGTGC TGCCTTTGTT
CATGCACAAA ATCCATCGTA TCAATCGCCA ATAGGTGTAT TAGGATTTTT AGTTTTTGCA
GTGTTTGCTT ATGGTGGACT AGAAGCAGTA AGCGGATTAG TTGATGAAAC TAAGGATGCT
AAAAAGAATT TTCCAAAGGG ATTAATGATT GCATCTATAA TAATAGCCGT TGGATATGCT
GTTGGAATAC TTTGTGTTGG ATTATTTACA AATTGGCAAG AAGTATTATC AGGGGATAAT
GTTAACTTAG CAAATGTATC ATATATAGTA ATTCAAAACT TAGGTGTAAA ATTAGGACAA
GCTTTTGGCA TGAGCACTAA TGCTTCACTT CAATTAGGAG CTTGGTTTGC AAGATATATA
GGTCTTTCAA TGATGTTAGC CTTAATGGGA GCTTTCTTTA CATTAAGTTA TGCACCAATT
AAACAATTAA TAGAAGGAAC ACCTAAAGAA ATCTGGCCAA AAAAATGGAC AATTTTAAAT
GAAAATAACA TGCCAGTTAA TGCAATGTGG GTTCAATGTA TAGTTGTTGT TATATTTATA
TTAATAGCTT CTTTTGGTGG AGAATCTGCA AATAAATTCT TTAGTTATTT AATATTAATG
GGTAATGTGG CCATGACAAT ACCATATATG TTCTTATCTG GAGCTTTTCC AGCATTTAAA
AAGAAAAAGC ATATAGAAAA ACCATTTATT ATGTATAAGT CATACAAATC ATCATTAATT
TGGTCTATAA TAGTAACATT TACAATTGGT TTTGCAAATT TATTTACAAT AATAAAGCCT
GTTATTCAAG ATAAGGATTA TACAGCTATG GCATTCCAAA TAACTGGACC GATAATTTTC
TCTATAATTG CTTTTATACT TTATCATAAT TATGAGAAGA AAATAAAAAA TAAATAA
 
Protein sequence
MSSEHKSKLS LTALVLMIFT SVYGFANMPK AFYLMGYSAI PWYIISALAF SIPYAFMMAE 
YGAAFRKEKG GIYSWMAKSV GPKYAFIVTF MWYSSNLIWL VSNSSSIWIP FSNVIYGKDT
TGTWSLLGLS VPQTMAVLGS ILIIVITYAA SKGLKSITKV TSIGGTVVAL SNIVLILGAI
IVFVSNGFKP AQIINGAAFV HAQNPSYQSP IGVLGFLVFA VFAYGGLEAV SGLVDETKDA
KKNFPKGLMI ASIIIAVGYA VGILCVGLFT NWQEVLSGDN VNLANVSYIV IQNLGVKLGQ
AFGMSTNASL QLGAWFARYI GLSMMLALMG AFFTLSYAPI KQLIEGTPKE IWPKKWTILN
ENNMPVNAMW VQCIVVVIFI LIASFGGESA NKFFSYLILM GNVAMTIPYM FLSGAFPAFK
KKKHIEKPFI MYKSYKSSLI WSIIVTFTIG FANLFTIIKP VIQDKDYTAM AFQITGPIIF
SIIAFILYHN YEKKIKNK