Gene CPR_1183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1183 
Symbol 
ID4204070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1333458 
End bp1334561 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content30% 
IMG OID642565739 
Productsodium:dicarboxylate symporter family protein 
Protein accessionYP_698505 
Protein GI110801653 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value8.17553e-09 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATT TATCCTTAAT AAAAAGAATA TTTGTTGCAA TTATTTTAGG AATACTTATT 
GGGCTAGGAT GTTCCTATAT TAATTTAGAT ATACCTATTA GAATATTAAT GACCTTTAAT
AGCATATTTG GGAATTTACT AAGTTTCTTA ATTCCACTTA TAATAGTTGG GTTTATAGTT
CCTGGTATAG CATCCTTAGG AAATAAATCA GGAAAAGGAC TTTTCATAAC TACTTTAATT
TCATATGCTT CAACATTTTT AATAGGAATA CTTACTTTCT TTATAGGACG CGCAGTACTT
CCTAAATTTA TAGTAAGTGC TTCTCTAAGC ACTGGATCAG TAAATGTTGA TCCTTATTTT
ACAATTGATA TTCCTCCAAT GTTTGGTGTT ATGTCAGCTT TAGTTTTTGC ATTCTTATTA
GGAATAGGAA TATCAAGAAT AAAAAATAGT TACTTATTAA AAGTATCAGA AGAATTTAAT
CACGTTATTT CATTAACTAT AAAAAATGTG TTAATACCTT TAGTACCTAT TTACATACTT
TCAATATTTT CAAAGTTAAG TTATAATGGT GAGATTTTTA CTACTTTAAA GTCTTTTGGA
CTTGTGTACT TAGTTTTATT TTCAATACAA GGATCTTATT TAGTGGTTCA ATATGCTTTA
GCTGGAACTT TAAAGAAAGA AAATCCATTA AAATTACTTA AAAATATGAT TCCTGCATAT
ATGACAGCTT TGGGAACTCA ATCATCAGCA GCTACAATCC CAGTTACTTT AAACTGTACT
AAGGAAAATA AAGTTGACCA AGATGTAGCA GACTTTGTTA TTCCTTTAGG AGCAACAATA
AATTTAGCAG GTGATACTAT TACTCTAGTT CTTGCATCAA TGTCTGTAAT ATATATGAAA
GGACAAGTTC CAACTTTCTC TATTATGGTT CCATTTATAA TTATGTTAGG AGTAACTATG
GTAGCAGCAC CAGGGGTACC AGGTGGCGGA GTTATGGCTG CTTTAGGATT ACTTGAAGGT
ATGCTTGGAT TTGGTAATAT TGAAAAATCC TTAATGATAG CACTTCATGC TGCTCAAGAT
AGTTTGGAAC AGCAACTAAT GTAA
 
Protein sequence
MKNLSLIKRI FVAIILGILI GLGCSYINLD IPIRILMTFN SIFGNLLSFL IPLIIVGFIV 
PGIASLGNKS GKGLFITTLI SYASTFLIGI LTFFIGRAVL PKFIVSASLS TGSVNVDPYF
TIDIPPMFGV MSALVFAFLL GIGISRIKNS YLLKVSEEFN HVISLTIKNV LIPLVPIYIL
SIFSKLSYNG EIFTTLKSFG LVYLVLFSIQ GSYLVVQYAL AGTLKKENPL KLLKNMIPAY
MTALGTQSSA ATIPVTLNCT KENKVDQDVA DFVIPLGATI NLAGDTITLV LASMSVIYMK
GQVPTFSIMV PFIIMLGVTM VAAPGVPGGG VMAALGLLEG MLGFGNIEKS LMIALHAAQD
SLEQQLM