Gene CPR_1574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1574 
Symbol 
ID4206383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1764529 
End bp1766226 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content30% 
IMG OID642566125 
Productmajor facilitator transporter 
Protein accessionYP_698890 
Protein GI110803512 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000345614 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA AATCAGTTGG AATTACAATG GCTGTCTTCT TGTTAGGAAT CTTCATGGGT 
GCTATTGACT CTGGTATAGT TTCACCAGCA AGAGATATAA TAGCTAATGG ATTAAAAGTT
TCACAAAATG CTAGTGTCTG GGTTGTAACA ATATATACCT TAGCATATGC AGTATCAATG
CCTCTTATAG GAAAACTTTC TGATAAATAT GGTAGAAAAA AAATTTACAT GGTTTCAATA
ACTCTATTTG GACTAGGTTC TTTACTATGT GGAATATCAG ATTATGTAAA TAGTTATACA
TTCTTATTAT TTTCAAGAGT TATAGAAGCA GTAGGTGGCG GAGGTATAAT GCCAATAGCT
ACAGCGTACA TAGGAACATC ATTCCCAGTT GAAAAAAGAG GTTCAGCGCT AGGAATGATT
GGAGGGGTAT ATGGAATAGC AACAGTTGTA GGACCAACCT TAGGTTCAGG AATACTTTCT
ATCTTTGGAG ATAAAAACTG GGGATTTTTA TTCTTAGTAA ATGTTCCAAT CAGTATAATA
ATATTACTTA TGGCAACTAA ATTAGAAGAA AATACTTCTG CACAAGGAAT TAAGAAATTA
GATGTTTGTG GTTCAGGAGT ATTAACAATA TTAATTTTAT CTTTAATGTA TGGAGCTACA
AACTTAAAAT TCTATGATTT TGCTAATTCA ATAAAATCAC TAGATGTTTG GCCATATCTT
TTAATATTTA TAATATCAAT TCCAATATTA GTTTTGGTGG AAAAGAAAGC AGAAGATCCA
GTAATAAACT TATCTTACTT TACTAATAAG GAAATAGCTA TAACTTTAAT ATTAAGTTTT
GTTGTAGGTT GTGGATTAAT GGCAACAGTA TTTATTCCTC AATTTAGTGA AAATATATTA
AGAACACCAA TGGGTAGTGG TGGATATATA GTTACAATAT TTGCAATATT TGTAGGTATA
GCAGCACCTT TAGGTGGAAA ATTCATAGAT AAAATAGGAG TTAAAAAAGT ACTATTAATA
GGTATGTCTT TAGTTATAAT AGGTAACCTT TATCAAGGAT ACGTAACAAC TAAACACCCA
GGTATGGTTA ACTTAATAAT AGGTTTAGCT ATTATGGGAT TTGGCTTAGG ATTCTCTATG
GGAACACCAA TAAATTACTT AATGCTTAGT TTAGTACCAG ATAATGAGGC TACAGTTGGA
CAATCAGCAG TATCATTAAT TAAATCCATA GGTATTGCAG TATCACCAAA TATTCTTATT
AACTTTATAT CAGATGCAGG TAGAAGAGTA CCAGGAGCAT TACAAAAAGT TATGCCACAT
ATAGATGGAA TGTCTAATAT TATGTCAAAT AGTGGTGGTG CTTCAAATTT TAATAATTCA
ATGGCAAATG CCAGTGTTAC TAATATATTT AGTCTTATAA AAGAAATGGT ACAATCACAA
TTTGCAGCTT TAGGAGATAA GTTTTCAAAT AATCCTCATA TGAATATTGG TATGATTGAA
AAATCATATA TGCAAAGTTT AGATGGAGCT AAAGGTGCAA TAGAAACAGC CTTCCAAAAA
ACTATGAATA CAGGGTACAC TAAATTATTC GTAACATGTG CTATTATAGC TTTAATAGGG
CTAATATTAA CAGCTATGTT AAATAATAAT TTAATAACAA TGAAAAATAG AAGATTAGAA
AAGAAGGAAA AAAACTAA
 
Protein sequence
MKKKSVGITM AVFLLGIFMG AIDSGIVSPA RDIIANGLKV SQNASVWVVT IYTLAYAVSM 
PLIGKLSDKY GRKKIYMVSI TLFGLGSLLC GISDYVNSYT FLLFSRVIEA VGGGGIMPIA
TAYIGTSFPV EKRGSALGMI GGVYGIATVV GPTLGSGILS IFGDKNWGFL FLVNVPISII
ILLMATKLEE NTSAQGIKKL DVCGSGVLTI LILSLMYGAT NLKFYDFANS IKSLDVWPYL
LIFIISIPIL VLVEKKAEDP VINLSYFTNK EIAITLILSF VVGCGLMATV FIPQFSENIL
RTPMGSGGYI VTIFAIFVGI AAPLGGKFID KIGVKKVLLI GMSLVIIGNL YQGYVTTKHP
GMVNLIIGLA IMGFGLGFSM GTPINYLMLS LVPDNEATVG QSAVSLIKSI GIAVSPNILI
NFISDAGRRV PGALQKVMPH IDGMSNIMSN SGGASNFNNS MANASVTNIF SLIKEMVQSQ
FAALGDKFSN NPHMNIGMIE KSYMQSLDGA KGAIETAFQK TMNTGYTKLF VTCAIIALIG
LILTAMLNNN LITMKNRRLE KKEKN