Gene CPF_1857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1857 
Symbol 
ID4201310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2092764 
End bp2094464 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content31% 
IMG OID638082727 
Productmajor facilitator transporter 
Protein accessionYP_696291 
Protein GI110798704 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.906774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA AATCAGTTGG AATTACCATG GCTGTCTTCT TGTTAGGAAT CTTCATGGGA 
GCTATTGACT CTGGTATAGT TTCACCAGCA AGAGATATAA TAGCTGATGG ATTAAAAGTT
TCACAAAATG CTAGTGTTTG GGTTGTAACA ATATACACCT TAGCATATGC AGTATCAATG
CCTCTTATGG GAAAACTTTC TGATAAGTAT GGTAGAAAAA AAGTTTACAT GGTTTCAATA
ACTCTATTTG GACTAGGTTC TTTACTATGT GGAGTATCAG ATTACGTAAA TAGTTATACA
TTCTTATTAT TCTCAAGGGT TATAGAAGCA ATAGGTGGCG GAGGTATAAT GCCAATAGCT
ACAGCATACA TAGGAACATC ATTCCCCGTT GAAAAAAGAG GTTCAGCGCT AGGAATGATT
GGAGGAGTAT ATGGAATAGC AACAGTTGTA GGACCAACCT TAGGTTCAGG AATACTTTCT
ATTTTTGGAG ATAAAAACTG GGGATTTTTA TTCTTAGTAA ATGTTCCAAT CAGTATAATA
ATATTACTTA TGGCAACTAA ATTAGAAGAA AATACTTCTG CACAAGGAAT TAAGAAATTA
GATGTTTGTG GTTCAGGGGT ATTAACAATA TTAATTTTAT CTTTAATGTA TGGAGCTACA
AACTTAAAAT TCTATGAGTT TGCTAATTCA ATAAAATCAC TAGATGTTTG GCCATATCTT
TTAATATTTA TAATATCAAT TCCAATATTA GTTTGGGTGG AAAAGAAAGC AGAAGATCCA
GTAATAAACT TATCTTACTT TACTAATAAG GAAATAGCTA TAACTTTAAT ATTAAGCTTT
GTTGTAGGTT GTGGATTAAT GGCAACAGTA TTTATTCCTC AATTTAGTGA AAATATATTA
AGAACACCAA TGGGTAGTGG TGGATATATA GTTACAATAT TTGCAATATT TGTAGGTATA
GCAGCACCTT TAGGTGGAAA ATTCATAGAT AAAATAGGAG TTAAAAAAGT ACTATTAATA
GGTATGTCTT TAGTTATCAT AGGTAACCTT TATCAAGGAT ATGTAACAAC TAAACATCCA
GGTATGGTTA ACTTAATAAT AGGTTTAGCT ATTATGGGAT TTGGTTTAGG ATTCTCTATG
GGAACACCAA TAAATTACTT AATGCTTAGC TTAGTACCAG ATAATGAGGC TACAGTTGGA
CAATCAGCAG TATCATTAAT TAAATCCATA GGTATTGCAG TATCACCAAA TATTCTCATT
AACTTTATAT CAGATGCAGG TAGAAGAGTA CCAGAAGCAT TACAAAAAGT TATGCCACAT
GTAGATGGAA TGTCTAATAT TATGTCAAAT AGTGGTGGAG CTTCAAATGT TGCTAATTCA
ATGGGAAATG CCAGTGTTAC TAATATATTT AGTCTTATAA AAGGAATGGC ACAATCACAA
TTTGCAGCTT TAGGAGACAA ATTTGCAAAT AATCCTCATA TGAATATTGA TATGATTGAA
AAATCATATA TGCAAAGTTT AGATGGAGCT AAAGATGCAA TAGAAACTGC ATTCCAACAA
ACTATGAATA CAGGATATAC TAAATTATTC TTAACATGTG CTATTATAGC TTTAATAGGG
TTAATATTAA CAGCTATGTT AAATAACAAT TTAATAACAA TGAAAAATAG AAGATTAGAA
AAGAAGGAAA AAACTAATTG A
 
Protein sequence
MKKKSVGITM AVFLLGIFMG AIDSGIVSPA RDIIADGLKV SQNASVWVVT IYTLAYAVSM 
PLMGKLSDKY GRKKVYMVSI TLFGLGSLLC GVSDYVNSYT FLLFSRVIEA IGGGGIMPIA
TAYIGTSFPV EKRGSALGMI GGVYGIATVV GPTLGSGILS IFGDKNWGFL FLVNVPISII
ILLMATKLEE NTSAQGIKKL DVCGSGVLTI LILSLMYGAT NLKFYEFANS IKSLDVWPYL
LIFIISIPIL VWVEKKAEDP VINLSYFTNK EIAITLILSF VVGCGLMATV FIPQFSENIL
RTPMGSGGYI VTIFAIFVGI AAPLGGKFID KIGVKKVLLI GMSLVIIGNL YQGYVTTKHP
GMVNLIIGLA IMGFGLGFSM GTPINYLMLS LVPDNEATVG QSAVSLIKSI GIAVSPNILI
NFISDAGRRV PEALQKVMPH VDGMSNIMSN SGGASNVANS MGNASVTNIF SLIKGMAQSQ
FAALGDKFAN NPHMNIDMIE KSYMQSLDGA KDAIETAFQQ TMNTGYTKLF LTCAIIALIG
LILTAMLNNN LITMKNRRLE KKEKTN