Gene CPF_0096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0096 
Symbol 
ID4201115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp115780 
End bp117213 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content31% 
IMG OID638080977 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_694560 
Protein GI110798992 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACATG TAATAAAAAA GGGCGAAAAT GCACAAGAGG TTAAATATAA TAAATGGGCT 
ATATTATTTT GTGTAGTTAC TATGACCTTC ATGTCTTGCA TAGATGGTAG CATAGTAAAT
GTAGCTTTGC CTAAAATTTC ATATGATTTA AATAAGCCTA TATTAGATAC TCAATGGATT
GTAACTACTT ATCTTATGGT TATATCAGCC TTAGTATTAC TTTGTGGAAG AATTGGTGAC
ATAAAGGGTA AATGTAAGGT TTTTAAAGTC GGCGTTTTAA TTTTTACAAT AGGTTCATTC
TTTAGTGGGT TATCAAAGAC ATTGCCTCTT CTTATAGTGT CTAGAGCAAT TCAAGGGGTT
GGAGGCTCCT GTGCAATGGC AACTGGTATG GGAATAATAA CAGCTTTCTT TAATGAAAAA
GAGAGAGGAA AGGCAATGGG GTTATCAGCT AGTGCAGTAG CTATGGGAGT AATGGTTGGA
CCTGCATTAG GTGGTATTTT AGTCTCTATA AGATGGGATC TTATATTTTG GATTAATGTA
CCTATTGGAA TAATTGCTTT TTTACTTTCA ATGGTTTATC TTCCTAAGAT GGAGACAAAT
TCAAAGGAAA AGATAGATAT AAGAGGAACA ATAGTATTTG CAATATTTAT AGTATCAAGC
ATGTTATCTA TAACAAAAGG GGAAGTTTTA GGTTATACAG ATAAATATAT AATGTTAGGA
TTTATAGTAT CTATAGTTTC TTTTACAGTA TTTATATATT TGCAAAAGAC AGTGGAATCA
CCAGTTTTAG ATTTAAATTT ATTTAAGACT AAACTATTTT CTCTAAGTAT AGTGTGTTCA
GCACTTTCAT TTTTAGCTAT AAGCAGCATG AATATAATAA TTCCACTTTA TTTAGAACAA
GCATTACAAA TGAGTTCTTT ACATGCAGGA TTATTTTTAA TGATTTACCC ACTATGTCTA
TCAATAGTTG CTCCACTTAG TAGTTCTCTT TCTGATAGAT TTAATGGAAG ATTGATTTCA
TTAATAGGAA TATCTTTATT AACAGTAGCT CTTTTCTTTA TGGGAAGAAT AAATATAAAT
AGTACTTTAA TTTATTTAGG AACTTGCTGT GCTATTATGG GGGTTGGCAA TGGAATATTT
AAATCAACAA ACAACGCTCT TGTTATGGAA AAAGTGCCTA AGCATAGACT AGGTATTGCA
GGAAGTGTTA ACTCATTAGT TTCAAATTTA GCTATGGCCT ATGGGTTTAC ATTTGCAACA
ACAATCTTAT ATGGAAGAAT GAGTTATAGA CTAGGGTATA AGGTCACAAA TTATATACCA
GGCCATGAAG AGGCATTTTT ATATGGATTA GATTGTGTAT TTTTTTCAGC TACAATAATG
TGTTTAATCG CATCAGTAGT AGCATTTATT AGATATAGAA GAAAAGATAT ATAG
 
Protein sequence
MEHVIKKGEN AQEVKYNKWA ILFCVVTMTF MSCIDGSIVN VALPKISYDL NKPILDTQWI 
VTTYLMVISA LVLLCGRIGD IKGKCKVFKV GVLIFTIGSF FSGLSKTLPL LIVSRAIQGV
GGSCAMATGM GIITAFFNEK ERGKAMGLSA SAVAMGVMVG PALGGILVSI RWDLIFWINV
PIGIIAFLLS MVYLPKMETN SKEKIDIRGT IVFAIFIVSS MLSITKGEVL GYTDKYIMLG
FIVSIVSFTV FIYLQKTVES PVLDLNLFKT KLFSLSIVCS ALSFLAISSM NIIIPLYLEQ
ALQMSSLHAG LFLMIYPLCL SIVAPLSSSL SDRFNGRLIS LIGISLLTVA LFFMGRININ
STLIYLGTCC AIMGVGNGIF KSTNNALVME KVPKHRLGIA GSVNSLVSNL AMAYGFTFAT
TILYGRMSYR LGYKVTNYIP GHEEAFLYGL DCVFFSATIM CLIASVVAFI RYRRKDI