Gene CPF_1028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1028 
Symbol 
ID4201164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1179642 
End bp1180868 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content34% 
IMG OID638081909 
Productmultidrug resistance protein 
Protein accessionYP_695474 
Protein GI110799410 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0302699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAAT TTAAATCATA TAATAAAGAA AAAAATGAAG AAACATTGGA TAAGAAGGCT 
TTCATCTTTG GTCTTATGTC TGTATTTCTT TGTGGAATGG GCTTTAGTAT TATATCCCCT
GTTGTTCCAT TTTTAGTGGA GCCTTATGTA AGCAATACTA GTGAACAAGC TTTCTTCGTT
ACTCTACTAA CCTCAGTTTA TGCAGTTTGT GTATTTTTTG TAGCTCCTGG ACTTGGTGCT
TTAAGTGATA GATATGGACG TCGCCCCATA CTTTTAATAT GCCTTTTAGG TTCCTCAATT
GGATACTTAA TCTTTGGTAT AGGTGGCTCT ATATGGGTAC TATTTCTTGG ACGTATAATA
GATGGTGTAA CAGGTGGGAG CATAAGCACA ATTTTTGCAT ATTTTGCAGA TATAACTCCT
AAGGAAGAAA GGACTAAATA CTTTGGATGG ATAAGTGCAA GTGCAGGTAT AGGTGCTGCC
ATTGGCCCTA CTCTAGGTGG AGCGCTTGCC AAATTTGGCT ATGCTGTGCC AATGTATTTT
GGAGCAATAA TAACCTTATT AAACTTTATT TATGGAATCT TATATATGCC TGAAAGTCTT
CATGAAAATA ATAAGCTTAA GAAAATCACC CTTGTAAGAC TTAATCCATT TACACAGCTT
ATGAGTGTAC TTTCTATGAA AAACTTAAAA AGACTACTTA TTTCAGCCTT CTTAATTTGG
ATACCTAATG GATCTTTACA ATCAATTTTT TCACTATTTA CAATGGATAC TTTCAATTGG
ACACCTACAT TAATAGGACT TATGTTTTCA ATTATGGGTA TTCAAGATAT TATTTCACAG
GGCTTAATAA TGCCAAAGCT TTTAATGAAA CTTAGTGATG TAAAGATAGC AATCCTTGGA
ATGGTCTCTG AGATTATAGG ATATGCTCTT ATTGCAGCAT CAGCTATTTT CACATTCTAT
CCTTTTTTCA TAGTTGGCAT GTTTATATTT GGTTTTGGAG ATTCAATTTT TGGTCCCTCA
TTTAATGGAA TGCTCTCTAA GTCTGCTAAT TCTAGTGAAC AAGGAAGGAT TCAAGGAGGT
AGCCAAGCTC TTCAATCTCT AGCAAGAATA ATTGGCCCTA TTTTAGGAGG ACAAATCTAT
GTATCTCTAG GTCATTCCTC CCCTGCTTTT ACGGGTATGA TTCTAATAAT ATTGGCCATA
CCAATTTTGT ATAAGAGTAT TAGATAG
 
Protein sequence
MTKFKSYNKE KNEETLDKKA FIFGLMSVFL CGMGFSIISP VVPFLVEPYV SNTSEQAFFV 
TLLTSVYAVC VFFVAPGLGA LSDRYGRRPI LLICLLGSSI GYLIFGIGGS IWVLFLGRII
DGVTGGSIST IFAYFADITP KEERTKYFGW ISASAGIGAA IGPTLGGALA KFGYAVPMYF
GAIITLLNFI YGILYMPESL HENNKLKKIT LVRLNPFTQL MSVLSMKNLK RLLISAFLIW
IPNGSLQSIF SLFTMDTFNW TPTLIGLMFS IMGIQDIISQ GLIMPKLLMK LSDVKIAILG
MVSEIIGYAL IAASAIFTFY PFFIVGMFIF GFGDSIFGPS FNGMLSKSAN SSEQGRIQGG
SQALQSLARI IGPILGGQIY VSLGHSSPAF TGMILIILAI PILYKSIR