Gene CPF_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1052 
SymbolfucP 
ID4202981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1200057 
End bp1201391 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content32% 
IMG OID638081933 
ProductL-fucose permease 
Protein accessionYP_695498 
Protein GI110800204 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID[TIGR00885] L-fucose:H+ symporter permease 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.682471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAAAA TTGGTGCTAT TGAAAAAACA AAAACTAAGG AAGAAACAGC TATTGTACCT 
AAAAAGTATA GAATGCATTT TATAATGCTT ATATCATGTT TTGTACTTTG GGGTCTTTTA
AATAATATGA CTGATAATCT TGTTCCTGCC TTTGGAAAGA TATTTATGTT AGAAGCTGCT
GACTCGTCTT TAGTACAAGT AGCTTTCTAT GGATCGTATG CAGTACTTGC CTTACCGGCA
GCAATACTTA TTAAAAAATA TTCCTATAGA AATGGAGTAT TAGTAGGTCT TGGATTATAT
ATAATTGGAG CAATGGGATA TATCCCAGCT GCCATGCTAC AAAATTTTAA TTTATTCTTA
GTATCAGTTT TTGTTTTAGC AGGGGGACTT TCAATACTAG AAACTACTTG TAATCCATAT
GTAATTTCTT TAGGGAGTGA AGAGACTAGT GTTCGTCGTT TAAATTTAGC ACAAGCCTTT
AACCCTATTG GCTCATTAGC TGGTATTATA ATGGCTAAAT ATATAATATT AGGGAATTTA
CATCCAGCAA CTTATGAAGA GAGAGTTGCT ATGGGATCAG AGGCTTTAAG TAAAATACAA
AACAATGAAT TAATATGGGT ATGTGTACCA TATGTTAGTT TAGTAGTTAT AGCTATAATA
ATTTGGTGCT TCTTTAAGAG AAGTAAAGGT TCAGAAAAAG ATAATTCAGG AGAACTTAAT
ATAATTGAAT CAATAAAGAA GTTAGTAAAA ATTCCACGTT ATGCCTTTGG AGTAATCACA
CAGTTTTTCT ATGTTGGAGT TCAAATAGCT GTTTGGACTT GGACTATAAA ATATGTAATG
GTTACTGTGG GGATAGATGA AGCATCTGCT GCTAAATATT ATCTTATAGC TATGTTTGGA
TTTATAGCAT GTCGTTGGAT TTGTACAGCA CTTATGAAAT ATATAGAACC AGGTATTATG
ATGGCAGTGT TTGCAGTACT TGGAATATTA TGTAGCTTAG GAGCAATTTA TTTACCTACA
AATTTATCAG TTTGGTCTTT AGTTTTAATT TCATCATGTA TGTCATTAAT GTTCCCAACA
ATTTATGGTA TAGCACTTGA AGGGTTAGGA AAAGAAGTAA AGGTTGGTGC AGCAGGTCTT
ATAATGGCAA TATTAGGTGG TGCAGTTATA ACACCTATAA TGGGATTATT TATTGATAGT
GGAAAATTAT CAAGTTTAGT TACTTCATAT CAAGGTGCAG AAGCTGCAGT TCGTTCAGCA
TTCTTTATAC CTGTAGTTTG TTTTGCAGTT GTTCTTATTT ATTCATTATG TTTTAGAAAG
AAAAAAGTAG CATAA
 
Protein sequence
MEKIGAIEKT KTKEETAIVP KKYRMHFIML ISCFVLWGLL NNMTDNLVPA FGKIFMLEAA 
DSSLVQVAFY GSYAVLALPA AILIKKYSYR NGVLVGLGLY IIGAMGYIPA AMLQNFNLFL
VSVFVLAGGL SILETTCNPY VISLGSEETS VRRLNLAQAF NPIGSLAGII MAKYIILGNL
HPATYEERVA MGSEALSKIQ NNELIWVCVP YVSLVVIAII IWCFFKRSKG SEKDNSGELN
IIESIKKLVK IPRYAFGVIT QFFYVGVQIA VWTWTIKYVM VTVGIDEASA AKYYLIAMFG
FIACRWICTA LMKYIEPGIM MAVFAVLGIL CSLGAIYLPT NLSVWSLVLI SSCMSLMFPT
IYGIALEGLG KEVKVGAAGL IMAILGGAVI TPIMGLFIDS GKLSSLVTSY QGAEAAVRSA
FFIPVVCFAV VLIYSLCFRK KKVA