Gene CPR_1843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1843 
Symbol 
ID4206239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2035057 
End bp2037309 
Gene Length2253 bp 
Protein Length750 aa 
Translation table11 
GC content32% 
IMG OID642566393 
ProductFucA 
Protein accessionYP_699157 
Protein GI110803677 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTA AAGCTATAAA AAGTTTTTTA ATAACTACAA TGGCAATGAC TTTTTTAGTT 
AGTATGGGAC AAGGATCTAT ATTAACTAAG GCAAATACTA TAAATAATAT GGAATATGTA
ACAGAAGTAT TTGGAGCAAT TCCAACAGAA GAACAGGTTA GCTATCAAAA AGAGGAACTT
ACAGCATTTA TACACTTTGG TGTGAATACC TTTACGGGAA GGGAGTGGGG AGATGGAACA
GAAAATCCAG AAATTTTTAA CCCTACTAAT TTAGATGCTG ATCAATGGGT AAAAACTTTG
GCGGAGGCAG GCTTTGGAAG GGTTATATTA ACTGCAAAAC ATCATGATGG ATTTTGTTTG
TGGGATAGTG CATATACAAA GCATGATGTA GCTAGTAGTC CTTGGAAAAA TGGAAAGGGA
GATGTGGTTA AAGAGGTTTT AGAAGCTTGT GCTAAATATA ATATAAAATT CGGAGTTTAT
TTGTCACCAT GGGACCAAAA TTCAGAACAT TATGGAGACG GCAATGGTGG TGATTATAAT
GAGTTTTACA TGAATCAGTT AAGGGAATTA TTAACTAACT ATGGACCTAT AGCTGAAGTT
TGGATGGATG GAGCAAAGGG ATCGAATGTT AAGCAAGAAT ATAACTTTGA AGAATGGTTT
GCTCTTATAA AAAAATTACA ACCAGAATGT TTAATATTTA GTCCACAGGG GCCAGATATT
AGATGGATTG GAAATGAAAA AGGATATGCA GGAGAACCTT GCTGGTCAAC TATAGATATT
GAAAAAATGA AAGAAAGGGA AAATCCAACA TATTTGAATA ATGGAGAAGA AGGTGGACCA
AATTGGATAG TTGGAGAATC AGATGTATCC ATAAGACCAG GTTGGTTTTA CCATGAATCT
CAAGATAATG AAGTTAAATC CTTAGAAAAA ATGATGGATA TTTATTTTAA ATCCATAGGG
AGAAATTCAG TTTTATTATT AAATGTTCCC CCTAATAAGG AAGGAAAGTT ACATGAAAAT
GATGTTAATA GATTAAAGAA ATTTGGTGAA ACAATTAAAG AGTTATTTAA TGATGATTTA
GCTTTAAATA AAGAGGTAAT AGTAGATGGT TTTGCTAATA ATGATGAGAC ATATGGTGCA
AATAAAATTG TAGATGGAGA TTATGATACC TATTGGGCAC CAGATAATAG TAGTAAAACA
GGTACTATTG AAATAGATTT AGGTGGAAGT AAAGAATTTG ATGTTATTTC TTTGCAAGAG
TATATACCAT TAGGTCAAAG AGTATCCAGT TTTAATGTAG AAGTATTGCA AGGAGAAAAT
TGGAATAAGG TTTATGAAGG AAAAACAATA GGATATAAAA GACTTGTTAG AATAGCTCCA
ACTAAAGGAG AAAAAATAAG AATTAATATA ACGGGTTCAT TAGAAGTACC ACTTATAAAT
AACGTTGGAG TTTATAAACA ACCTATTAGT ATAGAACTTC CATCAGGGCC ACCAGCTGGG
TTGAAAGTAT TAAATGATGA TAATAAGGGA AATGAATTAG AACAATTTAA TTTTAGTGAT
GGATGGATAT ATGAGACTAT CCATGGAGAA GATGATTTAG GTGGAGATGC CCATTATACA
AGTAAAATTA ATGCTACAGT TAATATTAAA TTCAAGGGAA CTAAGTTTTT CTTATCAGGA
ACAAAGGATT CAGGACATGG AATAATGGAA ATTTCAATAG ATGGTGAAAA TCCAGTAGAG
GTTGACTTAT ATTCTCCTAA TAGAAAATCT AAAGAGATAG TTTTTGAAAG TGAAGATTTA
AGTGATGGAG AGCATGAAGT TACGGTTAAA TGTACTGGAA GAAAGAATTC AAATTCTAGA
GGGATAGTGG CTCATATAGA TGGAGCTTAT GTATTAGACA ATGGTGGAAA AGGTATGGTT
GAATTTGAAA AGGTAGGATA CAAAGTAAGT GAAAATATAG GAACTGCTAC TTTTAAGGTT
ATAAGAAAAG GAGGAAGTAA TGGTAAGCTT GAAGTTAACT ATGATACTTT AGCTGGCACT
GCTTTAAATG GAGTTGATTA TCAAACATGG TCTGGTACTT TAGCATTTAA TGAAGGAGAA
ACAGAAAAAA CTTTTGATAT AACAATAATT GATGATAAGG AAAAAGAAGA GCCTAAGGAA
TTCTATTTAA AATTAAGTGA TCCAATAGGT GGAATATTAG GATTTAATTC AAGAGCTACA
GTTATTATTA ATGATGATGA GCAAATTAAA TAA
 
Protein sequence
MKVKAIKSFL ITTMAMTFLV SMGQGSILTK ANTINNMEYV TEVFGAIPTE EQVSYQKEEL 
TAFIHFGVNT FTGREWGDGT ENPEIFNPTN LDADQWVKTL AEAGFGRVIL TAKHHDGFCL
WDSAYTKHDV ASSPWKNGKG DVVKEVLEAC AKYNIKFGVY LSPWDQNSEH YGDGNGGDYN
EFYMNQLREL LTNYGPIAEV WMDGAKGSNV KQEYNFEEWF ALIKKLQPEC LIFSPQGPDI
RWIGNEKGYA GEPCWSTIDI EKMKERENPT YLNNGEEGGP NWIVGESDVS IRPGWFYHES
QDNEVKSLEK MMDIYFKSIG RNSVLLLNVP PNKEGKLHEN DVNRLKKFGE TIKELFNDDL
ALNKEVIVDG FANNDETYGA NKIVDGDYDT YWAPDNSSKT GTIEIDLGGS KEFDVISLQE
YIPLGQRVSS FNVEVLQGEN WNKVYEGKTI GYKRLVRIAP TKGEKIRINI TGSLEVPLIN
NVGVYKQPIS IELPSGPPAG LKVLNDDNKG NELEQFNFSD GWIYETIHGE DDLGGDAHYT
SKINATVNIK FKGTKFFLSG TKDSGHGIME ISIDGENPVE VDLYSPNRKS KEIVFESEDL
SDGEHEVTVK CTGRKNSNSR GIVAHIDGAY VLDNGGKGMV EFEKVGYKVS ENIGTATFKV
IRKGGSNGKL EVNYDTLAGT ALNGVDYQTW SGTLAFNEGE TEKTFDITII DDKEKEEPKE
FYLKLSDPIG GILGFNSRAT VIINDDEQIK