Gene CPR_0640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0640 
Symbol 
ID4206453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp761437 
End bp762747 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content25% 
IMG OID642565200 
ProductF5/8 type C domain-containing protein 
Protein accessionYP_697967 
Protein GI110801781 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA CTCTAGCAAA CCTAACTCCA ACTAAAGAAC AAAGAAAATA CCAAGAAGAT 
GAACTAATAG CTTTTTTAGA CCTCCAAATG AATACATTTA CAGCTAATTC TGAAAATAAT
GATGAAGCTC CTTCTTTATT AAAACAAGAG ACTTCATTTA ACACAGATAT ATGGATAGAA
AATTTATTTA AATTTGGATT TAAGAAAGTT ATTTTAACTG CTAAGCTTAA TAATGGTGTT
TGTTTATGGC AAAGCAATTG TTCAAAGGTT AATATAAATA CTTTTTCTTT AGAGAATAGT
TATGATGATT TAGTAAAAAA AGTTAGTAAA TCTTGTAAAA AGTTTGGACT TAAATTTGGT
ATTCATCTTA CTTTAAAAGA TTACCTCACT TTAAACTTAA CTCCTGAAGA ATACGATAAT
TACTACACGT CTCAATTAAA AGAACTTATT ACTAATTATT CACAAATAAG TGAAGTTATT
ATGGATATGA CTTCTTTAGA TGAATATTAT GACTCCCTTG ATTTTGAAAG ATACTTTAAA
GTAATTAAAG AAATAAATCC TAAATGTATC ATATCTAGCC CTATTGGACC AGATGTACGT
TGGACATATT ATTCTGATAT AGAGTCATCT AAAAACTATT TCTACTCTTC TATAAATTTA
AATCTTTTAA AAGAAGATTT TAACAATGAA GAAATTAGAA AAGGAAATAT TTATGGAGAC
ACTTGGATTG TTGGAGAAAG TATTTACTCC CTATCTGACT GCTTAAACCA TAATAAAGAT
TCATTATCAA ATTTAAAACA TGTGTATAAT AATTCATTAG GAAGAAATAC AAACTTGGTA
TTAGTTTTAT CTCCTAATAC AGATGGATTA TTAAATCATA ATGAATTAAG TTTACTTTCT
GATTTTTCTA AATATATAAA AGAAACTTTT TCTAATAACC TTATTAAGGG TTCTTCTATA
TTAGCTACAA ATTCATCTTC TAGCGATAGC TATAATTTAA TTGATGACTA CAAAAAATCT
TATTGGATAG CTAATGAAAA TGCTGTTAAC CCTTATATAG AAATAGATTT TAAAACTATC
ACTGAATTTA ATATTTTAGA AATTAGAGAG TGGATTGCTG AAGGTCAAAA CGTAGAAGAA
TTTAAAGTTT ATGCATATAA CAATAGTTGG TTTGAACTTT ATAATGGTAC TTCTATTGGA
TATAGACATA TAGCAAAACT TAATAATATC AAAACTGATA AAATTAAAAT TTCATTTACT
AAATATAAAA ACCCACCTAT GATTAATCAT ATTGGTGCAT ATTTAGGATA A
 
Protein sequence
MNTTLANLTP TKEQRKYQED ELIAFLDLQM NTFTANSENN DEAPSLLKQE TSFNTDIWIE 
NLFKFGFKKV ILTAKLNNGV CLWQSNCSKV NINTFSLENS YDDLVKKVSK SCKKFGLKFG
IHLTLKDYLT LNLTPEEYDN YYTSQLKELI TNYSQISEVI MDMTSLDEYY DSLDFERYFK
VIKEINPKCI ISSPIGPDVR WTYYSDIESS KNYFYSSINL NLLKEDFNNE EIRKGNIYGD
TWIVGESIYS LSDCLNHNKD SLSNLKHVYN NSLGRNTNLV LVLSPNTDGL LNHNELSLLS
DFSKYIKETF SNNLIKGSSI LATNSSSSDS YNLIDDYKKS YWIANENAVN PYIEIDFKTI
TEFNILEIRE WIAEGQNVEE FKVYAYNNSW FELYNGTSIG YRHIAKLNNI KTDKIKISFT
KYKNPPMINH IGAYLG