Gene Cpin_4241 details

Gene Information

Locus tag: Cpin_4241
Symbol:
ID: 8360414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 5302940
End bp: 5304607
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 44%
IMG OID: 644966408
Product: glycoside hydrolase family 29 (alpha-L-fucosidase)
Protein accession: YP_003123896
Protein GI: 256423243
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3669] Alpha-L-fucosidase
TIGRFAM ID:
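
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and that the last codon
    is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(5302940, 5304607)
print(gene_len, protein_len)  # 1668 555
```

Under those assumptions the result matches the listed gene length (1668 bp) and protein length (555 aa).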


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.525329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.717793
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGAA GAACAGTAAT AAAACAATTG GGGAAGGCGT TACCAGCTAC GTTTTTGTCA 
AAGAAACTTT CAGCTATATC TGCTATCAGC TTAAATATAG GGGAGAAAAT TATTGATGGA
CCATTTGCAC CTACGTGGGA ATCTTTAGCA CAATTTAAGA CACCGGAATG GTATCGTGAC
GCCAAATTTG GTATCTGGGC GCATTGGGGG CCACAGTGCC AGCCGGAGAA TGGTGATTGG
TATGCCCGTG AGATGTATAT GGAAGGGAAT CATCATTATA ATTTCCATCT GAAGAAATAT
GGCCATCCTT CAAAATTTGG GTTTAAGGAT GTGATAAATG ACTGGAAGGC TGAGCAATGG
AATCCTGAAG AGTTATTATC ACTATATCGA AGAGCTGGTG CCAAATATTT TGTAGCATTA
GCTAACCATC ATGACAATAT GGACTTATAT GACAGTAAGT ATCAGCCAGA ATGGAATAGT
GTCAAAATGG GGCCTAAAAG AGACCTGATC TCAGGATGGG CGAAAGCTGC CAGAAATCAA
GGTCTCCATT TTGGAGTCAG TGTACATGCT GCACATGCCT GGAGCTGGAT GGAGACAGCT
CAGCGGGCCG ATACAGAAGG TCCTTATGCC GGTGTACCTT ATGACGGCAA ATTGCGAAAG
TCTGAAGGCA AAAGCAAGTG GTGGAATGGC CTTGATCCCC AGGCACTTTA TGCGCAGGAT
CATCCTCTTA GCGAGAATAG TCTGGATAAT AGCATGATAC ATCGTCAATG GAACTGGGGG
AACGGTGTTT GTCCTCCAAC CCGGGCCTAT TGTGAAAAAT TCTATAATAG AACTATCGAC
CTTATCAACA GGTATGATCC GGATCTGGTA TATTTTGATG ATACCGCATT ACCATTGTGG
CCTGTAAGTG ACGCGGGATT GAGAATCGCC GCGCACTATT ACAATCGGAG CATAAAGAAC
AGGAATAAGC TGGATGTGGT AATTAATGGC AAAATATTGA ACGAAGAGCA GCGTAAATGT
ATGGTATGGG ATATTGAAAG GGGACAGAGT AATAGCATTG AACCATCATC CTGGCAGACG
GATACCTGCA TTGGCGGATG GCATTATGAC CGAAATATTT ATGAAAAACA TGGTTATAAA
AGTGCACAGA CCATCATACA AACACTTGTA GATATTGTTA GCAAGAATGG TAATTTGCTG
CTCAGTATAC CCGTACGTGG TAACGGAAGT ATTGATGAGG ACGAGCGGAA CATCGTTGAG
GAGATAGGAC AGTGGATGCG GGTCAATGGC GAGAGCATAT ATGGTACAAG GCCATGGATA
GTATATGGAG AAGGACCGTC ACTTGAAAAT GCTGTTCCTT TGAATGCACA GGGTTTTAAT
GAAGGGAAAG GCCGTCCTTT TGAAGCGACG GACATAAGGT TCACCACAAA AGGAGAGGTG
CTGTATGCTA CAGTGTTGAA ATGGCCGGAA AATGGTGAAG TGAAAATAGG GAGCCTGGCG
GGTAGCAGTA ACCTTGAGCC GCGTGAGATC CGAAAAGTGG AACTGCTGGG AGATGCAGGC
GAGTTGTTTT TTGAGCGTTC TTCTACAGCG CTCAAAATAG TTCTCCCTGA GAGACATTCG
TTATCATCTT ATGCTGTTGC TTTAAAGATA ACTATGGCTA CATCTTGA
 
Protein sequence
MNRRTVIKQL GKALPATFLS KKLSAISAIS LNIGEKIIDG PFAPTWESLA QFKTPEWYRD 
AKFGIWAHWG PQCQPENGDW YAREMYMEGN HHYNFHLKKY GHPSKFGFKD VINDWKAEQW
NPEELLSLYR RAGAKYFVAL ANHHDNMDLY DSKYQPEWNS VKMGPKRDLI SGWAKAARNQ
GLHFGVSVHA AHAWSWMETA QRADTEGPYA GVPYDGKLRK SEGKSKWWNG LDPQALYAQD
HPLSENSLDN SMIHRQWNWG NGVCPPTRAY CEKFYNRTID LINRYDPDLV YFDDTALPLW
PVSDAGLRIA AHYYNRSIKN RNKLDVVING KILNEEQRKC MVWDIERGQS NSIEPSSWQT
DTCIGGWHYD RNIYEKHGYK SAQTIIQTLV DIVSKNGNLL LSIPVRGNGS IDEDERNIVE
EIGQWMRVNG ESIYGTRPWI VYGEGPSLEN AVPLNAQGFN EGKGRPFEAT DIRFTTKGEV
LYATVLKWPE NGEVKIGSLA GSSNLEPREI RKVELLGDAG ELFFERSSTA LKIVLPERHS
LSSYAVALKI TMATS
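
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own procedure: it builds a codon table from the usual compact 64-letter string (for translation table 11 the amino acid assignments are the same as in the standard code; the tables differ in permitted start codons) and translates the first ten and last six codons of the gene sequence listed in this record. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codons enumerated with bases in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored so sequences can be pasted in 10-base blocks.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAACAGAA GAACAGTAAT AAAACAATTG"))  # MNRRTVIKQL
print(translate("ACTATGGCTA CATCTTGA"))               # TMATS
```

Both fragments agree with the start (MNRRTVIKQL) and end (TMATS) of the listed protein sequence.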