Gene Cpin_6806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_6806 
Symbol 
ID8362998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp8524585 
End bp8525943 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content48% 
IMG OID644968939 
ProductAlpha-L-fucosidase 
Protein accessionYP_003126408 
Protein GI256425755 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00560113 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00000369598 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAAAAC ACTACCTATT GCTGCTGGCA GCTTTGTTAT GTTATTGCCC GTTAACCTTT 
TCACAGAAAA AGATTGGCTT TGAAAAACCC GGAGACCAGC AGAAAAGAAT GGCCTGGTGG
ACCAATGACC GTTTTGGTAT GTTCATTCAC TGGGGATTGT ATGCCTTACC CGCCCGGCAT
GAGTGGGTGA AGAAGCGGGA ACAGCTGACC AATGACGATT ATCAACAGTA TTTTGACCAG
TTCAACCCGG ATTTGTATAA TCCGAAGGAG TGGGCTAAAA TGGCCAAAGC CGCCGGAATG
AAGTATGCTG TTATTACCAG CAAACACCAC GAAGGATTTT GTCTGTTTGA CTCAAAATAT
ACTGATTATA AAGCCCCTAA TACCCAGGCA AAGCGTGATC TGATAAAGGA ATGGGTAGAT
GCTTTCAGGG CGGAAGGTCT GAAAGTGGGT TTTTACTATT CCCTGCTGGA CTGGCACCAT
CCTGATTTTA CGATAGACAA GCACCATCCG CAGCAGCCGG CTGGCGATAG CGACACCGCT
TATGCCAGGC TGAATCAGGG AAAGGACATG AGCAAATACC GCGAATACAT GTATAACCAG
ATCAAGGAAC TGCTCACCAA ATATGGTAAG ATCGATATCA TGTGGCTGGA CTTTTCCTAT
CCTGGTAAGA ACGGTAAAGG CCGCGACGAC TGGGGATCGC TGGAGCTGAT GAAAATGATG
CATAAGCTGC AACCAGGGAT TATTGTGGAT AACCGCCTCG ACCTGAATGA CTATGAGGAC
GGCTTTGACT TTGTCACCCC TGAACAGACG CAGGTGGCGG AATGGCCGAC TGTAAACGGC
AAAAAGGTAG CCTGGGAGAC CTGTCAGACC TTCTCCGGTT CCTGGGGATA TTATCGCGAT
GAAGACAGCT GGAAGAGCCC GTCTCAATTA TTGCAGCTGT TGATCGGTTC TGTGAGTAAA
GGAGGGAACC TGTTACTGAA TGTAGGTCCT ACCGCCCGGG GTTTGTTTGA CTACCGTGCA
AAGGATGCAC TGGGTGCAAT CGGAGAGTGG ATGGCTGTAA ACAGTCCTTC CATCTACGGC
TGTACACAGG CGCCAGAAGA GTTCCGGGCG CCTAACGGAA CGATGCTGAC CTACAATCCG
ACCACCCGTA AGCTATACCT ACATTTACTG TCATGGCCAC TCAATAAACT GGTGCTGCCA
GGATTTAACC CAAAAGTTAA ATATGTGCAA TTCCTGCATG ATAATTCAGA AATAAAGTAT
ATTGGAAAGG AAAATCAGGG CAGCAATGAT CTAGTACTCA CGATGCCTTT GAAAAAGCCT
CCGGTGGAGA TACCAGTGAT TGAGCTGACT TTGAGATGA
 
Protein sequence
MRKHYLLLLA ALLCYCPLTF SQKKIGFEKP GDQQKRMAWW TNDRFGMFIH WGLYALPARH 
EWVKKREQLT NDDYQQYFDQ FNPDLYNPKE WAKMAKAAGM KYAVITSKHH EGFCLFDSKY
TDYKAPNTQA KRDLIKEWVD AFRAEGLKVG FYYSLLDWHH PDFTIDKHHP QQPAGDSDTA
YARLNQGKDM SKYREYMYNQ IKELLTKYGK IDIMWLDFSY PGKNGKGRDD WGSLELMKMM
HKLQPGIIVD NRLDLNDYED GFDFVTPEQT QVAEWPTVNG KKVAWETCQT FSGSWGYYRD
EDSWKSPSQL LQLLIGSVSK GGNLLLNVGP TARGLFDYRA KDALGAIGEW MAVNSPSIYG
CTQAPEEFRA PNGTMLTYNP TTRKLYLHLL SWPLNKLVLP GFNPKVKYVQ FLHDNSEIKY
IGKENQGSND LVLTMPLKKP PVEIPVIELT LR