Gene Cpin_5217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5217 
Symbol 
ID8361394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6591626 
End bp6593230 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content50% 
IMG OID644967365 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_003124849 
Protein GI256424196 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3534] Alpha-L-arabinofuranosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGC AGGTAACAAG AAGAAATTTT ATCGGCCTTA TCGGCATGAG CGGATTAGGT 
CTCGCTTTAA AAGGCAGTGC TTTTGCGGCT ACCCGTGCCG TCCTTAGCGG TGCTCCTTTA
GAGGCCAGTA TTGAGATCGA CTTCACACAT CCGGAAGGTA CAATAGACAA TGGCATTTAT
GGCCAGTTCA TAGAACACCT TGGCAGAGCC ATCAACGGAG GGATCTTCGA GGAAGGCTCC
CCACTCTCTG ACGCCAAAGG TATCCGCAAA GATGTACTGG AAAAGATCCG TGGACTTCAA
CCCAGCATTC TGCGATATCC CGGAGGTACC TTTACCAAGA TCTATCACTG GATGGATGGT
GTGGGTCCGC TGGCAGAAAG ACGCAGTCGT CCTAACCTGA TATGGGGTGG TGTAGAAGAT
AATCGCTTCG GGACAGATGA AGCCATTGCC TATGCCAGAA CCTTACAGGC AGATGCCTAT
TTTGCAGTCA ATATGGGTAC CGGCACGGCG GAAGAGGCAG GCAACTGGGT GGAATATTGC
AATGGCACAC AGGACACCTA TTACGCGAAT CTGCGCCGTA AAAACGGTCA TGCCGATCCT
TTTAAGGTAA AGTATTGGGG AATAGGTAAT GAAGAAGCTG CCGGCCCTGA TATTGGCCGT
TTACAGGATG TAAAAGAATT TGTAAAGGAA GCATGGTTGT ATACCAAAGC CATCAAGTTA
CAGGATAAAG ACGCAAAACT CATACTGTGT GGCGCAGATG ATACCTGGAA TGAATATGTG
CTGAAAGAAA TGGGCGCAGT ATGCGATTAC ATCTCTATGC ACCACTATGT CAGCTCTGAT
AAAAGTAAAC CGGCGTCGCT GTTTCCCCAG GTGGATCATA TGGAAAAACT GATCCTCACT
TTAAAAGGAC AGATCCGGAC GCTGACGCCG GAGAAAGTAA CCGACTTCAG CAAATGGTAC
CGCTTCCCGC ACAGGGCGAA TCCGGTGAAG ATCGCTATCG ACGAACTGGG TATCTGGGAA
CCTGGTGGTG CCGGCGCGTA CCAGCTGGAG GAATATTATA CCTGGGACCA TGCATTAGGT
ACCGCCACCT TCTATAACAT CATGCTCAGA CAGGCTTCCG TAGTAGGTAT GGCCACCTGG
GCGCAAACGG TCAATGTACT GGCTCCGATT ATGACCAGTA AAACTGCGGC CGTTTGCCAG
ACCATCTACT ACCCGATGCA GTTCTACCGC CAGCACGCGG GTAATGTCAG TCTGAAAACA
CAGGTAGTGA CACCTGACCT GAAAATGCCC GGCTCCAAAG ATGCCAAAGC CTTAGACGTC
GCCGTCACCC TGCACGACAG CGATGGCAGT CTGGTGATCT TTGCGGTAAA CCGTCATCCG
GAACAGGAAG TTAAAGCAGA CCTCCGCAGT ATCGACAGTA AAAAGTATAC GCCTGCTGCC
ATATACGAAC TGAATGCAAC TGCGATCGAC GCGATGAATA CATTGGAAAA TCCTGCGAAC
AATGTGGTCA CATCCACAGA GAAAAAGCTC TCGGGGAACC TTTCCGGCTA TACCTTCCCG
GCGCATTCCA TTACGGCGAT CAGGTATAAA CGCAACAAAC AATAA
 
Protein sequence
MNAQVTRRNF IGLIGMSGLG LALKGSAFAA TRAVLSGAPL EASIEIDFTH PEGTIDNGIY 
GQFIEHLGRA INGGIFEEGS PLSDAKGIRK DVLEKIRGLQ PSILRYPGGT FTKIYHWMDG
VGPLAERRSR PNLIWGGVED NRFGTDEAIA YARTLQADAY FAVNMGTGTA EEAGNWVEYC
NGTQDTYYAN LRRKNGHADP FKVKYWGIGN EEAAGPDIGR LQDVKEFVKE AWLYTKAIKL
QDKDAKLILC GADDTWNEYV LKEMGAVCDY ISMHHYVSSD KSKPASLFPQ VDHMEKLILT
LKGQIRTLTP EKVTDFSKWY RFPHRANPVK IAIDELGIWE PGGAGAYQLE EYYTWDHALG
TATFYNIMLR QASVVGMATW AQTVNVLAPI MTSKTAAVCQ TIYYPMQFYR QHAGNVSLKT
QVVTPDLKMP GSKDAKALDV AVTLHDSDGS LVIFAVNRHP EQEVKADLRS IDSKKYTPAA
IYELNATAID AMNTLENPAN NVVTSTEKKL SGNLSGYTFP AHSITAIRYK RNKQ