Gene Cpin_5246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5246 
Symbol 
ID8361423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6642204 
End bp6643892 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content49% 
IMG OID644967394 
ProductCarbohydrate-binding family V/XII 
Protein accessionYP_003124878 
Protein GI256424225 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.876485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0000498819 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTATAA CAACCAGAGC AGCGCTGAAA GCACAATTTA AGACCGGCGC TATACCCACC 
TCGCAGGATT TCTTCAACCT GATAGACTCA ACGCTGGTGA GAAGGGATGA TGCTTTTTTC
GGCAAGTGGG CAGCTGGGAC CTGCTACTAC GAAGGCGATG TAGTGATCTA CAATAATGCA
TTATACACCT GTGTACCAGC CGGCGATAAA CCCTGCGGCT GTGAAGGAAA AGAAGGAGAT
ACCAAGGCAG ACAAGTCGAA AGGACATTGC TCCGTGGACA ATCCTGAGAT TGATTGTACA
AACTGGAAAA TGCTGGATAT CGATGCTTCT GATGAAGATT GGGAGATCGT CCGCAATGAA
GATAAAGTAC CGGTTATCAT GTACGCGAAA GTATTTGGTA AGATCGGTAT GGGTACAGAA
GATCCCAAAG CGCGTGTGCA CATTCATGCA GACGAGGTGA AAGGAGATTT TCTGTTTAGT
CCGGATAATG CACAGACCCC TGAGTTTGTG ATCCAGCAGA CAGGAGGCGA AGCGCCAACG
CAATCCCTGT CTGAAAAGAT AGCAGACAAC AAAGCTGTTT TTACAACGAA TACAGACGGT
TTCCTGTTTA CAACCACGGT GCCTCCGGTG CCAGTAGAAG GAGAGGAGAA TGTACCGAAG
GAAGCAGCCG CGGTGAAGCC GGTGTTTATC ACCACACCAG ATGCCGGTGC GGCAGTAGGT
ATCGGTACAA CAGTGCCGGA AGGTGCTGTT GATATCCAGA ACCAGCCGGC AGCCAGACTG
ATACTGAATC CGGTACAATC AGTCGTACCA CAGGCGGTAT GGTTGCACAA AGGTGAATAT
GGTCAGCAGA GTTATCTGAA CACCGCGCTG GATGGTATCG CTGCTACTTT CACCACGAAT
GCGGCAGAAG GCTTTTATTT CCGTAAAGGA CTGGAAGATG GTAAGAACTA TCTGAAGAAT
ATTGCCAAAC CTGCGACGGA GACCTTAGTG TCTATCAAAC AGGATGGCCG TGTAGGGATC
GGTACTGAAT CGCCGGTCAC CAATGTAGAG ATCACCAAAG AGAACAGTGC CGGCGCATTC
CTGCTGTGTC TCGACAACAC CAATCCTGGT TTCAGTATCA TCAATAACCG TCCTAACAAC
GAGAAGAGAA ATTATCTGTT GCTGGGCGCT GATAACGACT GGGGGGCTTT CCTGACGGAT
GCTTCCAAAG GTTTTGTATT CAAGAGAGGT GGTGAATATG GTAATGGCAA CGAGCTTGAG
ATCAATCAGG GCGATGACCT GGTGACGATC TCCAACGAAG GTAAAACCAT CATAGGTGGT
TTGACTGCAC AGGGTTTTGA TCTGAATGTA AAAGGTAAAG CCAGAAGTTT TGGCTTGTAC
CTGGATACTG ATGTCCGCAA GGTGACAAAT CAGGCGAAAC TGGGCAGTGT GATCGATAAG
GTAAAGCGAC TGAATCCTAT CACTTTCAAC TTCAACCAGA AGGCGAATTG TCCTTCCAAT
GAATCACAGA TCGGTTTCCT GCCACACCAG GTTGAGGAGT TCTTCCCGGA ACTGGTGAAT
ACGGACGGCG ATGGTACACA GACCCTGGCA TATGCCAATA TGGTTGCTGT GCTGACCAAA
GCGATACAGG AACAGCAGGA TACGATTGCT GCCCTGCAAA AGCGCCTGGA TGCCCTGGAA
GGCAAATAA
 
Protein sequence
MSITTRAALK AQFKTGAIPT SQDFFNLIDS TLVRRDDAFF GKWAAGTCYY EGDVVIYNNA 
LYTCVPAGDK PCGCEGKEGD TKADKSKGHC SVDNPEIDCT NWKMLDIDAS DEDWEIVRNE
DKVPVIMYAK VFGKIGMGTE DPKARVHIHA DEVKGDFLFS PDNAQTPEFV IQQTGGEAPT
QSLSEKIADN KAVFTTNTDG FLFTTTVPPV PVEGEENVPK EAAAVKPVFI TTPDAGAAVG
IGTTVPEGAV DIQNQPAARL ILNPVQSVVP QAVWLHKGEY GQQSYLNTAL DGIAATFTTN
AAEGFYFRKG LEDGKNYLKN IAKPATETLV SIKQDGRVGI GTESPVTNVE ITKENSAGAF
LLCLDNTNPG FSIINNRPNN EKRNYLLLGA DNDWGAFLTD ASKGFVFKRG GEYGNGNELE
INQGDDLVTI SNEGKTIIGG LTAQGFDLNV KGKARSFGLY LDTDVRKVTN QAKLGSVIDK
VKRLNPITFN FNQKANCPSN ESQIGFLPHQ VEEFFPELVN TDGDGTQTLA YANMVAVLTK
AIQEQQDTIA ALQKRLDALE GK