Gene Cpin_4398 details


Gene Information

Locus tag: Cpin_4398
Symbol:
ID: 8360571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 5475992
End bp: 5479420
Gene Length: 3429 bp
Protein Length: 1142 aa
Translation table: 11
GC content: 46%
IMG OID: 644966557
Product: PKD domain containing protein
Protein accession: YP_003124045
Protein GI: 256423392
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
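
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5475992
end_bp = 5479420
gene_length_bp = 3429
protein_length_aa = 1142


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, both comparisons evaluate to true for this gene: 5479420 - 5475992 + 1 = 3429 bp, and 3429 / 3 - 1 = 1142 aa.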


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0061097
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.588694
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAGGA GAGATCTATG GACTTTTATG CTGTTATCAG TTATTGGAAC TTTCGGAACG 
ATCTATGCCA ATGCACAGAC ACTCCGTGTG CTTGTGTTTT CCAAAACAGA GGGCTTCCGG
CATTCTTCTA TTGAACCAGG AAAGGCGGCT TTTTCAAAAA TGGCTGCTGA AAAAAACTTT
GCGGTAGACT TCACCGAAGA CGCCTCGTTC TTCAATACCG CCGTGCTCAA ACGATACAGT
GCAGTGGTCT TTCTCAGCAC CACAGGCGAT GTACTGAACG ATGCACAACA ACAGGAGTTT
GAGCGATACA TACAGGCAGG TGGTGGCTTC GTAGGTATCC ATGCAGCCAC AGATTGCGAA
TATGACTGGC CCTGGTACGG CCGGCTTGTA GGCGCCTGGT TCCTGGACCA CCCCATGCCG
AACAATGTCC AGAAAGGTAA GTACTATGTA ACCGCTAAAA ACAGCTTTGC TACCAAAGAA
ATGCCGGACA CCTTTCAAAG GATGGACGAA TTTTATAGCT TTAAACAGAT AGATCCTGGT
ATACATCCAC TGATAAAGAT TGATGAAAAG AGCTATACCG GTGGCAAGAA TGGCGATAAT
CACCCTATGA GCTGGTACCA CGATTTTGAC GGTGGCCGTT CCTTCTACAC CAACATGGGA
CATACAGATG AAACATTCAA AGAAGACCTG TTCTTAAAAC ACCTGTATGC TGGTTTGCAA
TATGCAATGA GTGCCGGTAA ACCTGTTACA TTAGACTACT CAAAAGCGAA ACCGGAAGAA
AACAGGTTCA CCAAAGTGAT ATTGGCGGAG AAACTGAATG AACCAATGGA AATATCAGTG
CTGAATGATG GCCGTATATT GTTCGTTGAG CGACATGGTG CTGTAAAACT CTACAACATC
AAAACAAAGC AATTAAAGAC CATCGCAACC ATTCCCGTAA GTACAAAATA TAAAGATAAA
GAAGGCAAAG AGTCAGAAGC GGAGGATGGT TTATTGGGAC TGAATAAAGA CCCGGACTTC
GCTTCAAATC ACTGGATATA CCTGTATTAT TCCGATCCTG CTAAACCGCA GAATATCCTT
ACCAGGTATA CACTAAACGG CGATATACTC GACCTGAAAT CCAGGAAGGT ATTGCTGGAA
ATACCCACAC AACGTGAACA ATGCTGTCAT ACAGGAGGCT CGATTGACTG GGACGGAAAA
GGTAACCTCT ATCTGTCTAC CGGAGATAAT ACGAGTCCGC GGGCGACGAC CTACGCACCT
ATTGATGAAC GTGCCGCTAG ATCTCCCTGG GATGCACAGA AATCTTCTGC AAATACCAAC
GATTTAAGAG GTAAGATCAT TCGTATTAAA CCACAGCCGG ATGGCTCCTA TACTATTCCG
GAAGGGAACC TTTTCCCGAA AGGTACTGCG AAGACACGCC CGGAAATCTA TATCATGGGT
GACAGAAATC CTTTCAGATT AGCAGTGGAC AAGAAGTCCG GTTTCCTGTA CTGGGGAGAA
ATCGGGCCTG ACGCCAGTGA CGACTCATTA AAAGGTCCCG CAGGACAAGA TGAAATCAAC
CAGGCAAAAA AACCGGGTAA CTACGGATGG CCTTATTTTG TCGGAGACAA CATCGCTTAT
CCCAGGGTAG ATTTTACCAA CGAACAAGTG GGTGGTAAAT TCGATCCTTC AAAACCTATC
AATACATCTC CCAATAATAC CGGTCTGAAT GAATTACCGC CTGCAGAGAA GGCTTTTATC
TGGTATCCCT ACGGTGTATC TAAAGAGTTT CCTTTACTGG GAAGCGGTGG TAGAAGCGCC
ATGGCCGGTC CCGTTTTCTA TAGTGATGAT TTCAAAAAAG CACAACGTGC TTTCCCGGAC
TACTATGACG GAAAACTCTT GATATACGAA TGGATGAGAG GATGGATCAT GGCCGTAACG
CTGGACAAAG ACGCAAACTA TGTGTCAATG GAAAGAGTGA TGCCCAGCTA TAAATTCAGT
AATCCGATGG ATATGGAGTT TGCCGCTAAC GGCGATCTGT ATATGCTCGA ATATGGATCA
GGTTGGTTTT CTGCGAATGA TGATGCCAGG TTGATCCGTA TCGAATACAA CAGCGGTAAC
AGGCAACCCA TGATCCGTAT GTCAACAGAT AAACCGGGTG GCGCCATTCC TTTAACCGTG
AATCTCTCCG CCGCAGGTAC CAGTGATCCG GATGGAGATA CCCTGAAGTA TGTATGGAAA
GTCACTTCAA AGAACGGATA TACCAAGACC ATCGCCGGAC AAGACGCTGC ACTCACTTTT
GCAAAACCTG GGTTGTATAA GGCCAGTCTG ACTGTATCTG ATGGAAAAGG TAGTACCGCT
GTACAATCTA TCGAACTGAC TGCAGGCAAC GAAGCGCCTG ATGTGAGAAT CGATATCGGC
AGTAATAACA AGAGTTTCTT TAAGGTAGAC AAAACCTATA CGTATAAAGT CGATGTAAAA
GACAAAGAAG ACGGTTCCTT ATCCGCTGGT AAAATAAAAG CTGCGGATAT CGCGGTGAAC
ATTGATTATG TAATGCCCGG ACAGGACAAT CAACCGCCGG CAACAGGACA TAAAACAGCC
CCTGTTTCCT CTACAAATAC AAAAGGGCTG AAGTTGCTGA CGGCAAGTGA TTGCAGGGCC
TGTCATACGG ATTATAAAAA ATCCATAGGA CCGGCTTATT TCGCCGTGTC GAAGAAATAT
CAGGGTAATA ACAGTATATT GGAAAAGCTG GTAAAAAAGA CAATCACAGG TGGTAAAGGC
GTATGGGGAG ATGTGGCCAT GCCAGCGCAT CCGCAATTGT CCGCAGATGA TGCCGCCGAA
ATGATTAAAT ACATCCTCGA TTTATCCAAA CCTAAAACCA CTGTTAAGTC ACTACCGGTA
ACAGGTACTT ATGCAGCGAA GCTGCCGGCA GGAGAGAAGG GAGCCGGCCT GTTTGTCTTT
AATGCCACAT ATACGGACAA AGGAAGTAAT GGTCTGCCAG GTATTACTTC CGTAGACTCG
ATTACACTGC GTAATCCCAG TATCAATCCA ACGAAATATG ATATCGTAAA GGATGCAACG
AAGATGAGCT TTAGTGGCAA CAGCTTTATT ATCCCCGTAC AATCGGGTAG TTATATCGGT
TTAAATCATA TCGACCTGAC AGGTATTACC GCAATAGACT TCATGGCGAT GGCGCCAAAA
GCACAGATCA ATGCGGCAGG TGGTATTATC GAACTGCATA TAGATACCCC GGATGGTAAA
TTGCTCGGAC AAACACCCTT TATTGGCGAT GCACCCGGAG GGGCTATGTT CGGCGGTAAA
CCGACACAGT TATCCGTTAC TCCTACAAAT GGCTTCCATG ATATCTACCT GGTGTTCAGA
AACAAAGATG CAGCACCGGG ATCCTCCCTG ATGATCGTTC TGAATACAAC TTTCAGAATG
GCAGATTAA
 
Protein sequence
MVRRDLWTFM LLSVIGTFGT IYANAQTLRV LVFSKTEGFR HSSIEPGKAA FSKMAAEKNF 
AVDFTEDASF FNTAVLKRYS AVVFLSTTGD VLNDAQQQEF ERYIQAGGGF VGIHAATDCE
YDWPWYGRLV GAWFLDHPMP NNVQKGKYYV TAKNSFATKE MPDTFQRMDE FYSFKQIDPG
IHPLIKIDEK SYTGGKNGDN HPMSWYHDFD GGRSFYTNMG HTDETFKEDL FLKHLYAGLQ
YAMSAGKPVT LDYSKAKPEE NRFTKVILAE KLNEPMEISV LNDGRILFVE RHGAVKLYNI
KTKQLKTIAT IPVSTKYKDK EGKESEAEDG LLGLNKDPDF ASNHWIYLYY SDPAKPQNIL
TRYTLNGDIL DLKSRKVLLE IPTQREQCCH TGGSIDWDGK GNLYLSTGDN TSPRATTYAP
IDERAARSPW DAQKSSANTN DLRGKIIRIK PQPDGSYTIP EGNLFPKGTA KTRPEIYIMG
DRNPFRLAVD KKSGFLYWGE IGPDASDDSL KGPAGQDEIN QAKKPGNYGW PYFVGDNIAY
PRVDFTNEQV GGKFDPSKPI NTSPNNTGLN ELPPAEKAFI WYPYGVSKEF PLLGSGGRSA
MAGPVFYSDD FKKAQRAFPD YYDGKLLIYE WMRGWIMAVT LDKDANYVSM ERVMPSYKFS
NPMDMEFAAN GDLYMLEYGS GWFSANDDAR LIRIEYNSGN RQPMIRMSTD KPGGAIPLTV
NLSAAGTSDP DGDTLKYVWK VTSKNGYTKT IAGQDAALTF AKPGLYKASL TVSDGKGSTA
VQSIELTAGN EAPDVRIDIG SNNKSFFKVD KTYTYKVDVK DKEDGSLSAG KIKAADIAVN
IDYVMPGQDN QPPATGHKTA PVSSTNTKGL KLLTASDCRA CHTDYKKSIG PAYFAVSKKY
QGNNSILEKL VKKTITGGKG VWGDVAMPAH PQLSADDAAE MIKYILDLSK PKTTVKSLPV
TGTYAAKLPA GEKGAGLFVF NATYTDKGSN GLPGITSVDS ITLRNPSINP TKYDIVKDAT
KMSFSGNSFI IPVQSGSYIG LNHIDLTGIT AIDFMAMAPK AQINAAGGII ELHIDTPDGK
LLGQTPFIGD APGGAMFGGK PTQLSVTPTN GFHDIYLVFR NKDAAPGSSL MIVLNTTFRM
AD
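
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the method used to produce this record. It assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). The names `clean`, `translate` and `gc_fraction` are hypothetical helpers.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate DNA in frame 1; '*' marks a stop codon.

    Raises ValueError on any character other than A, C, G or T.
    Trailing bases that do not fill a codon are ignored.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence in this record.
first_line = "ATGGTCAGGA GAGATCTATG GACTTTTATG CTGTTATCAG TTATTGGAAC TTTCGGAACG"
last_line = "GCAGATTAA"

n_terminus = translate(first_line)  # "MVRRDLWTFMLLSVIGTFGT"
c_terminus = translate(last_line)   # "AD*"
```

The two fragments reproduce the first 20 residues and the final two residues of the protein sequence listed above, followed by the stop codon. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be compared in the same way.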