Gene Cpin_4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4144 
Symbol 
ID8360317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5166537 
End bp5169815 
Gene Length3279 bp 
Protein Length1092 aa 
Translation table11 
GC content44% 
IMG OID644966315 
ProductYD repeat protein 
Protein accessionYP_003123804 
Protein GI256423151 
COG category 
COG ID 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.794308 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.283099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACTG TTTGTTCATT TCACCGGAAA TGGTGTTTCC TTTTATTATT GGTAACTGGC 
ACCTGCTACA GCTATGCACA GGAACGGTCT TCTTTTGACA TGACAGCGCC AACGACTGTC
TATAAGCCAT CTTCTACTGC CAGCGAACTG GTTAAAGTCA CCGATATTCC AGTCAGTACA
TACAGTGGTA CGGCCAACAT CAGCTTTCCG TTGGCTGTCA TTCAATCAGG TGCGCTTCGT
CATCCGATCA CGCTGAATTA TACGGGCGGT GGCGGGATTC CTGTTGATCA GGAAGGTAAT
TGGGTCGGAT TAGGCTGGGA TCTTTCCGTA GGTGGCGTCA TCACCAGGTC TGCGATGGGA
AAGAATGATG AACTGGTTAC CACACCGGGT TATACTGCCG CCGCTAAGTC AACAAAACTG
CCGCTGTATA CGGATGCTAA TCCCGGTAGC TGGCTCAATA GTCTTAGTAC CTGTAATAAA
AAAGATATAG GAGAAGGACG TGTTGATCTC TCACCGGATA TCTATTTTCT CAATTTTGGC
GGGCATACGG CGAAAATGTT CTTTGATAAG AACGGGCAGG TTTTCTTCTC TCCATTCAAA
ACATGGCGAT TAGCAGGTAA TCAGACAGCC GGTTTTACCA TCACTACAGA AGATGGCAAC
AGGTATGAAT TCAGGAACAT AGAACATTCA TCTACCAGTT CGGAGACTTA TCCGGGAGAT
GCAATAAGTG TGAATGCAGG CAATTCCGCA TGGTTTCTTA CGAGAATAGT CTCTGCTACA
TTACGGGACA GCATCACTTT CAATTATACG CCGCTGACAT ACACTTATGA AGATGGCCTA
CCCTCCTATA CCGTGTATGA TCTGTTGCCA GGACAAAGCA ATTCCCCCTG TTCCGGGGGA
GGTTTCATGG ATACGCACAG AGAGAGCTAT ACACTGAATT ATCAGACATT ACATACGCAT
GTATTGAATA GCATTACTTA TAACGGCGGG AAAATACAAC TTACCAGGAG CAGTGATCGC
AAAGATGTCA ATTCAGGTAA TAAGTATAGA TTAATGGGTC TGGAAATCTT CACAGCCAGC
TACGCCGGAT TTACACTATA TAAAAAGTTT CTTTTCAATC AGCAGTATAC AAATGCCACG
AAATCAGACC CTTTGAGTAA ACGAATGCTG TTATCTTCCT TTTATGAAAC AGGTGGGACT
GACACGTTAA AGTATCTCTT TTCATACATA GATCCGGAGA CATTGCCGGC TAAGCAATCA
ATGTCACAGG ATCACTGGGG ATATTTCAAC GGGAAAAATA ACGGGACGCT AATACCTGCT
TATGATGATA ATATGGGGAA TGTTTTAGCG GGAGCTAACC GTGAACCGGA TTCCGTTTAT
ATGCAGAAAG GATTGCTGCA AACAATCACT TATCCTACCG GTGGCGTAGC AACGTTTGAC
TATGAACCTC ACCGATATTC GTACTATAAT GCACAATACC AGTATTTGCC GCGGCGAACG
GATTCAGTTA TGACTGATGC CGCTCTCACC GCTAATACAA ACTTTATCGG AACAACAACA
GGGAAAGCGG CCGACACCGT AATGATTGTA GTGCCTGATA TCGTCGGACA AAAAACAACG
ATTACCTACT TTGTAAAAGG GAAGATCGCT GGTGATGCAC TGGCGGAAGT ATTGGTCTAT
GACGCCAACT GGCGCTTAAA AGCGGCGGCC GGAGACTCGC GTAATCAGAC ACTTACGATG
GCACTTACAT TAAACAGCGG ACAACGTTAC TATCTGATCG CACGGCGGGA CCTGGCTACC
GAACAGGCCA GGATAAGTGT GTACTATAAG AAATACAACT ATATGCCTGC CGCCCCTGTG
TATAGCAAGA TGGCGGGTGG CAATAGAATA AAACGTATTA CACTGTTTGA TGGGGTTAGT
CATCGTAATG ATATTGTCAG GCGTTACCAA TATATGCTCA ATGACAGTAT CTCGTCCGGC
ATACTACTGG ATCTGCCTAA ATATGAGGAT GTTACATTTA CCGCTTACTA TTGTAATGTT
TCCACTTCAG AAGGCGGCGG ACAACCATAT AAAACAGGTG ATCTGTCCTA TTTTACCAGG
TATTCTACTT CCTTAAATTC CCTCGGACGA ACGCAGGGAT CTCCTGTGGG TTATTCCAAA
GTAAGTATAC TCAGCGGGGA ACGCGGAGAA AATGGCCGGG AAGATTTCTT CTATACCATC
ACCGGCTTGT ATGATGAAGG CGGCAATGGG TATCCCTATG CGCCTAAATC CAGCAAAGAA
GACCTGAGAG GGTTATTGCT GGCGCATAAA ATCTATAATG CGGCAGGTAC AATTCTTAAG
GCTACGATGA ATGAATACGC ACTGAACAAT GCGGGTGGAA ATGCAAATTT TGCCTGGATA
TGGGGCGCGA AGATCGGTAT CCGTAAATCA GATGGCTATC CGACTACTAT CTGCCCGGAA
GGTAGTAAAT GGAGCTTTAT TGGTAGCATG TATAAAATCT GCCAATACTG GCCAATATTG
CGTTCCAGAA CGGACAGCAC TTATGATAAC AACGGGAATG TATTGGTCAG CAAAGTAAGC
TATCAATATG ATCCATATAA CCTACAGGTA ATCAATGAGA CATCCACTAA CAGCGATCAG
CGTGTCCTAT CCAAAACATA TAAGTATCCT CTTGACTTTC GGGGTATAGC GGTATATGAT
AGTATGCTTG CCAGGAATAT CCAGCATGAG AAAATAGAAA CGATCGTTAA GCGTGATAAT
GTGCAGATTT ATAGGGAAAA AACCAACTTC GGGTTCTTTC ATACTTTCAT AGCACCAGCT
TCAAAAGAGT TGATCTATGG TGATGATCCA ATGGAAAGCC GCCTGCAGTT TGTCCGGTAT
GATACTGATG CTCATTTACT GGAGCAGGCG AAAACGGAAG ATGTACCTGA AGTATATCTA
TGGGGGTATC GTAATGAGTT TCCGGTGGCC CGCATAGTCG GCGCCACATA TGGCGAGGTA
ATCAAACTTG TAAATGCCAG CGTACTTAAT AATCCGGTCT CTGATCAGGC ACTACGGGAT
GAAATTAATA AGATCAGGAC GGCCTTTGCA GATAACAGTA CACAGGTGTA TACGTATACC
TATTCTCCAC AGGCAGGTGT GACTAGTGAA ACTGATCCTG CCGGACGGGT AACATATTAT
CAATATGACA GTTTTCAACG GCTGGCAACG TTAAAAGATG TTGACGGGAA TATTATCAAA
CACTTTGATT ACAGATACCA GCATCCTGTT AGTCAGTGA
 
Protein sequence
MFTVCSFHRK WCFLLLLVTG TCYSYAQERS SFDMTAPTTV YKPSSTASEL VKVTDIPVST 
YSGTANISFP LAVIQSGALR HPITLNYTGG GGIPVDQEGN WVGLGWDLSV GGVITRSAMG
KNDELVTTPG YTAAAKSTKL PLYTDANPGS WLNSLSTCNK KDIGEGRVDL SPDIYFLNFG
GHTAKMFFDK NGQVFFSPFK TWRLAGNQTA GFTITTEDGN RYEFRNIEHS STSSETYPGD
AISVNAGNSA WFLTRIVSAT LRDSITFNYT PLTYTYEDGL PSYTVYDLLP GQSNSPCSGG
GFMDTHRESY TLNYQTLHTH VLNSITYNGG KIQLTRSSDR KDVNSGNKYR LMGLEIFTAS
YAGFTLYKKF LFNQQYTNAT KSDPLSKRML LSSFYETGGT DTLKYLFSYI DPETLPAKQS
MSQDHWGYFN GKNNGTLIPA YDDNMGNVLA GANREPDSVY MQKGLLQTIT YPTGGVATFD
YEPHRYSYYN AQYQYLPRRT DSVMTDAALT ANTNFIGTTT GKAADTVMIV VPDIVGQKTT
ITYFVKGKIA GDALAEVLVY DANWRLKAAA GDSRNQTLTM ALTLNSGQRY YLIARRDLAT
EQARISVYYK KYNYMPAAPV YSKMAGGNRI KRITLFDGVS HRNDIVRRYQ YMLNDSISSG
ILLDLPKYED VTFTAYYCNV STSEGGGQPY KTGDLSYFTR YSTSLNSLGR TQGSPVGYSK
VSILSGERGE NGREDFFYTI TGLYDEGGNG YPYAPKSSKE DLRGLLLAHK IYNAAGTILK
ATMNEYALNN AGGNANFAWI WGAKIGIRKS DGYPTTICPE GSKWSFIGSM YKICQYWPIL
RSRTDSTYDN NGNVLVSKVS YQYDPYNLQV INETSTNSDQ RVLSKTYKYP LDFRGIAVYD
SMLARNIQHE KIETIVKRDN VQIYREKTNF GFFHTFIAPA SKELIYGDDP MESRLQFVRY
DTDAHLLEQA KTEDVPEVYL WGYRNEFPVA RIVGATYGEV IKLVNASVLN NPVSDQALRD
EINKIRTAFA DNSTQVYTYT YSPQAGVTSE TDPAGRVTYY QYDSFQRLAT LKDVDGNIIK
HFDYRYQHPV SQ