Gene Cpin_5508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5508 
Symbol 
ID8361685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp7027184 
End bp7030144 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content48% 
IMG OID644967654 
ProductTonB-dependent receptor plug 
Protein accessionYP_003125138 
Protein GI256424485 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000560804 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00878967 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAC TACTCATCAT CTGCTGGTTT TTGCTAGTAA GTGTCAGTGT GACATTTGCG 
CAACGCCAGA CAATCAAAGG GCGGGTATCC GACGCAGCAA CCGGAGAACC GCTGGTTATG
GTCAATGTGC AACTTAAAGG ATCCACAACA GGCACACAGA CCGACGCCAC AGGCACATTT
TCCCTTACCG TTCCAGATGC CCAGACTGGT GTACTGTTAA TCAGTTACCT CGGCTATCAG
GCTTTGACAC AAAACATTCA TGGTAACAGC AACATTGTAG TGAAACTGGA AAAAGATAAT
AAGCAATTGG ATGAAGTCGT GGTAGTCGGT TATGGTGAAG TAAAGAAACG TGACCTGACA
GGCGCCGTTA CATCTGTGAA AGGTGAAGAA CTCAGAAAAG TACCTTCCAC CAACGTAATG
GAATCCGTAC AGGGTAAACT GCCAGGTGTC GATATCACCA AAAGCAATGG CGCCGCTGGT
GCAAAGATCA ATGTTACCGT ACGTGGTAAC AGGTCTATCC GTGCGGATAA TGGTCCGCTG
TACATCGTAG ACGGTGTGCA ATACGAAAAT ATACAGGATA TTAACCCCAA CGATATTCAG
TCGATGGAAG TACTGAAAGA TGCTTCCTCC ACCGCCGTAT ACGGATCCCG CGGTGCAAAC
GGTGTTATTA TCATCACCAC CAAAAAAGGC GCTTCGGGTA AACCACGTGT TTTTGTGAAT
ACTTATGCAG GTATCTCACA GGTAGCCGGC TATCCTGCCA TGTCTACCGG TCCGCAGTAT
GTTGCACAGA AACGTGAAGC GAACCGTACC ACCGGCCGCT GGAAAAGTGA AGCGGATGAC
CCGCTGATCT TCAATGCTGC CGAAGTAGCA TCTATTCAGA ACAACCTCTG GACCGACTAT
GCAGACATGC TGATCCATGA TGGTTTACAG CAGGATTACC AGGTGGGTGT TTCCGGTGGC
TCTGAGAAAA CGAAAGTCTA TCTCTCGCTG GATTATTTCG ACGAAAAAGG CATCTTCAAA
CTGGATCACC TGAAACGTTA TTCTGCCCGA TTAAATATCG ATCAGACGAT CAACGATTAC
CTGAAAGCAG GTATGCAGAG CCAGGTGACT TACTACAACC AGGACAATCG TCGTGATCCG
CTGAACCAGG CCAATAAGAT TGTACCGCTG AATCTTCCCT ATGATGAAAA CGGCAACCTG
ATCCTGTTCC CGAATATGGG CTCAATGATC AATCCGCTGG CGGACGAACA ACCCAATGCC
TATCAGAACA ATGGTCGTGT AACCCGCACC TTTGTCAATG CCTATGTAGA ACTGACACCT
TTAAAGGGGC TGTCCTTCCG TTCCAACCTG GGTGTTACTT TGGATAATGC CCGCACAGGG
ATCTTCGCTT CCAAAAACAC GATCGAACGT TCTACGGCAT CGGCTTCCAG ATCACAGTAT
AACAGCGGCA ATAAACGGTA TATCAGCTGG GAAAACGTAC TGAACTATAC CCAGAAACTG
GGAGACCATA CTTTCGGTCT GACGGGTGTA GCCAGTTTAC TCTCTGATCA GCGGGATTCC
AGTTTTTTAC AGGGTGAAGG CCAGTTGCTG CCCGCTCAGT TATATTACGC AATGCAGAAC
AATGTAAGCG GTATCGCAAT CCGTTCCAGC TACATGGCGA ATAAACTGAT CTCGTTTACC
GGTCGTATCA ACTACAACTA TAAAGGAAAA TACCTGTTGT CACTCACAGG TCGCTCGGAT
GGTAGTTCCA AACTCGCGCC AGGAAATAAA TGGGCTTTCT TCCCTTCTGT AGCAGGTGCA
TGGCGTGTCT CCGACGAACC CTTCATGAAA TCACAGCGTG TGTTCAGCGA TCTGAAAATA
CGCGCCAGCT ATGGTCTTGC GGGTAATGAT GCGGTACCGC CCTATGCAAC CGCCAGTTAC
CTGACCAAGA TCCCATTCTC TTATGACGAT ACTAATTCCG CACTGGCATA TGGTATCGGT
AGTCAGATCG GTAACAGGAA CCTGAAATGG GAGCTGTCTG CCACTACAGA CATCGGCCTG
GATATGAGTT TCTTTGATGG CAGAATCGGT GCTACGATCG ATCTGTATGA TACGAAGACC
AAAGATCTGT TACTGCAAAG AACACTGCCA TCTTCCTCCG GTGTAAGTAC TGTGATACAG
AACATTGGTA AGACCAGGAA CAGGGGAATA GAAATCGGTA TCAATACCGT CAATGTCAGA
GGCAGAAACT TCAGCTGGAA TTCCAATATC GTATTCTCCA GAAACAAAGA AGAGATTGTG
GCCTTGGCGG ATGCAAATAC CAATGATGTG GCGAACGGAT GGTTTATCGG CTACCCGGTG
CGCGTGTTTT ACGACTATGA AAAGACAGGT ATCTGGCAGA CCAAAGAAGC AGATGCCGCC
AGTGCCTTTG GTTATAAACC CGGCGATATC AAAGTGCGCG ATCAGGATGG AAATGGTGTG
CTGAATTCAC AGGATCGTGT GATACTCGGA AGACAGGTAC CTACCTGGAG TGGTGGTCTG
AATAACGATA TCCGCTTTAT GAATTTTGAC CTGAACGTTT ATGTGTTCGC CCGTATTGGT
CAATGGATCA ATTCAGAATA TGCAGCGAAA TATGATCCGC AGGGCCTTGA GAATAGTGCA
CCGCTGGATT ACTGGACGCC TGAGCACGAG ACAAACGCTT ATCCGCGTCC GAATGCCAGT
GTTTCCAAAG ATGGTACACC TTTCATCAGA ACACTGGGTT ACAAAGACGG TTCTTTCGTG
AAGATCCGTA ACATTTCCCT GGGTTATACA TTGCCGGGTT CAGCTTTGAA GAACCTGCAC
CTGACCAATC TGCGTGTCTA TGTAACGGGT AAGAACCTGT TTACCTTCAG TAAAGTGAAG
GACTATGATC CGGAGAGAGG AGGAGACCTG AGTAATCCGC TGACGAAAAT GTATGTAGCC
GGTCTGAACG TAGAATTCTG A
 
Protein sequence
MKKLLIICWF LLVSVSVTFA QRQTIKGRVS DAATGEPLVM VNVQLKGSTT GTQTDATGTF 
SLTVPDAQTG VLLISYLGYQ ALTQNIHGNS NIVVKLEKDN KQLDEVVVVG YGEVKKRDLT
GAVTSVKGEE LRKVPSTNVM ESVQGKLPGV DITKSNGAAG AKINVTVRGN RSIRADNGPL
YIVDGVQYEN IQDINPNDIQ SMEVLKDASS TAVYGSRGAN GVIIITTKKG ASGKPRVFVN
TYAGISQVAG YPAMSTGPQY VAQKREANRT TGRWKSEADD PLIFNAAEVA SIQNNLWTDY
ADMLIHDGLQ QDYQVGVSGG SEKTKVYLSL DYFDEKGIFK LDHLKRYSAR LNIDQTINDY
LKAGMQSQVT YYNQDNRRDP LNQANKIVPL NLPYDENGNL ILFPNMGSMI NPLADEQPNA
YQNNGRVTRT FVNAYVELTP LKGLSFRSNL GVTLDNARTG IFASKNTIER STASASRSQY
NSGNKRYISW ENVLNYTQKL GDHTFGLTGV ASLLSDQRDS SFLQGEGQLL PAQLYYAMQN
NVSGIAIRSS YMANKLISFT GRINYNYKGK YLLSLTGRSD GSSKLAPGNK WAFFPSVAGA
WRVSDEPFMK SQRVFSDLKI RASYGLAGND AVPPYATASY LTKIPFSYDD TNSALAYGIG
SQIGNRNLKW ELSATTDIGL DMSFFDGRIG ATIDLYDTKT KDLLLQRTLP SSSGVSTVIQ
NIGKTRNRGI EIGINTVNVR GRNFSWNSNI VFSRNKEEIV ALADANTNDV ANGWFIGYPV
RVFYDYEKTG IWQTKEADAA SAFGYKPGDI KVRDQDGNGV LNSQDRVILG RQVPTWSGGL
NNDIRFMNFD LNVYVFARIG QWINSEYAAK YDPQGLENSA PLDYWTPEHE TNAYPRPNAS
VSKDGTPFIR TLGYKDGSFV KIRNISLGYT LPGSALKNLH LTNLRVYVTG KNLFTFSKVK
DYDPERGGDL SNPLTKMYVA GLNVEF