Gene Cpin_3959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3959 
Symbol 
ID8360132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4921603 
End bp4924737 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content47% 
IMG OID644966133 
ProductTonB-dependent receptor plug 
Protein accessionYP_003123622 
Protein GI256422969 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.734415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.166506 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTT CTACAGGGCT GGCGCTGATC ATGCTATTGG CGGTTTGCTC TACTTCCTTT 
GCCCAGAAAG GCAAAACCCT TCGCGGCCAG GTCGTTGCCG GTGATACGCA GCAGCCATTG
GAGCGTGTGG TCATAGTAGA GAAAGGCTCC CAGAATCATG TATTTACAGG CGCAACTGGT
GATTACTCCA TTACCCTCAC CAGTGAGAAT GCAACACTCA TTTTTTCCTA TGTAGGATAT
GCTACGCAAC AGTTGCCCGC AGGTTCAGGG GACATCCTTA ATGTGACCAT GCAGTCTTCT
CAGAAAGATC TGCATGAAGT GGTAGTCACC GCATTTGGCG TTAAAAAGGA AAAACGCGCT
TTGGGTTATA CCATTCAACA GGTAGACAAC AAAGACCTTA ACGTTAACCA TCAACCCAAC
GTAGTGAATG CGCTGCAGGG CAAAGTAGCA GGTGTGCAGA TATCCAGTTC AGGCGGTGGT
CCTGGTCAGG GTTCCCGCAT TATTATCAGA GGTATTAATT CCATTTCGGG CGATAGAAAT
AACCAGCCTT TGTTTATCGT AGACGGGGTG GAAATTGATA ATAATACGTA TACGACCGGT
GGGGCAGAAA CCCGTGGTAT GAGCAACCGT GCTGCCGATA TTAACCCCGA CGATATTGAA
AGCGTCTCTG TACTGAGAGG TGGCGCAGCA ACGGCTTTGT ACGGTATCCG TGCGGCTAAC
GGCGCGATCG TCATCACGAC GAAGTCAGGT AAGGCGGGCA AACTCCAGGT GACCTATAAC
GGTATGTATG GTTTTGATAA AGTGAATAAG ACGCCGGAGG TACAGTCGAA ATTTTCGCAG
GGATACCTGG GCGAATATGA TCCGGCCAGC TTCTGGCCTA CCTTTGGTCC GACCGTAGAA
GAAGCAAAGA AACTGGATGA TACGCATCCT GATCAGCTGT ATAATAACTA TAAACAGGCT
TTTAAAACAG GTACGCAAAC TCGTCATACG GTGAATGTCA GTGGTGGTAC CGACAAGGCG
CAACTGCTGG GTTCCATGTC TTACCTGGAC CAGGATGGCG TAATCCCTTT CAGCACCTTT
ACCAGCTATA ATGCGAGGAT CAATGGTCAG TTTAAGATCA GTGAAAAGTT TAGTGCAGGC
GTATCGCTGA ATTATATCAA TTCCGGTGGT AACAGGGTGA ATGCTGACCG TTTCGGGGAG
CAGATTATCT ACTGGTCTCC GCGTTGGGAT ATGAAAGATT ATGTAAAGCC GGATGGTACG
CAACAGACTT ACAGCAGTGG TACGAACAAC CCGATCTATA CGCTGTCAAC CAACAAGTTC
AGAGACAATG TGAACCGTAC GATCGCCAGT ACATTCATCA TGTATAAACC TGCTTCCTGG
CTGAATTTTT CCTATCGTAT CGGTAATGAT TTCTATACAG ATGGCAGGGT ACATCAGGCG
CCTGGTCCGC TGGGATTGGT AGGAGAAGCG CCTAACCTCG ATGACAATGA ATACGGTTTT
ATAAATGAAT ATAATCTCCG GAGCCGTACC CTCACATCCA CCATAATGGC AAATGTCACA
CGTACATTTA AAGAGAAATA TTCTATTGAC CTGAAGATCG GGCATGACCT CCGGGATCAG
CGGCTGAGAA GGAATAGTGT GATGGGGGAT ACGCTGGTGG TACCGGATCT TTTTCTGTTG
AATAATGCTA AACGCGTAAG AGCAGAATCC TATATCTACG ATTACAGGAA CTATGGTTAT
TTCGCTGATC TGACGCTCGG ACTGAATAAC TACCTGTTCC TCGAACTGAC CGGCAGAACT
GACCTGACGT CTACTTTATC CATCAATAAC CGCAGTTATT TCTATCCCTC TGTCAGCCTG
AGTTACATCT TCAGTGATCA GTTCAAACTA CCAGACTGGT GGACATATGG TAAGTTCAAA
GCTTCCTGGG CGAAGATCGG TAAGGATGGA GATGCGTACT CCATTACCAA TGGTTTTAAG
GCGGGTACAA CCATCGGTTC CAGCGTGCCT TTCTACCAGA ACAGGACATT GGGTAATCCT
AACCTGCGAC CGGAGTTTAC ACAGACAACA GAGCTCGGTA CGGAACTGAG ATTTCTGGAA
GGTAGATTGG GTCTGGAAGC GGTCGTTTAT CAGCAACAAA GTAAAGACCT GCTGGTACCG
GTAGACGTAT CTACCACGAC AGGTTTTGAC AAGGCATATG TCAATGCAGG AGAGATCAGT
AACAAAGGAC TGGAGCTTAC TTTGTCTGTA GTGCCGGTGC GTACAAAAGA CTTCAGCTGG
GATTTCCGGG TGAACTTTTC TACCAACAGC AACAAGGTAA AGAAACTGAA TGAAGAGCTG
CAACTGAGTG AGATCGTGTT GTCTTCTCAA TATGGTTATC TGAGTTCTAC GGTGACCACC
AAACTTGTTC CCGGACAGTC ATATGGTGCG CTGTATGGTA GGACTTACAA GCGTTATTAT
GGCAATGAAC CAGACGATAA AAAGACACTG CGCACGGATC TGCCTTTGCT GATAGGCGCG
AATGGTTTCC CTGTGCTCGA TGATGCTTCT AATCAGCGTT ACCTGGGAAG TACACTGCCT
AAATGGATTG GTAGCACCAC ACAGACTTTC CGCTATAAAC AGCTGTCATT GTCGTTATTG
CTGGACGTGC GTCATGGCAA TTACAAGTAT AATCAGCTGT CTAATTTCCT GGCTGCATTC
GGAGAATCAA AACAGACAGA AAACAGGGAT CAGACCATGG TGTTCAATGG TGTACTGGCG
GATGGTACGG CGAATACCAA AGCGGTGTAC CTGGGACAGG CAAAAGGTCC GGATGGGGTA
GACTACGGCG CTGGTTTTTA CAGAAACTAT TATCGTGGCG CGAGTGAAAA CTTTATTGAA
GATGCTTCCT GGGTCAGACT GCGTTCCCTG TCATTGTCCT ATACATTGCC GGCAAAGTGG
CTGACGCCGA CAAAGACAAT CAGTGGAGCG ACGGTATCCT TCACGGGTAA TAATCTCTGG
CTCCATACGA AGTACAGCGG TTTTGATCCG GAGACCAGTT CTTCACCTTC GGGCAGTAAT
GCGTCAGATG CATTCTCCGG ATTTACTTAT CCGGCTACCC GCAGTTTCCT GTTCAGCCTT
AATCTTCAAT TCTAA
 
Protein sequence
MKVSTGLALI MLLAVCSTSF AQKGKTLRGQ VVAGDTQQPL ERVVIVEKGS QNHVFTGATG 
DYSITLTSEN ATLIFSYVGY ATQQLPAGSG DILNVTMQSS QKDLHEVVVT AFGVKKEKRA
LGYTIQQVDN KDLNVNHQPN VVNALQGKVA GVQISSSGGG PGQGSRIIIR GINSISGDRN
NQPLFIVDGV EIDNNTYTTG GAETRGMSNR AADINPDDIE SVSVLRGGAA TALYGIRAAN
GAIVITTKSG KAGKLQVTYN GMYGFDKVNK TPEVQSKFSQ GYLGEYDPAS FWPTFGPTVE
EAKKLDDTHP DQLYNNYKQA FKTGTQTRHT VNVSGGTDKA QLLGSMSYLD QDGVIPFSTF
TSYNARINGQ FKISEKFSAG VSLNYINSGG NRVNADRFGE QIIYWSPRWD MKDYVKPDGT
QQTYSSGTNN PIYTLSTNKF RDNVNRTIAS TFIMYKPASW LNFSYRIGND FYTDGRVHQA
PGPLGLVGEA PNLDDNEYGF INEYNLRSRT LTSTIMANVT RTFKEKYSID LKIGHDLRDQ
RLRRNSVMGD TLVVPDLFLL NNAKRVRAES YIYDYRNYGY FADLTLGLNN YLFLELTGRT
DLTSTLSINN RSYFYPSVSL SYIFSDQFKL PDWWTYGKFK ASWAKIGKDG DAYSITNGFK
AGTTIGSSVP FYQNRTLGNP NLRPEFTQTT ELGTELRFLE GRLGLEAVVY QQQSKDLLVP
VDVSTTTGFD KAYVNAGEIS NKGLELTLSV VPVRTKDFSW DFRVNFSTNS NKVKKLNEEL
QLSEIVLSSQ YGYLSSTVTT KLVPGQSYGA LYGRTYKRYY GNEPDDKKTL RTDLPLLIGA
NGFPVLDDAS NQRYLGSTLP KWIGSTTQTF RYKQLSLSLL LDVRHGNYKY NQLSNFLAAF
GESKQTENRD QTMVFNGVLA DGTANTKAVY LGQAKGPDGV DYGAGFYRNY YRGASENFIE
DASWVRLRSL SLSYTLPAKW LTPTKTISGA TVSFTGNNLW LHTKYSGFDP ETSSSPSGSN
ASDAFSGFTY PATRSFLFSL NLQF