Gene Cpin_2820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_2820 
Symbol 
ID8358981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp3473850 
End bp3477023 
Gene Length3174 bp 
Protein Length1057 aa 
Translation table11 
GC content46% 
IMG OID644965000 
ProductTonB-dependent receptor plug 
Protein accessionYP_003122500 
Protein GI256421847 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.195725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGC GTAATTCCAC GTGTGTACTG CTATGTATTG CAGTCATCCT AAGCATTATT 
CCTTTATTGC TGCATGCCCA GCAAAGCCGC TCCGATGTAT CGGGTATTGT AAAAAGCGAA
GAAAACGGCG AACCGCTCTG GGGTGTCAGT GTCACCGTCA AAAATGCAAC AAACAGTTTT
ACCGCTTCTG TACAAACCGA CAGCGCCGGC TATTTCACCT TCACCAGATT GCCGGCAGGT
CCGGGGTATA CTTTCTCTTT TTCCTTTATG GGATATGAGC GACAAACCCT GTCAGGCTAT
AACCTGAAAG CAGGCGCTGA ATTCTCTTTA CGGGTGGATC TTAAAAACAG CGAACAGAAG
ATAAATGAAG TCGTGGTAGT AGGATATGGC TCCATGCAGA AAAAAGACCT TACCGGCTCT
ATCAGTCAGC TGAAAACAGC CCGCCTGGAA AAGGAGAGTC CCCGCTCTAT ACAGGACCTG
CTGCGTAGCG GTGTGCCTGG ATTGTACGTC GGACAAAATA CTTCCGCAAA GGGTGGGGGA
GACATGCTGG TACGCGGACA ACGTTCCCTG AAAGCAAGTA ATGATCCGCT GATCGTACTG
GATGGCGTGA TCTTCTTCGG GGAATTATCT GAGATCAATC CACAGGATAT CGAACACTTT
GATATCTTGA AAGATGCCTC CGCCGCCGCT ATATACGGGG CTAAATCAGC CAATGGAGTG
GTCATCATTA CAACGAAAAA AGGTACTTCC GATAAACCTA CTATCCGCTT TGACGCCAAC
AGAGGTGTTG CTGCCATGGG TAAAAAACGC GAGGTATATG ATGCGGAGGG ATATCTGAAA
TTCCGCTCCG ACCTCTTTAA TAGTGGTACA AGATGGGCTA CGCCTGCTAA ATATGTAAAT
CCTACACCAG AGAATCTTAA TAAATATGGC GTCTCCCTGG ACGACTGGTT AGCTTACGAT
GCGCTGACCG GAACGCCTGA AGAGATCTGG TTGCTGCGTA TCGGTCTTTT TGAAAAAGAA
AGGCTTAATT ATTCAAAAGG CAGAACCTAT GACTGGTTTG ACGGCTCCTT CCAGCATGGT
GTACAGCAGA ACTATAATGT AAGTCTGTCA GGCAGAAATA AAGACGCCCT CAATTACTAT
ATCTCCATGG GATACCTCAA TAATGAAGGT ATCGTGGTGG GTGATAAATA CCGTGCATAC
CGGTCGAATA TCAAACTGGA CGGAAAGGTG AATAAATGGA TGCACACCGG CGTAAATATC
AATTTCCAGG ATAGATCAGA TGGTAATCTG GCTGCGGACT GGGAAGGACA GATTATTAAT
AACTCTCCCT ATGCCTCTCC GCTGGATTCA AATGGCATCC TTGACGGACA GCCCATGGGC
GCCAGTGTAA ACCAGGGTGT CAATACAGCT TATGGTAATC AGTTCAAGCA ACTGGAGAAA
GGATTTACGG TGTTAAATAC CACCATTTAC CAGACGATCA AACTGCCATT CAACATTACC
TATCAGCTGA ATTTCTCCCC AAGAATGCAG TGGTTCTATA ACAGGTATCA TGAATCCTCG
CAGAACCCAT TATGGTCTGA TAACGGAAAA GTAATCAGGG AAAACACCAA AAACTTTGAC
TGGCAGATCG ATAACATCAT TTCATGGGAT TATACCTTCG CCCGTAAACA TAAAGTGAAA
GTGACCCTGC TTCAGAACGC AGAAGAACAC CGTTCCTGGA ATGAAAGTAT TACTGCCAGG
GATTTTTCTC CTACAGATGC CCTGGGTTTC CATAATATCG GTGCTGCCAA TCCTTTAAAG
ACGACTGTCA GCAGTAATGA CCAGCATAGT ACCGGTGATG CGTTAATGGC CCGCCTGTTC
TATTCCTATG ATAACAGGTA TATGATCACG GCTTCTGTAC GCCGGGATGG TTACTCTGCA
TTTGGCGCTT CCAATCCAAG AGCCACTTTC CCGGCCCTCG CTTTTGCCTG GAACTTTGCA
GATGAACGTT TCTTCCACTG GTCGCCTATG AGTACCGGTA AACTGCGTTT GTCATGGGGT
ATGAACGGTA ACCGCTCCAT CGGTATATAC CAGGCATTGT CTAATCTGAC CACCGGTGCA
GGCCGTTATC CTTATGTGCA ATCGAATGGC ACGGTGTATG AATTGTCTCA GTTGTACGTT
GACCGTATGG CCAACTATGG TCTGAAATGG GAAGCTACTT CCTCCTGGAA TGCCGGTCTG
GATTTCGGAT TCCTCAATAA CAGGATTACC GGTAATATGG AAGTTTACTA TATGCCTACC
ACCGATCTCC TGATGGATCA GTCCCTGCCT GATTTTACCG GTTTCAGTAC CGTAACGACC
AATCTCGGGG AAGTGGTGAA CAGAGGCTTT GAATTAGGAA TCACTTCTCA GAACATACAG
AAAAAGAACT TTGAATGGAG TACCACATTT GGCTTCTTCT TTAATAGAAA TAAGGTGAAG
CACCTCTACT ATACCTATGA AGACGTATTA AATGCAGATG GTAAATTAGT CGGCTCCAAA
GAGATTGACG ATATCTCTAA CGGATGGTTT ATCGGTCACG ATCTGTCCTC AATATGGACC
TATAATGTAC AGGGAATCTG GCAGGAAAAT GAAAGAGAAC AGGCGGCTAA ATATGGAGAG
ATCCCCGGGG ATGTGAAAGT GGAAGATGTG GATGGAGATG GAAAATATAC CAATGCCGAT
AAGAAGTTTT TGGGTACGAC CACACCACGC TTCCGCTGGA CGCTGCGCAA TGACTTTGCT
ATCTTCAAAA ACTTTGATTT CTCTTTTAAT ATCTATGCGA ACTGGGGACA TAAGGCGACT
TCCACCGATT ATCTGAATAA CTTCGGTTCG CAGACGGACA GGATCAATTC TTATGTGAGA
AAATACTGGA CGCCTGAAAA TCCTTCTGAT ACTTACGCCA GACTGAATTC CACCAATGTG
CAGAACATCA CGCCTTCCCG TGTGATTGAT AAAACATATA TCCGCCTTGA TAACATATCG
GTGTCTTATT CGCTGCCTCC TGATGTAGCC AGGAGAATGG ATATGGCCCA GCTCAAGGTA
TATGCAGGTG TACGCAACGT AGCGGTATGG GCGAAAGACT GGGAATACTG GGATCCGGAA
ACCACTTCAT TAATGCCCCG GTATTATACA GTCGGAATTA CAGCCTCATT TTAA
 
Protein sequence
MKQRNSTCVL LCIAVILSII PLLLHAQQSR SDVSGIVKSE ENGEPLWGVS VTVKNATNSF 
TASVQTDSAG YFTFTRLPAG PGYTFSFSFM GYERQTLSGY NLKAGAEFSL RVDLKNSEQK
INEVVVVGYG SMQKKDLTGS ISQLKTARLE KESPRSIQDL LRSGVPGLYV GQNTSAKGGG
DMLVRGQRSL KASNDPLIVL DGVIFFGELS EINPQDIEHF DILKDASAAA IYGAKSANGV
VIITTKKGTS DKPTIRFDAN RGVAAMGKKR EVYDAEGYLK FRSDLFNSGT RWATPAKYVN
PTPENLNKYG VSLDDWLAYD ALTGTPEEIW LLRIGLFEKE RLNYSKGRTY DWFDGSFQHG
VQQNYNVSLS GRNKDALNYY ISMGYLNNEG IVVGDKYRAY RSNIKLDGKV NKWMHTGVNI
NFQDRSDGNL AADWEGQIIN NSPYASPLDS NGILDGQPMG ASVNQGVNTA YGNQFKQLEK
GFTVLNTTIY QTIKLPFNIT YQLNFSPRMQ WFYNRYHESS QNPLWSDNGK VIRENTKNFD
WQIDNIISWD YTFARKHKVK VTLLQNAEEH RSWNESITAR DFSPTDALGF HNIGAANPLK
TTVSSNDQHS TGDALMARLF YSYDNRYMIT ASVRRDGYSA FGASNPRATF PALAFAWNFA
DERFFHWSPM STGKLRLSWG MNGNRSIGIY QALSNLTTGA GRYPYVQSNG TVYELSQLYV
DRMANYGLKW EATSSWNAGL DFGFLNNRIT GNMEVYYMPT TDLLMDQSLP DFTGFSTVTT
NLGEVVNRGF ELGITSQNIQ KKNFEWSTTF GFFFNRNKVK HLYYTYEDVL NADGKLVGSK
EIDDISNGWF IGHDLSSIWT YNVQGIWQEN EREQAAKYGE IPGDVKVEDV DGDGKYTNAD
KKFLGTTTPR FRWTLRNDFA IFKNFDFSFN IYANWGHKAT STDYLNNFGS QTDRINSYVR
KYWTPENPSD TYARLNSTNV QNITPSRVID KTYIRLDNIS VSYSLPPDVA RRMDMAQLKV
YAGVRNVAVW AKDWEYWDPE TTSLMPRYYT VGITASF