Gene Cpin_4416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4416 
Symbol 
ID8360589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5505487 
End bp5508477 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content47% 
IMG OID644966575 
ProductTonB-dependent receptor plug 
Protein accessionYP_003124063 
Protein GI256423410 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000494503 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTAA AAATTCTACG ATTACTGAGC TGCTTTGCCC TGATCTTAGT CCACGCTTAC 
CCGGTACTTG CCCAGAACAG AGTCATAGAA GGTAAAGTTA TTGACTCTTC CAGCAACAGT
CCATTACCAG GAGTGACTGT CAGAATTGCA GGTACCAAAT CGGGCACCGC CACCGACATC
AACGGCGTAT TCCGGCTGAC GCTTACGGCT GATGCCCGTG AACTTATTTT TTCTTCCATT
GATTATGCAC CCCAGACAGT TGCTATTGCC AACCAAAGTA CTTTCTCTGT AAAACTTGCT
CCGCTCAGCC GTGAACTTAA AGGCGTTGAA GTGGTGAGTG TAGGTTATGG TACAATGGAC
AGAAAGGAAG TTTCCAGTGC GATTACGCAT GTGCGGGCGG AAGATCTGAA ATCTGTTGCT
GCCAATAACC CGCTGATGTC CTTACAGGGA AAGGTAGCCG GTTTAACCGT ATCCAACACC
TCCACGGCTG ATCCTAACTC CTCTCCGAGC ATCCAGCTGC GAGGCGTATC CTCCCGTAGT
GCGGGTCTGG GACCTTTGTA TGTCATCAAT GGCGTACCAG GTGGTAATAT CGACAACATT
AACCAGAACG ATATCGAATC CATCGACGTG CTGAAAGGCG GCGCCGCTTC AGCGATTTAC
GGTACGAGAG GTAGTAATGG TGTCATCATC ATCACAACCA AGAAAGGCAC TGGTGATTCC
AAAATGGTAT ACAGTGGTTA TGTGAGCATG GATCGTCTCA CCATGAAGCC TGAAGTACTC
ACGGCAGAAG AGTTCCGGAA GTTCCGTGTC GGCGCTACAC CGGCACAGGG TATCGACTAT
GGCGGTAATA CCGACTGGAT GAAATCAGTC ACCCGTGAGC CTGCGTATGC ACAGAAGCAT
ACCCTGCAGA TTTCCGGTGG AACGGCCAGG GATAACTATT TCGCTTCGGC TGATTACAGG
AATGCGGATG GTATTGACCT GCGTGCTTCC AAAAAGGAAT ATGGCGCACG TCTGAACCTG
AATCATACTT CCAAAGAAGA TGTTTATACG GTTTCCTTCA ATGTTGCACC GAGGTATATG
AAAACCAGTA ATGCAGACCA GGGAAACTTC AATAATGCGC TTACGCTGAA TCCGACGCTG
CCTATTTACG ACAGCACCGG CTATCGTTAT ATCAATACCG GCTTTTTCTC CAATAATCCG
GTGGAGAATG CACGGCTGAT CAAAAGTCAG GCGGAGATCA AACAACTGGA TATGAGCGGC
TCTTTCCGGG TCAATATCAC CAGGAATCTG AACTCCATGG TGACGATCTC CAACATCCGG
TCTTCCTACA AGAATCTGTT CTTCTCTCCT TCTACCCTGA CGACGATTGT ACATGCAGGG
CAGGTAACGA AGACGAACTA TGCTTCACAG GAGCAGCAGG ATAATGACCA GCGTAATCTG
GAATGGACCA TCAATTATGC GTTGAATCTG AACAGGAATC ACTTTAAGTT CCTGGGCGGT
TATTCTTACA GCTACTTCAA TTACCAGCAA TTCGCGGCTA ACAACTATGA TTTTCCATTC
GATACTTTCC TGTGGAATAA TCTGGGTTCC GGTACCTGGA ATGGAGGTGA TAAAGGCAAC
GGACAAAGCG CAGTGTCTTC CACACAGAAT GACTCCAGAC TGGCCGCATT CTTTGGCAGG
ATCAACTATG ATTATGATAA TAAGTATATC CTGACGGTGA GTCTGCGTCA TGAAGGTTCT
TCTAAATTCG GCAGAAACAA CAAATGGGGT AGTTTCCCGG CGGTATCTGC TGCATGGCGT
ATCTCTGACG AGGATTTCCT GAAGGATAAA ATCAGCTGGC TGAATGACCT GAAACTGAGA
GCCGATTACG GTGTGACCGG TAACCAGGAT TTTGGTAATT ATCTGTCCTT ACTGTTGTAT
GGCGGATATG GTTATTTCCC TTTCAACGGC ACGACTTACC AGGTGTATGG ACCTTCGCAG
AATATCAATC CTAACCTGGG TTGGGAAAAA GCAATCAACT TTAACGTAGG TCTTGACTTT
GATCTGTTTA AAAGCCGTGT GTCAGGTTCG GTGAACTACT ATGTACGTAC CAACAAGGAC
CTGCTGGGTT ATTACAATGT ACCGTTGCCA CCCAATCCGC AGTCACAGAC CTTCAGCAAC
GTGGGTACTA TGAAAAACAC CGGTCTGGAA ATCCAGCTGA ACGGATCCGT GGTACAGTCA
AAAGACTTCA GCTACAATGT GTCTTTTGCG ACGGCTTTAA ATAATAACAA ATTCGTCTCT
TTCTCTAATA AACTGTACCA GGGAGCACCA TTCCAGGATG TAGCGGGATT ACCTGCACCA
GGTAGTCCGG GTAACATCCA GCGTTTACAG GAAGGACATC GTATCGGGGA CTTTTACATG
CTGAAGTCAG CAGGTGTAAA TGAGAATGGC GCATTGCTGG CGTATAAAAA AGACGGCAGT
ATTGTAACGG CTAACCAGAC TAATGCAGAT GATAAGCAGT TTGTGGGGAA TGGTTTGCCG
AAATTCACCG GTTCATTAGG TAACACTTTC CGTTATAAGA ACTGGGACCT GAATATCTTC
TTCAGGGGTA ATTTCGGCTA CAAGCTGTTC AACATGCAGG CATTTTATGT GGGTACGCCG
GCTACACAGT CCGACGCCAA TACGCTGAAA TCTGCCTACG ATCCGAAGAG CAAGTATGCT
AAACTGACCA GTTCTTCTAC CACAGCCCTT GCTTCCGACT ATTTCCTGGA ATCGGGTTCT
TTCGTGAAGA TCGACAACGT ATCGCTGGGT TATGGCCGTG AAGTCAATTC CAAGTACCTG
CATGCGTTCA GGGTATATGC TTCAGCGAGC AACCTGCATA CGTTCACCAG CTTCAAGGGA
GGCGATCCTG ATCTGTACCC TATTAACGGC TTAACGCCTG GCGTGCAGGG TAATCTGAAT
TTCTATCCGG CGACAATTCA GTTACTGGGA GGCTTACAGG TAACATTCTA A
 
Protein sequence
MFLKILRLLS CFALILVHAY PVLAQNRVIE GKVIDSSSNS PLPGVTVRIA GTKSGTATDI 
NGVFRLTLTA DARELIFSSI DYAPQTVAIA NQSTFSVKLA PLSRELKGVE VVSVGYGTMD
RKEVSSAITH VRAEDLKSVA ANNPLMSLQG KVAGLTVSNT STADPNSSPS IQLRGVSSRS
AGLGPLYVIN GVPGGNIDNI NQNDIESIDV LKGGAASAIY GTRGSNGVII ITTKKGTGDS
KMVYSGYVSM DRLTMKPEVL TAEEFRKFRV GATPAQGIDY GGNTDWMKSV TREPAYAQKH
TLQISGGTAR DNYFASADYR NADGIDLRAS KKEYGARLNL NHTSKEDVYT VSFNVAPRYM
KTSNADQGNF NNALTLNPTL PIYDSTGYRY INTGFFSNNP VENARLIKSQ AEIKQLDMSG
SFRVNITRNL NSMVTISNIR SSYKNLFFSP STLTTIVHAG QVTKTNYASQ EQQDNDQRNL
EWTINYALNL NRNHFKFLGG YSYSYFNYQQ FAANNYDFPF DTFLWNNLGS GTWNGGDKGN
GQSAVSSTQN DSRLAAFFGR INYDYDNKYI LTVSLRHEGS SKFGRNNKWG SFPAVSAAWR
ISDEDFLKDK ISWLNDLKLR ADYGVTGNQD FGNYLSLLLY GGYGYFPFNG TTYQVYGPSQ
NINPNLGWEK AINFNVGLDF DLFKSRVSGS VNYYVRTNKD LLGYYNVPLP PNPQSQTFSN
VGTMKNTGLE IQLNGSVVQS KDFSYNVSFA TALNNNKFVS FSNKLYQGAP FQDVAGLPAP
GSPGNIQRLQ EGHRIGDFYM LKSAGVNENG ALLAYKKDGS IVTANQTNAD DKQFVGNGLP
KFTGSLGNTF RYKNWDLNIF FRGNFGYKLF NMQAFYVGTP ATQSDANTLK SAYDPKSKYA
KLTSSSTTAL ASDYFLESGS FVKIDNVSLG YGREVNSKYL HAFRVYASAS NLHTFTSFKG
GDPDLYPING LTPGVQGNLN FYPATIQLLG GLQVTF