Gene Cpin_4523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4523 
Symbol 
ID8360696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5638528 
End bp5641524 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content49% 
IMG OID644966678 
ProductTonB-dependent receptor 
Protein accessionYP_003124166 
Protein GI256423513 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0414328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0764127 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCTAC CCAAAACCGG CTTGCTTGTA TTGCTGTTCT TATGCAGTAC ACTGGCCTTA 
TTCGCGCAGC AAAACATCTC TGGTAAGATC AAAGATGCTG TGAGCGGGAA GCCCATTCCC
GGCGTCACAG TCAGAATACA GGGCACTAAC AGAGGAACTA TCTCGGATGC CACCGGCGTC
TATTCTTTGT CCGTTCCCTC CGGAAACGCC ACCTTACTCG TAACCTTTAC AGGTTACAAG
ACACAGACCA TTAAAGTAAG TCCGGGCGCC AATGTTGGTG ATATCACCAT GGAAGAGGAC
TTCGCAAAAC TGGATGAAAT TGTGGTAACA GGTCTCGCTA CCAGCGTAAA ACGTAGTAAC
CTGGCTAATA CCGTGGTAAC CATCAATGCC AATCAACTGG CCGGTACCGC TCCGGCACAG
ACCTTTGACG CCGCCCTGAG CGGTAAGGTG CCTGGCGCCC TGATCACCGC TAACTCCGGT
GCGCCGGGTG GAGGTATCTC CGTTAAAATG AGAGGTATCA CTTCCGTATT TGGTAACTCC
CAGCCCCTCT ACGTAGTAGA CGGTATCTTC TTCAATAACA GCAGTATCCC TGCTGGTTTG
AATGATGTTA CCGGCGCCGC TACAGCGGGT AACCCCAACA ACCAGGATAA TCCTTCCAGC
CGTATCGCTG ACCTGAACCC CGCAGATATT GAGAATATAG AAATCCTCAA AGGCGCCTCC
GCTGCCGCAT TGTATGGTGC GAAAGCAGCA GCTGGTGTGG TCATCGTAAC GACTAAAAAA
GGTCGTGCCG GTAAAACAAA AGTGAGCGTC AACCAGGAAA CCGGCTTTGC CAAAGTAAGA
CACCTTATGG GCGTTCGTAC TTTTGATGCA GAAAAAGCAG CTGATCTTGC CGGTGCAGCC
AGTAACACTG ATCCGGTTGT ACAACAACGC CGTCAGGCAT ACAGAGACCA GTTCAATGCC
GCCGCCGCAG CTGGTAAGAT CTATGACTAC GAAAAAGAAA TGTATGGCGA AACCGGTCTG
ATACTCAATA CCGGTCTCTC TATCAGTGGC GGTAATGAAA AAACGACATT CTATATGTCC
GGTAACCGCA GACAGGAACA TGGTATCGTG AAGAATACCG GCTATTTCAA CAACTCCGCC
CGACTGAATA TTGATCATAA ACTGTCAGAT CGTATCTCCC TGGGTGTTAC GATGAGTTAT
ATTCACTCTG ATGCAGACAG GGGACTGACC AATAACGATA ACAACGGTGT CACCTACGGT
GTCGCCCTGT CATCTACACC TACTTTCGTG GACCTGTTCC CGAATGCATT AGGTGAATAT
CCGCGTAATC CTTTTGCGGC TTCCAATCCG CTGGAAACAC GCGATAAAAT GACCAATAAT
GAGGTGACCA ACCGTTTTGT AGGTGGTGCT AACCTGGAGG TGCGTTTACA GCAAAGCGAG
CATTCTTCGA CTAAGTTTAT TGGTCGTGGT GGTGTGGACT ATTTCAACTA TAAGACCGCT
GCCTTATTCC CCCGCGATCT TCAGTTTGAA GAAAATGCCC TGCAGGGACA CTCCATCCAG
GGTAATACCA ATAACACCAA TACAAACCTC GGTGGTTTCC TGACCAATAC ATTTACACCG
AATGAAAATA TAGGTCTTAC CAGTACCCTG GGCGCTACCC TGGAAACCGG TTACATGGAT
AACATCGTCA CTGCGGCTAC TAACCTGGTA TCCGGACAAA GCAACCTGGA CGCCAGTGCC
AATACTACCA CCCGTCAGTT CAGACAGAAA TACCGCGATA ACGGTATCTT CCTGCAGGAA
GACCTCTCTT TACTGGATGC GATCACCTTG AGTGGTGGTG TGCGTTTTGA CCGTTCTACC
AATAACGGCG ACTATAAGAA ATACTATGTA TTCCCTAAAG GCGCTGTTTC CTGGAACATT
GCCCGTATGA AATTCTGGGA TGTGAAAAGT ATCCAGAGTC TGAAACTGCG TGCCGCCTAT
GGTCAGTCAG GTAACGTACC GCCTTACGGC AGTAAGTTCA CCGCCATGTT GGGTAGCAAT
ATCGGTGGAT TACCAGGTGT ACTCGTAGAC AACCTGCTGG GTAACTCAGA TATCAAACCG
GAACGTCAGT CAGAGTTTGA AGCAGGTCTG GATCTGAGTG TGCTGGACGG TCGTGTTACC
CTGGAAGCAA CCTATTACAA TAAACAGATC CAGGACGTAT TACTGCGCCA TGCATTACCT
GGTTCTTCCG GTTATGCGAA TGAGTGGAAG AACAGTGGCG ACCTGAGAAA CAAGGGTATT
GAGCTGGGGC TGACAGTGAT TCCTGTCAAC ACCAAAACAG TGAAATGGAG CTCTACTATC
AACTGGTGGC GTAACCGCTC CAGAGTAACC AAATTGCTGA TCCCTCCATA TGCGATCGGT
GCGTTTGGTA ACTCACTGGG TACTTTCTAC CTGGAAGAAA ACGAGCCGGC TACGCAGATC
AAAGGAACTG TGGGTAGCGA ACTGAAACTG ATCGGTAACT CTGAGCCGAA ATTCCAGATG
AGTTTCTTCG ATGAACTGAC CATCCTGAAA AATATCTCTA TCCGCTTCCT GATACACTGG
AAGAAAGGCG GAGATAATAT CAACCTGACG CAGTTGCTGA CTGACCTCGG TCAGACTTCA
TTTGACTATG ACGATATCCT CCATGGACAG AAAGCAGGCA TGTACCGTAC AGGTGCAGGC
GATGCGTCTA TCTATGTACA GGATGCCTCT TATGTACGTA TCCGTGAGAT CGGTGTTTAT
TACAACGTGC CGCTGCGGAA TACCAATATC ATCAAAGGCA TTCGTCTGGG TGTGTCAGCC
AATAATTTTT TCACCTGGAC AAGTTATGTA GGATATGATC CGGAAGTGTC CAACTTCGGT
AGTAACACGA TTACGACCTC TACCAGCCGC GGTAGTAACG GTCTGTCTAC CGGTGTGGAC
GTAACGCCTT TCCCTGCTTC CAAAAGAGGT AGTTTCCATA TAGGCGTTGA TTTCTGA
 
Protein sequence
MLLPKTGLLV LLFLCSTLAL FAQQNISGKI KDAVSGKPIP GVTVRIQGTN RGTISDATGV 
YSLSVPSGNA TLLVTFTGYK TQTIKVSPGA NVGDITMEED FAKLDEIVVT GLATSVKRSN
LANTVVTINA NQLAGTAPAQ TFDAALSGKV PGALITANSG APGGGISVKM RGITSVFGNS
QPLYVVDGIF FNNSSIPAGL NDVTGAATAG NPNNQDNPSS RIADLNPADI ENIEILKGAS
AAALYGAKAA AGVVIVTTKK GRAGKTKVSV NQETGFAKVR HLMGVRTFDA EKAADLAGAA
SNTDPVVQQR RQAYRDQFNA AAAAGKIYDY EKEMYGETGL ILNTGLSISG GNEKTTFYMS
GNRRQEHGIV KNTGYFNNSA RLNIDHKLSD RISLGVTMSY IHSDADRGLT NNDNNGVTYG
VALSSTPTFV DLFPNALGEY PRNPFAASNP LETRDKMTNN EVTNRFVGGA NLEVRLQQSE
HSSTKFIGRG GVDYFNYKTA ALFPRDLQFE ENALQGHSIQ GNTNNTNTNL GGFLTNTFTP
NENIGLTSTL GATLETGYMD NIVTAATNLV SGQSNLDASA NTTTRQFRQK YRDNGIFLQE
DLSLLDAITL SGGVRFDRST NNGDYKKYYV FPKGAVSWNI ARMKFWDVKS IQSLKLRAAY
GQSGNVPPYG SKFTAMLGSN IGGLPGVLVD NLLGNSDIKP ERQSEFEAGL DLSVLDGRVT
LEATYYNKQI QDVLLRHALP GSSGYANEWK NSGDLRNKGI ELGLTVIPVN TKTVKWSSTI
NWWRNRSRVT KLLIPPYAIG AFGNSLGTFY LEENEPATQI KGTVGSELKL IGNSEPKFQM
SFFDELTILK NISIRFLIHW KKGGDNINLT QLLTDLGQTS FDYDDILHGQ KAGMYRTGAG
DASIYVQDAS YVRIREIGVY YNVPLRNTNI IKGIRLGVSA NNFFTWTSYV GYDPEVSNFG
SNTITTSTSR GSNGLSTGVD VTPFPASKRG SFHIGVDF