Gene Cpin_4812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4812 
Symbol 
ID8360988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6002468 
End bp6005524 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content48% 
IMG OID644966962 
ProductTonB-dependent receptor 
Protein accessionYP_003124447 
Protein GI256423794 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000883537 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000000117998 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAAATATA TTTTACTGTT GCTGCTGTTG GCCATTTCCC TGTCTTCGAT GGCACAGCAA 
AAAAGCATAA GCGGTACTGT GAAAGACCTG AATGGCGTAT TGCCTGGTGC GGCAGTTGTC
GAGAAGGGCG TGCCTACCAA TGGGGCTATT ACAGATGCGG ACGGCCGGTT TAAACTGACC
CTGAAGGGGA AGTCGAATGC GGTGATCGTG AAGCTGATCG GGTTTACGAC CCAGGAAATC
AGTGTAACAG GTTTAGTGAC CCTTGATGTC ATATTACAAC CTGCTACACA GGGGATCGAT
GAAGTAGTGG TGGTGGGTTT TACGCAGACA AGACGTATTA CAAATACCGG TGCGGTAAGT
GGTATCACCG GGGCTGAGAT CAGGAATGTA CCTACTGCCA ACGTGCAGAA TACTCTGATG
GGTAAGCTGC CAGGATTTGT GTCGCAGCAA CGTTCCGGTC AGCCGGGAAG AGATGCTTCC
GACTTTTTTA TCCGTGGTGT GAGTTCGCTG AACAGCGAGG GAAATAAACC ACTGATCATC
GTGGATGATA TTGAATACTC TTACGACCAG TTGCAACAGA TCAACGTCAA TGAGATTGAA
AGTATCTCTA TCCTGAAAGA TGCGTCCACT ACCGCGATCT ATGGTATCAA GGGCGCTAAC
GGCGTACTGG TGGTGACGAC CCGCAGAGGT AAAAGCGGTC GTCCGCAGGT AAACGTACGA
GCAGAATCAG GCTTACAGGC GCCGGTAAAG ACGCCCAGGT TGCTGGATGC TTACAATACG
GGATTGCTGG TGAATGAAGC GTATAGGAAT GACGGTCTGA CGCCACTGTT TACGGATGCG
GACCTGGAGC TGTTCAGGAA CGGGAAAGAT CCTTATGGTC ATCCGAATAT CAACTGGTAT
GAAACGGTGT TTAAGAAATT TGCACAGCAG GCGAATACAA ACGTCGATAT CTCCGGTGGA
ACAGAGGCGG TGAAGTATTT TATTTCCGGT GGATTTTTAA CACAGAACGG ACTTGTAAAG
GACTTTTCTG ATCCCCGTAA TGAAGTGAAT ACAAACTATT TCTTCCGCAG GTATAACTTC
CGCTCCAATC TCGATATGTG GGCAAATAAG ACGCTGACAC TGCGTCTTGA CCTGACAGCT
CGTTTTGGTG ATATCAATGA GCCACATGCA GCCAATGTGG TATCGGAAGT CTATAATTTT
GAAAAGATAC ATCCTTATTC CGCGCCTATC ATGAACCCGG ATGGTAGTTA TGCATTTGCG
GTGGATACAA AGGATAAGTT ACCGACGATC AATGCCCGTC TGGCGAATGG GGGTTATACC
CGTAGTAAGC GTACAGACTT TAATGCCTTA TTTGGCGCTA CGCAGCAACT GGACTTTATG
ACGAAAGGCT TGTCCCTGAC GGCCCGTGTG GCTTATGCCG GTGTGGAGCA GTATTCCAGG
AACCTGTTCC GTTCTGCAGA TCCGCCTTCT TATCATTATA ATCCGGTGGA TGGTTCTTAT
ACGCTGGATC CCCGTGGAAA CTACCGTTTG CAGGCTTATC GTGTGACGGG CAATACGGAC
CTGTACAGCC GGAATGTGAA CACGCAGGCA TTCCTCAATT ACGACCGTAG CTTTGGCGCC
CATACCTTTA ACTCACTCAT ACTATACAAT AGGCAGAGTT ATGCCTTTAA GTCGGATGTT
CCGGCCAACT TCCGCGGACT TTCCTTCAAG GCCGGCTACA ACTATCAACA GAAATATCTG
ATCGATTTTA ATGGGGCTTA TAATGGTTCT GACCGTTTTC AGGCGAAGAA AAGGAATGGT
TTTTTCCCGG CAGTGGGCGT GGGCTGGAAT ATTTCGCAGG AGTCGTTCTT TAAGGAACGT
AACCTGCCTT TCAGTCTGCT GAAACTGCGT GGTTCCTATG GCGTAGTAGG CTCTGATGTG
ACGTCGGGCA ACCGTTATCT CTATCGTCAG GAGTATTATT ATAGCGGTGG TTATTCCTTC
GGGGAAAATG ACTCCCAACA AGGTTCTATT TATGAAGGAG AACTGGGCAA TATGGACGTA
AGCTGGGAAA AGGCGAGGAA AGCAGATGTG GGTATCGATA TGAACCTGTT TAAAGACAAG
CTTTCTGTGA CGGTCGATTA TTTCCATGAT ATCCGCTATG ATCAGCTGGT GCGGAAGGGT
TCTATTCCTG GTATTATCGG TATTGGTTTC AGTCCGACGA ATGTGGCCAG AACGAGCAAC
AAAGGTTTCG ACGGACAGAT CAACTACCGT GGCAGCATCC GCCGTGTAAA CTTCACCACC
AGCCTGGTAT TCCAGCATTT CAAGAACAAG GTACTGTTCA AGGATGAAGC ACAGCCGGCT
TTCCCCTGGT TGAGAGAAAC GGGTCGTCCT ATCGATCAGC CATTCGGCTA TACTTTTATC
GGGTATTATA CACCGGAGGA TATTCAGAAG ATCCAGGCGA ACACACATGA CAAACCAGCT
ACACCGCTGA GTGAATATCC GATCCAGGCA GGTGATCTGA AATACAGGGA TCTTAACAAC
GACGGTATTA TTGATAATAA TGACAGGGGA CCTATCGGTC GTCCGAATCT GGCCAATACG
GTGCTGGGTC TAACACTGGC AGCTAACTAC AAAGGGTTTA GTGTGAGTCT GCTGTTCCAG
GGTGCGTTTG GATATAGTTT CAGTGTGGTA GGTACCGGTA TCGAGACTTT CCAGAGTCAG
TTTCAGCCTA TCCACCAGGA AAGATGGACA CCGGAGAATG CGGATAACGC ACAGTTCCCA
CGTCTGACCA GCAATCCGAC TACTGTAAAT AGTGCGCGCA CCTATATGTC AGATTTTTGG
TTGATCGATG CGCGTTATGT ACGTCTGAAA ACGATCGATC TGGGGTATCA GTTGCCAAGC
CGCTGGTTAC CACAACGCCT TAACAACGCA CGACTGTACC TGAGTGCCTA TAACCTTTTA
ACATGGACCA ACTACGATAA GTACCAGCAG GACCCTGAGG TATCCAGTAA CTCTGCGGGC
GACGCCTATA TCAATCAGCG CGTTGTCAAT CTGGGGGTGC AGATCGGTTT TAAATAA
 
Protein sequence
MKYILLLLLL AISLSSMAQQ KSISGTVKDL NGVLPGAAVV EKGVPTNGAI TDADGRFKLT 
LKGKSNAVIV KLIGFTTQEI SVTGLVTLDV ILQPATQGID EVVVVGFTQT RRITNTGAVS
GITGAEIRNV PTANVQNTLM GKLPGFVSQQ RSGQPGRDAS DFFIRGVSSL NSEGNKPLII
VDDIEYSYDQ LQQINVNEIE SISILKDAST TAIYGIKGAN GVLVVTTRRG KSGRPQVNVR
AESGLQAPVK TPRLLDAYNT GLLVNEAYRN DGLTPLFTDA DLELFRNGKD PYGHPNINWY
ETVFKKFAQQ ANTNVDISGG TEAVKYFISG GFLTQNGLVK DFSDPRNEVN TNYFFRRYNF
RSNLDMWANK TLTLRLDLTA RFGDINEPHA ANVVSEVYNF EKIHPYSAPI MNPDGSYAFA
VDTKDKLPTI NARLANGGYT RSKRTDFNAL FGATQQLDFM TKGLSLTARV AYAGVEQYSR
NLFRSADPPS YHYNPVDGSY TLDPRGNYRL QAYRVTGNTD LYSRNVNTQA FLNYDRSFGA
HTFNSLILYN RQSYAFKSDV PANFRGLSFK AGYNYQQKYL IDFNGAYNGS DRFQAKKRNG
FFPAVGVGWN ISQESFFKER NLPFSLLKLR GSYGVVGSDV TSGNRYLYRQ EYYYSGGYSF
GENDSQQGSI YEGELGNMDV SWEKARKADV GIDMNLFKDK LSVTVDYFHD IRYDQLVRKG
SIPGIIGIGF SPTNVARTSN KGFDGQINYR GSIRRVNFTT SLVFQHFKNK VLFKDEAQPA
FPWLRETGRP IDQPFGYTFI GYYTPEDIQK IQANTHDKPA TPLSEYPIQA GDLKYRDLNN
DGIIDNNDRG PIGRPNLANT VLGLTLAANY KGFSVSLLFQ GAFGYSFSVV GTGIETFQSQ
FQPIHQERWT PENADNAQFP RLTSNPTTVN SARTYMSDFW LIDARYVRLK TIDLGYQLPS
RWLPQRLNNA RLYLSAYNLL TWTNYDKYQQ DPEVSSNSAG DAYINQRVVN LGVQIGFK