Gene Cpin_4975 details


Gene Information

Locus tag: Cpin_4975
Symbol:
ID: 8361151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 6221890
End bp: 6225057
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table: 11
GC content: 48%
IMG OID: 644967124
Product: TonB-dependent receptor plug
Protein accession: YP_003124609
Protein GI: 256423956
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
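
The start, end, and length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions, and the variable names are illustrative rather than taken from the database.

```python
# Values copied from the record above.
start_bp = 6221890
end_bp = 6225057

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the listed 3168 bp and 1055 aa.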


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.416467
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGC TCTACGTACT CGTAGTGCTG TTGCTATTTG CTTTCCTCCA AGGCTTTTCG 
CAGCAGACAA ACAACGTTAC TTACTCAGGA ACAGTAACAG ATTCCACAGG CATTCCCATT
CCGGGCGCCA CCGTATCAGT GAGGAACAGT AACAAAGGAG TGGTCACGAA GAATGATGGT
TCGTTTTCCA TTCAGGCCAC GCCCGGCGCT GTTATTACCG TCAGCATCGT AGGTTTCCAG
ACCAAGGAGC TTGTACTGGG AAGGGAAGCA AAATTGCAGA TCACTGTCTC CTCACAGGCC
AGTAATCTGA CAGACATTGT CGTGGTAGGT TATGGTACGC AACGAAAGGC CTCCGTTACA
GGCGCCGTTA GTACCCTTAA ATCGGACGAT CTCGTACGAA CACCTGCTAC GACCACGTCG
GCTGCCCTCG TGGGTAAAAT GCCGGGTATA ACGGCCCGTG CCACCGATTC CCGTCCGGGT
AATGGTACCA ATCTTCAGAT CCGTAACCTG GGTACGCCGC TCTTCGTTAT CGATGGGGTG
CCTTATACAG GTAATACGGC TACATCCGCC TTCGGATTAA CCACCGGCTC CGGACAGGAT
ATCTTCAATA CACTAGGTCT GGAGGATATT GAAAGCATCA CCATCCTGAA AGATGCTTCT
GCTTCCATTT ATGGTTTGCG CGCCAGTAAT GGCGTGGTAC TCGTTACTAC CAAAAAAGGA
ACGAAGAATG AACAACCGCG TATTAACCTG TCTGGTTATT ACGGTTTCCA GAACTTTACG
CGTTACCCAT ATCCGGCGAA TGCCGGACAA TATGTGCGCG CACTTGTAGA GTCAGAACAG
AATCTCGGCC GTGATCCGTC ATTGCTGTAT ACGCCCGGCG AACTTTCGAA ATGGGAAGCC
GGAACGGAGA AAGGTTATAA GAGTTATGAC TACTACAAGG AAGTACTCCG CCCTAATGTA
CCTCAGAACT ACATCAGTGC GAATGCTTCC GGTGGTTCAC AGCGTACCAG TTATTATATG
TCTGTCAGCA GACTGAGTCA GGACGCGCTC GTGAAAGACT TTACCTATGA GCGTACCAAT
CTGCAGGCCA ACCTGGAGGC TAGTCTGGCG AAAGGATTAA AAGTCGGTAC ACAGATCAGT
GGCCGTCTTG AAAAACGTCA TAACGTAGGT GTACCCGGTC TGGATGACTA TTTCAATCCC
TTCCTGAGCA TCTTCAGTAT GTGGCCTACG GAAAGCCCTT ACGCGAATGA TAATCCCAAT
TATATTCATC AGACGCATAA CGTAAACGTT AACCCTGTGA CTTACAGAGA TGACGTGACC
GGTTACCTGG ACGAATGGTG GAGAGGGATG AACGTCAATC TGAATGCGCA GTATGATTTT
GATTTCGGTC TGAGCCTGAA AGGCGTATAC TCCTACAACT ATCTCAACGA GGAGTTTGAC
GGCTTTGAGT ATACCTGGAA CGCGTATAAA TACGATGCGA ACACAGACAG CTATTTTACC
GAACCAGGCT TTGGTAATCA GAATCCATGG CGTGAAAGAC ATAAGCGAAA TGTTATTTCC
CGTTATGCAC AGTTTCAGCT GAGCTATGCG CGTAAATTCA AGAACCATAA TGTTTCCGCT
ATTGCAGCAT ATGAGTTGTC GGACTATGAA AACTCATACT ATGTGGTGCA TACCATCCCT
ACGAACAACT ATGTGCCTGT ACAGTACCTG GTTGAACAGG ACTATCTCTC AGATGAGTGG
AACCTGGAAG CCAGAGCAGG TTATATCGGC AGGATCAACT ATGACTATAA ATCCAAATAC
CTGCTGGAAA TACTTGGTCG TTATGATGGT TCCTACCTCT ACGCTGCGGG TAAACGCTGG
GGCTTCTTCC CGGGCGTATC ACTCGGTTGG AGGGTCTCCG AAGAATCCTG GCTGAAAGGG
AAGTTCGGTA ATGTGCTGAA TGACCTTAAA CTGAGAGCTT CCTATGGTGA AACGGGTAGC
GAGATCGGTT TCTCCAACGG GAATGCACCA AATGCATTTG ACTATCTCGC CGGTTATAAT
TTCAACCAGG GAGGCGCTGT AATGAATGGT AGTTATGTAA TTGGTTTGCG ACCACGTGGT
TTACCGGTAA CAGAACTATC CTGGGTGAAG AATAAAAACT TCAACCTAGG TCTTGACTTT
ACGCTCCTGA ACAACAGTGT AAGCGGACAA CTGGATGTGT TTGAACGCCG TCGCTCAGGA
CTGCCAGCAG CCCGTTATGA CGTATTGTTG CCTAGTGAAG TGGGCTATTC GCTGCCGAAT
GCCAACCTCA ATAAAGATGC AACACGTGGT ATCGAAGGTA TGGTGACTTA TACCGGTACT
TCCGGAGATG TGCGTTATTC CATCGGCGTG AATGCCACCT ATGCACGACT GAGAAGCGTC
AGTACTTATA AACCGCGTTT TGGTAATGCA TGGGATAAAT ACCGAAACTC TATTGAAGAC
CGCTGGTCGA ATATCACCTG GGGGTATCAT GTGACGGGCC GTTTTGAGTC GGAAGAGCAG
ATTAAAAACT ATGGTATCGA CAATGACGGA CAGGGTAACC GTACCGAATT ACCGGGCGAT
TTTATGTATG AAGACGTCAA CGGCGATAAG ATCATCAATG GACAGGACCA GCGTCCTATC
GGTTATGCGC AGGGTGCGCA GCCTTATGTC AGCTTTGGTA TCAACAGTAG TGTATCCTGG
AAAGGGATCT CTCTGCGCTT TGATTTTGCA GGAGCTGCAA TGCAGTCTTT CCTGCGTGAC
TGGGAGCTGC GTTATCCTTT CCAGAATAAT GGTTCGTCTC CTGCCTATAT GCTGACAGAC
CGCTGGCATA GAGAAGATCC TTATAATGCA GACAGCAAAT GGGTAAGTGG CAAATACCCT
GCTATCAGAA AGGATAATAC TACTCATGTG AATTACGCTG TAAACGATTT CTGGATCACC
AACGTAAGGT ATATCCGACT GAAAAACCTG GAGCTGGCTT ACAACTTCTC CCGTGAGTTT
GTCAAGAGGA TTGGTATTTC CGGACTGCGT GTGTATGCCA ATGGCACAAA TCTCTTCTCT
ATAGACAACG TGAAAGAGTA TGAGATAGAT CCTGAGATTG GTTCTTCCAA TGGCCTGGTA
TATCCGCAGC AACGGCTGTA CAACTTTGGT TTTAACGTAT CCTTTTAA
 
Protein sequence
MKTLYVLVVL LLFAFLQGFS QQTNNVTYSG TVTDSTGIPI PGATVSVRNS NKGVVTKNDG 
SFSIQATPGA VITVSIVGFQ TKELVLGREA KLQITVSSQA SNLTDIVVVG YGTQRKASVT
GAVSTLKSDD LVRTPATTTS AALVGKMPGI TARATDSRPG NGTNLQIRNL GTPLFVIDGV
PYTGNTATSA FGLTTGSGQD IFNTLGLEDI ESITILKDAS ASIYGLRASN GVVLVTTKKG
TKNEQPRINL SGYYGFQNFT RYPYPANAGQ YVRALVESEQ NLGRDPSLLY TPGELSKWEA
GTEKGYKSYD YYKEVLRPNV PQNYISANAS GGSQRTSYYM SVSRLSQDAL VKDFTYERTN
LQANLEASLA KGLKVGTQIS GRLEKRHNVG VPGLDDYFNP FLSIFSMWPT ESPYANDNPN
YIHQTHNVNV NPVTYRDDVT GYLDEWWRGM NVNLNAQYDF DFGLSLKGVY SYNYLNEEFD
GFEYTWNAYK YDANTDSYFT EPGFGNQNPW RERHKRNVIS RYAQFQLSYA RKFKNHNVSA
IAAYELSDYE NSYYVVHTIP TNNYVPVQYL VEQDYLSDEW NLEARAGYIG RINYDYKSKY
LLEILGRYDG SYLYAAGKRW GFFPGVSLGW RVSEESWLKG KFGNVLNDLK LRASYGETGS
EIGFSNGNAP NAFDYLAGYN FNQGGAVMNG SYVIGLRPRG LPVTELSWVK NKNFNLGLDF
TLLNNSVSGQ LDVFERRRSG LPAARYDVLL PSEVGYSLPN ANLNKDATRG IEGMVTYTGT
SGDVRYSIGV NATYARLRSV STYKPRFGNA WDKYRNSIED RWSNITWGYH VTGRFESEEQ
IKNYGIDNDG QGNRTELPGD FMYEDVNGDK IINGQDQRPI GYAQGAQPYV SFGINSSVSW
KGISLRFDFA GAAMQSFLRD WELRYPFQNN GSSPAYMLTD RWHREDPYNA DSKWVSGKYP
AIRKDNTTHV NYAVNDFWIT NVRYIRLKNL ELAYNFSREF VKRIGISGLR VYANGTNLFS
IDNVKEYEID PEIGSSNGLV YPQQRLYNFG FNVSF
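
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the method used by the database. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation (the two tables differ in which codons may act as start codons). The function name `translate` and the helper `CODON_TABLE` are illustrative. Only the first and last 10-base-block lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Build the codon table in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    return "".join(
        CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - len(seq) % 3, 3)
    )


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAAAACGC TCTACGTACT CGTAGTGCTG TTGCTATTTG CTTTCCTCCA AGGCTTTTCG"
last_line = "TATCCGCAGC AACGGCTGTA CAACTTTGGT TTTAACGTAT CCTTTTAA"

print(translate(first_line))  # MKTLYVLVVLLLFAFLQGFS
print(translate(last_line))   # YPQQRLYNFGFNVSF* ("*" marks the stop codon)
```

Both outputs match the corresponding ends of the listed protein sequence. Each full line of the gene sequence holds 60 bases, a multiple of three, so every line starts in frame and can be translated on its own.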