Gene Cpin_4268 details

Gene Information

Locus tag: Cpin_4268
Symbol:
ID: 8360441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 5328117
End bp: 5330027
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 41%
IMG OID: 644966431
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_003123919
Protein GI: 256423266
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
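
The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (consistent with the record listing the gene as unspliced). The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(5328117, 5330027)
print(length, protein_length_aa(length))  # prints "1911 636"
```

Both values agree with the Gene Length and Protein Length fields listed in the record.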


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.582346
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATGGA GATATGAGAT GCCTCTGCTA TTACAACCTT TGTTGATAAG TGGCGGTTTT 
ATTGAAAATA ATGTTTTATA CTACGATGCC AGATCTGGTA TCGAAAATCT GAAAAGATTT
TATAACTTTC TGGATTCCAC GGAACTGCTC ATTAAGGACA AAAGCCTGTT TACAAAAGCT
AAGAATACGC TATTCAAATA TCTGGATAGT CTGGCACTCC CCTATTTTAG TATGGATGCC
CGGGATGCAT TCAATATAGA AGAGATACCG CACAAGGAAC AGGCGGCTGT CTGGATGGCA
GATATCGCGT ATAACAATGC GATGATTACC CATGCCATGG ACAACAATAA CATCTCCCTG
TTAACTTACA GTCAACTTAA ACATGTCAGT ACGGCGTTTC ATTCATTTGC TGAACTGCTG
AATTATGTGG ACTATGATTA TGGATGGCGA CACATCTACC AGGAATCAGG AAGAGAAGAA
ATTTACAATG AGAATGGACT ATGGGGATTG AAAAATGCCC ACGGAGAAAT ATTGCTACAT
CCACAATTTG ATGAATTTTA TGAATTCAGC GAACAGGATG TTGCGGTGGT CATGAAGGAG
CAGCAGTATG GTTATGTGCA CAGATCAGGT GAGGTAATAG TGCCTCCGGA ATGGGATAAT
GCTTACGATT TTGATTATTC CAATCTGGCG ATTATTCAAC GCAATGGTCT TCTGGGACTC
ATCAATCTTT CGGGAAAAGT GGTTGTTGCA CCTACATACG AAGCCCTTAA CAAATTTGGG
TCTCAAGGTC ATTACATCGC ACAAAAGAAT GGAAATTGGG GGGTATTGTC AGAAGATGGA
AAAGTAATCA TCAATTTCAA ATATGACAAT ATAGAGCTGT TACATGACGA TGTAATCATG
CTACAGAAAG ATGGTTATTA CAGCCTTTCT GAAGATGGTG ACCAATTCGA CCTAATCGTA
CATAAAGCTC CTCCTCAGGG CTTCGCCTGG GCCATCAAAG GCAAGGAAGT ATACCTTATA
GATAAATATG GTATATCAAG AGCCAATAAA GACCTCGTCC GGCAGGATGC AGAAAATGAA
GGCTACAGTC TTTATTATGA TGATATCGTG CGTGCCAGAC TCCTGGCATA TGCAAAGTCC
TCCGCTGAAA ATACGGTTAC AGACGTTTAC ACACCGGTCG AAGAACTGTA TAATATCGGT
GTAGATGCTT ATAACAGGCA AGATTACACC TCTGCTATTT ATCACTACAC ACTTGCTGCT
GAAAAAGGAT TTGGTTATGC GATGAACAAC CTGGCTTATA TCTATTACAT GATAGAAGGC
TATGTAAATG ATGATAAAGC ATTTTACTGG TATGAGCAAG GAGCTGCTGC GCGCAATACC
AATGCGCTTA ATGGATTAAG TCTGTGTTAC CAGTATGGAA TAGGAACCAT CCCTGATATA
GAAAAAGCTA TTGACTTATT ACTACAGGCT GCTGAAGATG GTATGGCAGC TGCACATAAT
AATCTGGGAT TTCTGCTTTA TAAGACTGAT CCCGAGCTGG CATTATTTCA CTATCACCAG
GCTGAAGCAC TGGGTGAACC TGATTATGGA TTGCTGGGAT CCATGTATGA AGAAAAAGGA
GATTTCGAAA CCGCTTTCCG CTATCACCAA AAGGACGATT CGGAGATAGG TGCGTTTAAT
CAGGGACATT TCTATCAGAA GGGATTAGCT ACAGAAAAGG ATATAAAAGC AGCAATAGGC
TGTTTTCAAA CGGCGATAGA CGGTGGTTAT GATAGAGCGC ATATTGAACT GGCGCGCATA
TATCTGTTTG AAGAAGGATT TATTGATAAG GATAAAGCAA AAGTGCATAT TGTAGCGGCG
AAAGAAGCGG AGATTGAGAT TCCTGAGGAA TTCAAAGATC CATTTGATTG A
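
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch, with hypothetical helper names, of stripping that layout and computing a GC fraction. Applied to the full sequence, the result can be compared against the GC content field in the record; here it is run only on the first displayed line.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied with its 10-base grouping.
first_line = "ATGGAATGGA GATATGAGAT GCCTCTGCTA TTACAACCTT TGTTGATAAG TGGCGGTTTT "
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```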
 
Protein sequence
MEWRYEMPLL LQPLLISGGF IENNVLYYDA RSGIENLKRF YNFLDSTELL IKDKSLFTKA 
KNTLFKYLDS LALPYFSMDA RDAFNIEEIP HKEQAAVWMA DIAYNNAMIT HAMDNNNISL
LTYSQLKHVS TAFHSFAELL NYVDYDYGWR HIYQESGREE IYNENGLWGL KNAHGEILLH
PQFDEFYEFS EQDVAVVMKE QQYGYVHRSG EVIVPPEWDN AYDFDYSNLA IIQRNGLLGL
INLSGKVVVA PTYEALNKFG SQGHYIAQKN GNWGVLSEDG KVIINFKYDN IELLHDDVIM
LQKDGYYSLS EDGDQFDLIV HKAPPQGFAW AIKGKEVYLI DKYGISRANK DLVRQDAENE
GYSLYYDDIV RARLLAYAKS SAENTVTDVY TPVEELYNIG VDAYNRQDYT SAIYHYTLAA
EKGFGYAMNN LAYIYYMIEG YVNDDKAFYW YEQGAAARNT NALNGLSLCY QYGIGTIPDI
EKAIDLLLQA AEDGMAAAHN NLGFLLYKTD PELALFHYHQ AEALGEPDYG LLGSMYEEKG
DFETAFRYHQ KDDSEIGAFN QGHFYQKGLA TEKDIKAAIG CFQTAIDGGY DRAHIELARI
YLFEEGFIDK DKAKVHIVAA KEAEIEIPEE FKDPFD
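
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below is one way to do that in plain Python. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in the permitted start codons), so the codon table is built from the standard assignment string. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for all 64 codons, ordered with bases T, C, A, G and the
# third codon position varying fastest; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop display spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last displayed lines of the gene sequence (both start in frame,
# because every full line holds 60 bases).
print(translate("ATGGAATGGA GATATGAGAT GCCTCTGCTA TTACAACCTT TGTTGATAAG TGGCGGTTTT"))
print(translate("AAAGAAGCGG AGATTGAGAT TCCTGAGGAA TTCAAAGATC CATTTGATTG A"))
```

The two printed fragments should correspond to the first 20 and last 16 residues of the protein sequence listed above.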