Gene Cpin_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_2044 
Symbol 
ID8358195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp2492735 
End bp2494408 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content48% 
IMG OID644964231 
ProductPHP domain protein 
Protein accessionYP_003121740 
Protein GI256421087 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1387] Histidinol phosphatase and related hydrolases of the PHP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATA ATTACGCTAT TGCCGATAAT TTCTCCCTGA TCTCCAAACT CATGGACATA 
CAGGGAGATA ATAGTTTTAA AGCCAAATCT TTCGCCAGCG CCGCATTTAC CGTGGAGAAA
CTCCCTATAC AATTGCAGGT TACGCCTCCG GAAGAAATCG CTAAAATTAA AGGCATCGGT
GACTCCATCA GCAAAGCTAT TCAGGAAATG CTCCAGACTG GCAAATTCGC CGCCCTCGAA
GCATACCTAC TGAAAACACC TCCAGGCATC CTGGAGATGA TGAAAATAAA AGGCCTCGGC
CCTAAAAAAA TAGCAACCAT CTGGAAAGAA CTGGAAGCGG AAAGCCTCGG GGAACTGCTA
TACGCCTGTA ATGAAAACCG CCTTACCCTA CTCAAAGGCT TTGGACAAAA AACACAGGAC
GCCGTTAAAC AAAGCATCGA ATTCTATTTC AGCAACCAAG GCCGCTACCT CTACGCTCAG
GCAGAAGCAC TGGCACTTTC TCTGGAAAAA GAATTCAGTC AACTACTGAC GCCCGCCGCC
GTTTCATTGA CCGGCCAATT CCGTCGCAAT GAAAATATCA TCGACGAAAT AGAATTTGTC
GCCGCTCTCC CCACTCCCGC CCTGCAGGAA AAATTATCCG CATTGCCGGC ACTCACCGCA
GCTACCGTAA CCGGAGATGC GATCACATGG CAACACGAAC AACACATAAA AATTAAAGTC
TACACCGCCT CCGCTGAAGA TTTTAATAAA GTACTATTCA CAACTACCGC ATCTGCTTCC
TTCCTGGAGC ACTTTAATGC ATCGCTTGAC CTGAACAGCA TTCCCGCTAA TGCAACAGAA
GAAGCGATCT TCACACAGGC AGGAATGGAG TATATCGAGC CTTGTCTGCG TAACGGCGGA
CTCGCCATCA ATCTCGCTAA AAAGAAACAA CTGCCGGTAC TGATCACACA CAAAGACATC
AAAGGCATCA TTCACAGTCA CAGTACCTGG AGCGATGGGG AACATACCCT GCTGCAAATG
GCAACCACTG CCAAAGAACA GGGCTTTGAA TACCTCGTTA TCAGTGATCA CTCCCGCTCT
GCCTTCTATG CAAACGGACT CAGTATCGAG CGTATACAGG CACAACACGC AGAGATCGAT
GCACTGAATA AACAACTCGC TCCATTCCGT ATTTTCAAAA GCGTAGAAGC TGATATCCTT
AACGATGGCA GTCTCGATTA CCCGGATGAA ATACTCGCCT CATTCGATCT CGTCATTGCA
TCCGTACACT CCAACCTTAA AATGAGTGAA GAAAAAGCAA TGGCGCGACT CCTGAAAGCA
ATAGAAAATC CTTACACTAC TATACTCGGT CATATGACCG GTCGCCTCCT GCTCAGCCGT
AACGGTTATC CTGTCAATCA TCAGCAGATC ATTGACGCAT GTGCTACACA CAATGTGGTT
ATTGAGATCA ATGCACATCC GCGGCGTCTC GATATCGACT GGGAATGGAT TCCATACGCA
CTGGAGAAAA ATGTGCTGCT GTCCGTAGAT CCGGATGCAC ACAGCACTGA CGGCTTCCAC
GATATTTACT ACGGTACCCT CGCTGCAAGA AAAGGCGGAC TCACCGCTGC GCATAACCTG
AGCAGTTTCA GCGCCGCCGA ACTGGAAGCA TTCATCAGCA AAAAGAAAAA ATAG
 
Protein sequence
MLDNYAIADN FSLISKLMDI QGDNSFKAKS FASAAFTVEK LPIQLQVTPP EEIAKIKGIG 
DSISKAIQEM LQTGKFAALE AYLLKTPPGI LEMMKIKGLG PKKIATIWKE LEAESLGELL
YACNENRLTL LKGFGQKTQD AVKQSIEFYF SNQGRYLYAQ AEALALSLEK EFSQLLTPAA
VSLTGQFRRN ENIIDEIEFV AALPTPALQE KLSALPALTA ATVTGDAITW QHEQHIKIKV
YTASAEDFNK VLFTTTASAS FLEHFNASLD LNSIPANATE EAIFTQAGME YIEPCLRNGG
LAINLAKKKQ LPVLITHKDI KGIIHSHSTW SDGEHTLLQM ATTAKEQGFE YLVISDHSRS
AFYANGLSIE RIQAQHAEID ALNKQLAPFR IFKSVEADIL NDGSLDYPDE ILASFDLVIA
SVHSNLKMSE EKAMARLLKA IENPYTTILG HMTGRLLLSR NGYPVNHQQI IDACATHNVV
IEINAHPRRL DIDWEWIPYA LEKNVLLSVD PDAHSTDGFH DIYYGTLAAR KGGLTAAHNL
SSFSAAELEA FISKKKK