Gene Cpin_1920 details

Gene Information

Locus tag: Cpin_1920
Symbol:
ID: 8358071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 2344314
End bp: 2345993
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 48%
IMG OID: 644964108
Product: dihydroxy-acid dehydratase
Protein accession: YP_003121617
Protein GI: 256420964
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
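
The length fields above follow from the coordinates. As a minimal sketch (the function names are hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length), the arithmetic can be checked like this:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(2344314, 2345993)
print(length, protein_length(length))  # 1680 559
```

Both results agree with the Gene Length (1680 bp) and Protein Length (559 aa) fields.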


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.316407
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTAA ACAAATACAG CAAAACGCTC ACGCAGGATC CAACACAGCC CGCCACACAG 
GCGCAGCTGT ACGCACTCGG TTTAACTGAA GAAGACCTGA AGAAAGCGCA GGTAGGCATT
GCCAGTATGG GCTATGACGG TAATCCATGT AACATGCACC TCAATGACCT CGCTCAGGAG
GTCAAGAAAG GGGTCTGGGC TAATAATCTG GTCGGACTCA CTTTCCATAC TATTGGCGTC
AGCGATGGTA TGACAAATGG TACTCCGGGT ATGCGTTATT CCCTGGTCAG CCGCGACCTG
ATTGCTGATT CCATCGAAAC CGTTGTAGGC GCTCAGTATT ATGATGGTGT GATCACCGTA
CCTGGCTGTG ATAAAAACAT GCCTGGCTCC CTGATCGCGA TGGGACGTCT GAATCGTCCT
TCTATCATGG TATATGGCGG TTCTACAGCT CCTGGTAAAT ACCAGGGAAA AGACCTGAAC
ATTATCTCTG CATTTGAAGC GCTGGGTCAG AAAATGGCTG GTCAGCTCAG CGATGAAGAC
TTCAAAGGCA TTGTACAGCA TTCCTGCCCT GGCGCCGGCG CCTGTGGCGG TATGTATACG
GCAAATACCA TGTCTTCTGC TATTGAAGCA TTAGGTATGA GCCTTCCTTA TAGTTCTTCT
AACCCTGCAC TCAGCAAGGA CAAAAAAGAA GAATGTCTGT CAGCTGGTAA ATATATCCAC
ATCCTCCTGG AGAAAGATAT CAAGCCTTCT GATATCATGA CGCTGGAAGC ATTTGAAAAT
GCGGCTACCG TTGTGATGGC ACTGGGTGGT AGTACCAATG CCGTACTGCA CTTCATTGCA
ATCGCAAAGG CAATAGGCGT GAAATTCGGA TTACCTGAAT TCCAGCGTAT CAGCGATAAA
ACACCACTGA TCGCTGACCT GAAACCAAGT GGTAAATACC TGATGGAAGA CCTGCACAAC
ATTGGCGGCG TTCCATTAGT AATGAAATAT CTGCTGAAAA AAGGCTACCT GCATGGTCAC
TGTCTGACAG TAACCGGCAA GACCCTGGCA GAAAACCTGG AAAGTGTACC AGATCTGGAA
TTCGAAGGAC AGGATATCGT TGTTCCGGTG GAAAAACCAA TAAAAGCGAC TGGTCACATC
CAGATGCTGT ATGGTAACCT GGCAGAACTG GGTTCCGTAG CCAAGATTAC CGGTAAGGAA
GGACTGAGCT TCCGTGGTCC TGCGCGTGTG TTTGAAGGTG AATATGAACT GATCGCAGGT
ATTCAGAACG GTCGTGTGAA AGCAGGCGAT GTGGTGGTAA TCAGACAGGT AGGTCCAAAG
GGCGCACCTG GTATGCCGGA AATGCTGAAA CCAACATCCG CTATCATGGG TGTAGGTCTT
GGTAAGAGCG TAGCGCTGAT TACAGACGGA CGTTTCTCTG GTGGTACACA CGGTTTTGTA
GTAGGACACA TCACACCGGA AGCGGTAGAA GGTGGTACTA TCGGACTGGT ACAGGACAAT
GACATAATAG AAATTGACGC CGAAAAGAAT ACAATCAACG TGGAACTGAG TGCAGAAGAA
CTGGCTGCCC GCAGGGCGAA ATGGGTAAAA CCTGCGTTGA AAGTAACTAA TGGTGTATTA
TATAAATATG CAAAACTCGT TTCAAATGCA ACAGAAGGAT GTGTTACCGA TGAAGCCTGA
 
Protein sequence
MELNKYSKTL TQDPTQPATQ AQLYALGLTE EDLKKAQVGI ASMGYDGNPC NMHLNDLAQE 
VKKGVWANNL VGLTFHTIGV SDGMTNGTPG MRYSLVSRDL IADSIETVVG AQYYDGVITV
PGCDKNMPGS LIAMGRLNRP SIMVYGGSTA PGKYQGKDLN IISAFEALGQ KMAGQLSDED
FKGIVQHSCP GAGACGGMYT ANTMSSAIEA LGMSLPYSSS NPALSKDKKE ECLSAGKYIH
ILLEKDIKPS DIMTLEAFEN AATVVMALGG STNAVLHFIA IAKAIGVKFG LPEFQRISDK
TPLIADLKPS GKYLMEDLHN IGGVPLVMKY LLKKGYLHGH CLTVTGKTLA ENLESVPDLE
FEGQDIVVPV EKPIKATGHI QMLYGNLAEL GSVAKITGKE GLSFRGPARV FEGEYELIAG
IQNGRVKAGD VVVIRQVGPK GAPGMPEMLK PTSAIMGVGL GKSVALITDG RFSGGTHGFV
VGHITPEAVE GGTIGLVQDN DIIEIDAEKN TINVELSAEE LAARRAKWVK PALKVTNGVL
YKYAKLVSNA TEGCVTDEA
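
Both sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch, not the database's own tooling, of how such a block could be cleaned, translated, and checked for GC content. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons; alternative start codons are not handled here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first = clean("ATGGAATTAA ACAAATACAG CAAAACGCTC ACGCAGGATC CAACACAGCC CGCCACACAG")
print(translate(first))  # MELNKYSKTLTQDPTQPATQ
```

Pasting the full gene sequence into `clean` and passing the result to `translate` and `gc_content` gives a way to compare against the protein sequence and the GC content field listed in the record.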