Gene Cpin_4022 details


Gene Information

Locus tag: Cpin_4022
Symbol:
ID: 8360195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 5003031
End bp: 5005205
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table: 11
GC content: 45%
IMG OID: 644966195
Product: trehalose-phosphatase
Protein accession: YP_003123684
Protein GI: 256423031
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0380] Trehalose-6-phosphate synthase
TIGRFAM ID: [TIGR00685] trehalose-phosphatase
    [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
    [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming]
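
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5003031, 5005205)
print(length)                  # 2175
print(protein_length(length))  # 724
```

Both values agree with the Gene Length and Protein Length reported above.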


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.179404
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.381996
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAAA CCATTATTGT ATCTAACAGA CTACCGGTAA AGATCACTGA AAAGGATGGA 
GACTATATGT TGAATCCCAG TGAAGGTGGT CTGGCTACAG GCTTGGGTTC CATTTACAGG
CAAGGCTACA ATATCTGGAT AGGATGGCCG GGCATAGATG TGAGTGAGGC TGCACAACCA
CAAATTACTG AACAACTAGG TGAGATGAAT CTGATGCCTG TGTACCTGAC ACAGGAGGAG
ATTAACAACT ATTATGAGGG ATTTTCAAAC GAGGTACTGT GGCCTGTTTT CCACTATATG
TCCGTTTATG CCCGCTACGA ACAGGTATAT TGGGATTTCT ATTATCAGGT AAATAGTAAA
TTCAGAGACG CTATATTAAG AGTCGCTGAA CCAGGTGATG TGATCTGGAT CCATGACTAC
CAGTTACTAT TATTACCAGG CATGATCCGC GCAGAAGTGC CTGATGTGGC GATCGGTTAT
TTTCAGCATA TTCCTTTCCC TTCATTTGAA CTTTTCCGCC TGATACCATG GCGTGTGGAA
TTGCTGGAAG GTATGCTGGG CGCAGATCTC TTAGGCTTCC ATACTTTCGA TGACAGCCAT
CATTTCCTGA ACGCTGTTAC CCGTCTGCTG CCAGTAAACG CGTCTGCTAA CGTCGTGACA
GTGAATGACA GGGCAGTGAT CGCTGAAACC TTTCCTATGG GTATCGACAA TGAAAAGTTT
GAACAACTTT CTTTTGATCC CGAAGTATTG CGACAACTGG AAAACCTGAA AGAAACATTC
CATAATACAC ACCTGGTGCT GTCCATCGAC AGACTCGATT ATAGCAAAGG TATCATCCAG
CGTTTACAGG CATTCGAATT GTTTCTCCAA CTCTATCCTG AATACACAGA GAAAGTAGTA
TTGTATATGA TCGTTGTTCC TTCCAGGGAT ACTGTTCCGC AATACAAAGA ATTAAAAGAG
GCCATCGACA TGCTGGCCGG TGGTATCAAT GCACGTTTCC GTACGATGAA CTGGCACCCG
GTGAATTATT TCTACCGCTC ATTCCCGGTA GAAGTATTGT CTGCGCTATA TAACTTCGCG
GACGTCGGAC TCGTAACGCC GATGCGGGAT GGTATGAACC TGGTGAGTAA AGAATATGTA
GCCAGCCGGA AAGATAATAA TGGTGTGCTG ATTCTGAGTG AGATGGCGGG TGCATCTAAA
GAGTTGATTG ATGCATTGAT TGTAAACCCG AATAATATCG GCGCTATTGC AAGGGCTTTA
CATGAGGCGA TTAACATGCC GGTGAAAGAA CAGGAACGTC GTATGAAATC AATGCGGCAG
GTCGTGCAGA AATTCAATAT TTCTCACTGG GTAAAACTGT TTATGACACG TTTGCAGGAA
GTAAAACAAC TGCAGCAATC TATGCTGGCC CGCCGTATGA GTATGGATAT GCAATCACAG
GTACGCAATC ACTATAAGAA AGCGCCGGAA CGTGTCATCT TCCTTGACTA CGATGGTACG
CTGGTCGGCT TCCAGGCGAA TATAGATCTG GCCTCTCCTG ATCAGGAACT TTATCAGCTG
CTCAAAACAC TTACGCACGA TAAAGCCAAT CATGTTGTAA TGATCAGCGG TCGTAAACAT
GAAACGCTGG AAGAATGGCT GGGACAACTC CCGCTTGATC TGATTGCAGA ACATGGTGCA
TGGCACAAGA AATATGGCGA AGACTGGCAG AAAATTCCCG GATTGAATGC TAACTGGAAA
CAGGACATCA TGCCTATTCT CGATACTTAT ATGGACCGCA CGCCAGGTTC ATTCATCGAA
GAGAAAAGCT ATTCGCTCGT ATGGCATTAT CGCAAAGTGG AAACAGGTCT GGGGGAATTA
CGTGCCAATG AACTGATGAA TACCCTCCGC TATTTTACCA ACGATATCGG ACTACAGATA
CTGCCGGGTG ATAAAGTAAT CGAAATAAAG AATGTGGAGA TCAATAAAGG TAAAGCGACG
CTGACCTGGT TACAGGACAG AAAGTTTGAA TTCACACTGG CGATCGGTGA CGATCATACA
GATGAAGACA TATTCAAAGC CCTATCCGGT GATGCTGTGA CAATAAAGGT GGGAAGCCAG
GTGTCGGCAG CAAGATATTA TCTGAGAAAC CATCATGAAG TAAGAGCGTT TCTGAGAACG
CTGGTGGCAG GATAA
 
Protein sequence
MSKTIIVSNR LPVKITEKDG DYMLNPSEGG LATGLGSIYR QGYNIWIGWP GIDVSEAAQP 
QITEQLGEMN LMPVYLTQEE INNYYEGFSN EVLWPVFHYM SVYARYEQVY WDFYYQVNSK
FRDAILRVAE PGDVIWIHDY QLLLLPGMIR AEVPDVAIGY FQHIPFPSFE LFRLIPWRVE
LLEGMLGADL LGFHTFDDSH HFLNAVTRLL PVNASANVVT VNDRAVIAET FPMGIDNEKF
EQLSFDPEVL RQLENLKETF HNTHLVLSID RLDYSKGIIQ RLQAFELFLQ LYPEYTEKVV
LYMIVVPSRD TVPQYKELKE AIDMLAGGIN ARFRTMNWHP VNYFYRSFPV EVLSALYNFA
DVGLVTPMRD GMNLVSKEYV ASRKDNNGVL ILSEMAGASK ELIDALIVNP NNIGAIARAL
HEAINMPVKE QERRMKSMRQ VVQKFNISHW VKLFMTRLQE VKQLQQSMLA RRMSMDMQSQ
VRNHYKKAPE RVIFLDYDGT LVGFQANIDL ASPDQELYQL LKTLTHDKAN HVVMISGRKH
ETLEEWLGQL PLDLIAEHGA WHKKYGEDWQ KIPGLNANWK QDIMPILDTY MDRTPGSFIE
EKSYSLVWHY RKVETGLGEL RANELMNTLR YFTNDIGLQI LPGDKVIEIK NVEINKGKAT
LTWLQDRKFE FTLAIGDDHT DEDIFKALSG DAVTIKVGSQ VSAARYYLRN HHEVRAFLRT
LVAG
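
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch using only the Python standard library. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons it permits, and that is not modelled here). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (assumed to apply here:
# translation table 11 uses the same amino-acid assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGAGTAAAA CCATTATTGT ATCTAACAGA CTACCGGTAA AGATCACTGA AAAGGATGGA"
last_line = "CTGGTGGCAG GATAA"
print(translate(first_line))  # MSKTIIVSNRLPVKITEKDG
print(translate(last_line))   # LVAG
```

The two outputs match the first 20 and last 4 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.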