Gene Caul_1220 details


Gene Information

Locus tag: Caul_1220
Symbol:
ID: 5898675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1282121
End bp: 1282957
Gene Length: 837 bp
Protein Length: 278 aa
Translation table: 11
GC content: 71%
IMG OID: 641561705
Product: histidinol-phosphate phosphatase
Protein accession: YP_001682848
Protein GI: 167645185
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family
TIGRFAM ID: [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family
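
The Start bp, End bp, Gene Length and Protein Length values in this record are mutually consistent, and the arithmetic can be sketched as below. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1282121, 1282957)
print(length)                     # 837
print(protein_length_aa(length))  # 278
```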


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.427245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.220917
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTCG CCGCCGACGA CCTCGCACAG CTCGACGCCT TCATCATCGA CCTCAACCGC 
GCTTCGGCGG CGGCGATCCT GCCGCTGTTC CGGGCCGACC ATGGGTTGGA GGACAAGGGC
GCGGGCAAGA ACCTGCCGCG CGGCAGCCAC GCCGCCTTCG ATCCGGTGAC GGAAGCCGAT
CGCGGCGCCG AGGCGGCGAT CCGCAGGCTG ATCGGCGAGC GCTATCCCGA CCACGGGGTG
ATCGGCGAGG AATACGGCGA GGATCGGCCC GACGCCGAAT TCGTCTGGGT GCTGGACCCG
ATCGACGGGA CCCGCGCCTT CATCGCCGGC CTGCCGCTGT GGACCACCCT GATCGGTCTG
CGCCATCAGG GCCGCCCGGT CCTGGGCTCG ATTGGCCAGC CCTATACGGG CGAGATCTTC
ATCGGCTCTT CGGCCGGCTC GCGCCTGATG TCGCGCGGCC AGAGCCGGCC AATCCAGGTG
CGGCCCTGCG CCGACCTGAC CGACGCCGTC ATCGCCACCA CCGATCCCGA GGCCTGTTTC
GACGGCGCCG AGCTGGGGGC CTGGCGCCAG GTGCGGGCCG CCGCCAAGCT GGCCCGCCTG
GGCTGCGACG CCTACGCCTA CGCCATGGTC GCCATGGGCA AGATGGACAT GGTGATCGAG
GCCGGTCTGC AGTCCTGGGA CATCGAGGCC GCCATCCCCG TGGTGGAAGG GGCCGGCGGC
GTGGTCACCG ACTGGCGCGG CGACACGATC GGCCCGAACG GCGGCCAGAT GGTGATCGCC
GGCGACCGAC GCTGCCTGGA CGAGGCGCTA GTGGCGCTGC GGCGGTCGGC GAAGTAA
 
Protein sequence
MTLAADDLAQ LDAFIIDLNR ASAAAILPLF RADHGLEDKG AGKNLPRGSH AAFDPVTEAD 
RGAEAAIRRL IGERYPDHGV IGEEYGEDRP DAEFVWVLDP IDGTRAFIAG LPLWTTLIGL
RHQGRPVLGS IGQPYTGEIF IGSSAGSRLM SRGQSRPIQV RPCADLTDAV IATTDPEACF
DGAELGAWRQ VRAAAKLARL GCDAYAYAMV AMGKMDMVIE AGLQSWDIEA AIPVVEGAGG
VVTDWRGDTI GPNGGQMVIA GDRRCLDEAL VALRRSAK
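
The GC content and protein sequence reported for this gene can, in principle, be recomputed from the gene sequence. The following is a rough sketch, not the database's own pipeline: the helper names are hypothetical, and the codon table is built by hand from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
# Build the codon table in TCAG order; amino acid assignments are those of
# the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGACCCTCG CCGCCGACGA CCTCGCACAG"))  # MTLAADDLAQ
```

Passing the full gene sequence to `gc_content` and `translate` would allow a comparison against the GC content and protein sequence listed in this record.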