Gene Caul_0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0289 
Symbol 
ID5897563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp318626 
End bp319765 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content70% 
IMG OID641560773 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_001681924 
Protein GI167644261 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCCG CCCTCGTTAA TGGCCGCATC CTGACGCGGG ACGGGATCGT CGAGGGCAAG 
GCCCTGATGA TCCGCGGCGG ACGGATCGAC GGCGTCATCG ACAGCGCTGA CGCGCCCGTC
GGCGCGCGCC GCCACGACCT GGGCGGCGGC CTGCTGGTCC CCGGCTTCAT CGACACCCAG
GTGAACGGCG GCGGCGGGGT GCTGTTCAAC GACGCCACCA CCGTCGAGGC CATCGCCGCG
ATCGGCGCCG CCCACCGGCC CTACGGCACG ACTGGCTTCC TGCCGACCCT GATCAGCGAC
GACCTGGCCG TGGTCGACGC CGCCATGCGC GCCGCCGAAG AGGCCATAGA GGCCGGCGTT
CCTGGCGTGC TGGGCGTGCA TATCGAGGGT CCGTTCCTCA ATGTGAAGCG CAAGGGCATC
CACGATTCGA GCAAGTTCCG GACCCTGGAC GACAAGGCCG TGGCCCTGCT CACCTCGCTC
AAGCGCGGAA AAACCCTTGT CACCCTGGCC CCGGAGACCA CGACGCCGGA CAGGGTTCGC
CAGCTGACAC AGGCCGGCGT GACCGTCGCC GCCGGCCACA CCAACGCCGC CTATGGCACG
ACGCGCCGGG CGCTGGACGC CGGACTGACC GGCTTCACCC ACCTGTTCAA CGCCATGTCG
CCCCTGACCA GCCGCGAGCC CGGCGTGGTC GGCGCGGCGC TGGAAAGCCA GACGGCCTGG
TGCGGGATCA TCGTCGATGG CCGTCACGTG GATCCCGCCG TGCTGCGCAT CGCTCTGCGC
ACCCGGCCGC TGGACCGCTT CATGCTGGTC ACGGACGCCA TGCCGACCGT CGGCATGATC
GACAAGAGCT TCGACCTGCA GGGCCGCCAT ATCCGCGTGG CCGACGGCGT TTGCATCGAC
GATCACGGGA CCCTGGCCGG CTCGGACCTC GACATGATCG GCGCGGTGCG CAACGCCATG
GCGATGCTGG GCCTGACGCT GGAGCAGGCG GTATCGATCG CCTCGAGCGC TCCGGCCGCC
TTCCTGGGCC TGGCGGACGA GCGCGGCGCC ATAACGGCCG GCCAGGCCGC CGACCTGGTG
CTGCTGGACG ACGATTTGAC GGTTCGCGAG ACCTGGATCG GCGGACGAAC GGCGGCGTAG
 
Protein sequence
MQSALVNGRI LTRDGIVEGK ALMIRGGRID GVIDSADAPV GARRHDLGGG LLVPGFIDTQ 
VNGGGGVLFN DATTVEAIAA IGAAHRPYGT TGFLPTLISD DLAVVDAAMR AAEEAIEAGV
PGVLGVHIEG PFLNVKRKGI HDSSKFRTLD DKAVALLTSL KRGKTLVTLA PETTTPDRVR
QLTQAGVTVA AGHTNAAYGT TRRALDAGLT GFTHLFNAMS PLTSREPGVV GAALESQTAW
CGIIVDGRHV DPAVLRIALR TRPLDRFMLV TDAMPTVGMI DKSFDLQGRH IRVADGVCID
DHGTLAGSDL DMIGAVRNAM AMLGLTLEQA VSIASSAPAA FLGLADERGA ITAGQAADLV
LLDDDLTVRE TWIGGRTAA