Gene Caul_4921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4921 
Symbol 
ID5902383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5316811 
End bp5318220 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content67% 
IMG OID641565441 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_001686539 
Protein GI167648876 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.51327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCGA GCGACTTTGA AGATCGCCGT ATCTGCATTA TCGGCCTCGG CTATGTCGGC 
CTGACCTTGG CCGTGGCCTT GGCGGACGTG GGTTTCCACA TCGTCGGCGT GGAAACTAAT
CCGACCATCC GTGACTGTCT CGCCAACAAG AAAGCCCATT TCAGCGAAAC GGGCCTGGAC
CGTCGCCTGC AGGCGGTGAT CGAGGCTGGC CGACTGGAAG TGCACCCCGT CATCCCCGTC
GGCGACACCT CGACCACCTA CGTGATCACG GTGGGCACGC CGGTCGATGA AACCAAGCAC
ACCAAGCTTG CGGCCATCCA GAACGTCGCC TCAATGGTGG CCGGAGCCCT CAAGCCCAAC
GACCTCATCG TGCTGCGCTC CACGGTTCGC GTGGGCGTGT CCCGCACCGT GGTGAAGCCG
ATCCTCGACC AGGCCGGCGT CCCCTTCCAG ATCGCGTTCT GTCCGGAGCG CACGATCGAG
GGCAAGGCGA TCGAGGAGCT GCGCAGCCTG CCCCAGGTGG TCGGCGGCAT CGACACCGCC
TCGACCCTGC GCGCCTCTCG CCTGTTCCAC ATGCTCACCC CCTCCGTCAT TCGGGTCTCG
AGCCTGGAAG CCGCCGAGAT GGTCAAGCTG GTCAACAACA CCTATCGCGA CATCAGCTTC
GCCTTCGCCA ACGAAGTCGC CTTCGCGTCG GAGGCCGCCG GCCTCTCCGC CACCGAGGTC
ATTCGCGCGG GCAATCTGGG CTATCCGCGC GGCGGTCTGC CCTTGCCCGG TCCCGTCGGC
GGCCCGTGCC TGGAGAAGGA CCCCTACATC CTGGCCGAGG GTCTGGCGCT TCACGGTTTC
ATGCCCGAGC TGTCGCTGGC CGGGCGCCGC CTGAACGAAA CCCTGCCCCA GCGCGCGGCC
GACTATCTGG ACGGCCTGGC GGGCAAGCGG CACCTGAACG TCACCCGGAT CGCCGTGGTC
GGGGTGGCCT TCAAGGGCCG GCCCGCGACC TCGGACCTGC GGGGCACCCT GGCGCTGCCC
CTGATCGAGA GCCTGCGCGG CGTCTTCCCG GACGCTGAAA TCGTGGGCTG GGACCCGGTC
GCCTCGGCCA GCGACATCCG ACGCGAACTG GCGATCGGCG CCACGACGAC CCTGCCCGAA
GCCGTCACCG GCGCTTCGCT GGTGGTGATC CAGAACAATC ACACGGCGTT CAAGGAAATC
GATTTCCAGT GCGTTTCGCG GGCGATGGCG CCGCGCGGCG TGATCTACGA CTTCTGGGGT
CAGAATGAGA GCGAAGATCT GGCCCTCGAC AATGGCGTGG CCTACGCGAC GTTCGGCTCG
GCCTTCCTGA CCGAAGCCTC CGCGGCGCGG TCCGCCTCGA CGTCGAAGGG CGCCCGCGCC
GGCGTCGGCG ACCTTGAAAG CGCGACATGA
 
Protein sequence
MIPSDFEDRR ICIIGLGYVG LTLAVALADV GFHIVGVETN PTIRDCLANK KAHFSETGLD 
RRLQAVIEAG RLEVHPVIPV GDTSTTYVIT VGTPVDETKH TKLAAIQNVA SMVAGALKPN
DLIVLRSTVR VGVSRTVVKP ILDQAGVPFQ IAFCPERTIE GKAIEELRSL PQVVGGIDTA
STLRASRLFH MLTPSVIRVS SLEAAEMVKL VNNTYRDISF AFANEVAFAS EAAGLSATEV
IRAGNLGYPR GGLPLPGPVG GPCLEKDPYI LAEGLALHGF MPELSLAGRR LNETLPQRAA
DYLDGLAGKR HLNVTRIAVV GVAFKGRPAT SDLRGTLALP LIESLRGVFP DAEIVGWDPV
ASASDIRREL AIGATTTLPE AVTGASLVVI QNNHTAFKEI DFQCVSRAMA PRGVIYDFWG
QNESEDLALD NGVAYATFGS AFLTEASAAR SASTSKGARA GVGDLESAT