Gene Caul_4824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4824 
Symbol 
ID5902286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5221772 
End bp5223079 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content67% 
IMG OID641565344 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_001686442 
Protein GI167648779 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCG CCATGATCGG CACCGGCTAT GTGGGCCTGG TGTCTGGGGC GTGTTTCGCC 
GACTTCGGTC ACGTGGTGAC CTGCATCGAC AAGGATCCGT CGAAGATCTC GCGGTTGACG
GCGGGCGAGA TCCCGATCTT CGAGCCCGGC CTGGACGATC TGGTCGCGCG CAACGTTCGT
GAAGGCCGCC TGTTCTTCAC GCTCGACGGC GCCCAGGCGA TCAGGGAGGC CGAGGCGGTG
TTCATCGCCG TCGGCACCCC GACGCGGCGG GGCGACGGCC ACGCCGACTT GTCATATGTC
CACGCGGCGG CCGAGGAGAT CGCCGGCCTG ATCGACGGCT TCACCGTGGT GGTCACCAAG
TCGACCGTGC CGGTGGGCAC CGGTGACGAG GTCGAGGCGA TCGTCCGGAA AGTCCGTCCC
GACGCCCAGT TCGCCGTGGT CTCCAATCCC GAGTTCCTGC GCGAAGGCGC GGCGATCGAG
GACTTCAAGC GCCCTGACCG GGTGGTGGTG GGTTCCGAGG ACGAGCGGGC CCAGGCGGTG
ATGCGCGAAC TCTATCGCCC GCTGAGCCTC AACGAGACGC CGATCGTCTT CACCGGCCGG
CGGACCAGCG AACTGATCAA GTACGCGGCC AACGCCTTCC TGGCGATGAA GATCACCTTC
ATCAACGAGA TGGCCGACCT CTGCGAAAAG GTCGGGGCCG ACGTGCAGCA GGTCGCGCGG
GGCATCGGCC TGGACAAGCG GATCGGCAGC AAGTTCCTCA ACGCCGGCCC CGGCTACGGC
GGTTCATGCT TTCCCAAGGA CACCGTCGCC CTGGTGCGCA CCGCCGAGCA ATACGGCGCC
CCGGTCCGAT TGATCGAAAC CACGGTGGCG GTCAACGACG CGCGCAAGAA GGCCATGGCC
AACAAGGTCG CCCAGGTCCT GGGCCTGGAG GACCTGACCG GCAAGACCGT CGGCGTGCTG
GGGCTGACCT TCAAGCCCAA CACCGACGAC ATGCGCGACG CGCCCAGTCT GGATATCCTG
CCGGCCCTGC AGGCCATGGG CGCGACCGTC CAGGCCTTCG ACCCCGAAGG AACCCAGGAG
GCGATGCGCC TGCTGCCGGG CGTCGCGTTC AAGTCCGGGC CGTACGAGGC GGCGGAGGGC
GCCGACGTGC TGCTGATCCT CACCGAATGG GATCAGTTCC GCGCCCTGGA CCTGGATCGG
GTCAAGCTCC TGCTCAACGC CCCGGTCGTC GTCGATCTGC GCAACATCTA CCGGCCCCAC
GAAATGGTTC GCCACGGCTT TACCTACGCC TCCATCGGCA GGGGCTGA
 
Protein sequence
MRIAMIGTGY VGLVSGACFA DFGHVVTCID KDPSKISRLT AGEIPIFEPG LDDLVARNVR 
EGRLFFTLDG AQAIREAEAV FIAVGTPTRR GDGHADLSYV HAAAEEIAGL IDGFTVVVTK
STVPVGTGDE VEAIVRKVRP DAQFAVVSNP EFLREGAAIE DFKRPDRVVV GSEDERAQAV
MRELYRPLSL NETPIVFTGR RTSELIKYAA NAFLAMKITF INEMADLCEK VGADVQQVAR
GIGLDKRIGS KFLNAGPGYG GSCFPKDTVA LVRTAEQYGA PVRLIETTVA VNDARKKAMA
NKVAQVLGLE DLTGKTVGVL GLTFKPNTDD MRDAPSLDIL PALQAMGATV QAFDPEGTQE
AMRLLPGVAF KSGPYEAAEG ADVLLILTEW DQFRALDLDR VKLLLNAPVV VDLRNIYRPH
EMVRHGFTYA SIGRG