Gene Caul_0825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0825 
Symbol 
ID5898280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp884888 
End bp886105 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content71% 
IMG OID641561306 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_001682454 
Protein GI167644791 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATGC CCCAGCTCGC CCTCGCGGTT CTCGTCCTGA CGGCCTGCGG CCCCACCGGC 
CAGGTCCAGG CCCAGCCCCA TCCGCCGGTC GAGACGGCGC CGCCCAACGC CGCGGGCCAG
ACGCCGGCCT TCCCGCAGCA GACCCGCGCG CCCGAGGAAA AGCTGGGCGT GGCCTACAAG
GTCGAGACCC TGGCCACCGG TCTGGAACAC CCCTGGAGCC TGGCCTTCCT GCCCGACGGT
TCGAAGTTGG TGAGCGAGCG GGCCGGCCGG CTGCGGATTC TGGGCGCCGA CGGCAAACTG
TCGCCGGCCG TCACCGGCCT GCCGGCGGTC TACGCCGAGG GCCAGGGCGG GTTGTTCGAC
GTGGCGCTGG ACCCCGACTA CGCCAGGAAC GGCCTGATCT ACTGGACCTA TGCCGAGCCG
CGCGAGGGCG GCAACGGCAC GACGGCGGCG CGCGGCAAGC TGGTGCTCGG CGCCGCGCCC
AGGGTCGAGA CCGTCCAGGT GATCTGGCGG CAAACGCCGA CCATGGACTC GCCCCTGCAT
TTTGGCGGCC GCCTGGCCTT CGCCCGCGAC GGATCGCTGT TCATCACCAC GGGCGAGCGC
TCGATCATCC CCGGCCGGAT GCAGGCCCAG AAACTGGACG CCGCCCTGGG CAAGGTCATC
CGCATACGGC CGGATGGCGC GATCCCGGCC GACAATCCGT TCGTCGGCGA TCCCCAGGCC
AAGCCGGAGA TCTGGTCCAG GGGCCACCGC AACGTCCAGG GCGCGACGAT CAATCCGTGG
ACCGGCCAGC TATGGACCGC CGAGCACGGC GCCCGGGGCG GCGACGAGAT CAACACCCCC
AAGGCCGGCA AGGATTATGG CTGGCCGACC ATCACCTATG GCGAGGAATA TTCCGGCAAG
CCGGTCGGCG ACGGGATCAC CCAGCACGAG GGCATGGAGC AGCCGGTCTA TTACTGGGAC
CCGGTGATCG CCCCCTCGGG CCTGGCCTTC TACAACGCCA GCCTGTTCCC GGCCTGGAAG
GGCAGCCTGT TCGTCGGCGG GCTCAAGGGC TACCTCGTGC GCCTGACGCT CAAGGACGAC
AAGGTGGTGG GCGAGGAGCG GCTGCTCTCG GAGCTCGACT CGCGGATTCG CGACGTGCGG
GTGGGTCCCG ACGGCGCGGT CTATGTGGTG ACCGACGAGG ACGACGGGCG GGTGCTGCGG
CTGACGCCGA AGGGGTAG
 
Protein sequence
MRMPQLALAV LVLTACGPTG QVQAQPHPPV ETAPPNAAGQ TPAFPQQTRA PEEKLGVAYK 
VETLATGLEH PWSLAFLPDG SKLVSERAGR LRILGADGKL SPAVTGLPAV YAEGQGGLFD
VALDPDYARN GLIYWTYAEP REGGNGTTAA RGKLVLGAAP RVETVQVIWR QTPTMDSPLH
FGGRLAFARD GSLFITTGER SIIPGRMQAQ KLDAALGKVI RIRPDGAIPA DNPFVGDPQA
KPEIWSRGHR NVQGATINPW TGQLWTAEHG ARGGDEINTP KAGKDYGWPT ITYGEEYSGK
PVGDGITQHE GMEQPVYYWD PVIAPSGLAF YNASLFPAWK GSLFVGGLKG YLVRLTLKDD
KVVGEERLLS ELDSRIRDVR VGPDGAVYVV TDEDDGRVLR LTPKG