Gene Caul_1198 details


Gene Information

Locus tag: Caul_1198
Symbol:
ID: 5898653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1261185
End bp: 1262351
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 71%
IMG OID: 641561681
Product: glycosyl transferase group 1
Protein accession: YP_001682826
Protein GI: 167645163
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
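
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in Protein Length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1261185
end_bp = 1262351

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1167 388
```

Under these assumptions the result agrees with the listed Gene Length (1167 bp) and Protein Length (388 aa).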


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCATCC TTCCCGACGA TTTCACCCTT CTGCAGGTGA CCCCCGAGCT GGAAACGGGC 
GGGGCCGAGC AGACGACGAT CGACGTGGCC CACGGCGTGA TCGCCCAGGG CGGCAGGGCT
CTGGTCGCCA CCAAGGGCGG CCGCATGGCC GCGCGGCTGG AGGCCGACGG CGGGCGCCTG
GCCCAGATGC CGGCCCAGTC GAAGAACCCC CTGGTGATGC TAGGCAACGC CGCCCGGCTG
GTCGACCTTA TCCGCCGCGA AAAGGTGAGC CTGGTCCACG CCCGCTCGCG CGCCCCGGCC
TTCTCGGCGC TCTGGGCGGC GCACGCCACC AAGGTGCCGT TCGTGGCCAC CTATCATGGG
GTCTACAACG CCAAGTCCAA CCTCAAGCGC TGGTACAACG CGGTGATGAC CAAGGGCGAC
CTGGTGATCG CCAATTCGGA ATATACCCGC GCCCATGTCG TCGCCGAGCA CGGGATCTCG
CCCGACCGCG TGGTGGCCAT CCCGCGCGGC GTGGACCTGA CCCGTTTCGA GCCCGGCCTG
GTCTCGGCCG ACCGGATCAA GGCGCTGCGC GACGCCTGGG GCGTTTTGCC CGAGGACCGC
CGGCTGAAGG TGCTGCTGGC CGGCCGCCTG ACCCGCTGGA AGGGCCAGGC CCTGGTCATC
GAGGCGATGG CGCGGCTGAA GGCGGTGGCC GACACGCGCA TCCTGCTGCT GCTGGTCGGT
GATGACCAGG GCCGCAAGGC CTATCGCGCC GAGCTCGAGC ACATGATCGC CCAGGCCGGA
CTGCAGGACA GCGTCAAGCT GGTGGGTCAC TGCGACGACA TGCCGGCCGC CTACCTGGTC
GCCGACCTGG CCATCGCCCC GTCGCTGGAG CCCGAGGCCT TCGGGCGCAC GGCCGTCGAG
CCGCAGGTGA TGGGCAAGCC GGTGATGGCC GCCGATCACG GCGCGGCGCG CGAGACGGTG
GTCGACCGCG AAACCGGCTG GCTGGTCGCC CCCGGCGACG CCGAGGCCTG GGCCCAGGCC
CTGTCCAACG CCTGCGACGC GGGGGCCGCG CGACGCCAGG CCATGGGCGC CGCGGCCCGG
GCGCGCGCCA GAAAACTGTA TTCTGTTGAC GCGATGGTCG AAGCCACGCT CAAGGTCTAC
GCACGCGTTC TGGAGACGAA GACTTGA
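
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence listed above. The sketch below shows one way to do this, assuming the sequence is pasted as printed, with 10-base blocks separated by spaces and line breaks; `gc_percent` is a hypothetical helper name, not part of any database tool.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and newlines used to lay out the sequence.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two 10-base blocks of the gene sequence, spacing kept as listed.
print(gc_percent("GTGTCCATCC TTCCCGACGA"))  # 60.0
```

Passing the full gene sequence in place of the two-block example gives the value for the whole gene.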
 
Protein sequence
MSILPDDFTL LQVTPELETG GAEQTTIDVA HGVIAQGGRA LVATKGGRMA ARLEADGGRL 
AQMPAQSKNP LVMLGNAARL VDLIRREKVS LVHARSRAPA FSALWAAHAT KVPFVATYHG
VYNAKSNLKR WYNAVMTKGD LVIANSEYTR AHVVAEHGIS PDRVVAIPRG VDLTRFEPGL
VSADRIKALR DAWGVLPEDR RLKVLLAGRL TRWKGQALVI EAMARLKAVA DTRILLLLVG
DDQGRKAYRA ELEHMIAQAG LQDSVKLVGH CDDMPAAYLV ADLAIAPSLE PEAFGRTAVE
PQVMGKPVMA ADHGAARETV VDRETGWLVA PGDAEAWAQA LSNACDAGAA RRQAMGAAAR
ARARKLYSVD AMVEATLKVY ARVLETKT
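
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of the standard NCBI layout (which translation table 11 shares with table 1) and assumes that the first codon is the initiator and is therefore read as methionine, which is why the gene starts with GTG while the protein starts with M. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is the start codon and encodes M.
        protein.append("M" if i == 0 else amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("GTGTCCATCC TTCCCGACGA TTTCACCCTT"))  # MSILPDDFTL
```

The output matches the first ten residues of the listed protein sequence.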