Gene Caul_3234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3234 
Symbol 
ID5900689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3491711 
End bp3493741 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content70% 
IMG OID641563739 
Productglucosyltransferase MdoH 
Protein accessionYP_001684859 
Protein GI167647196 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2943] Membrane glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.519144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTGA TGAGCGAACG CGGCGCTTTC TTCGACCTGA CCTCCTCGCG GAGCGTCGCC 
TATGACGTGG CGTTCGCCCC CAGCGAGCGG GTCGAGATCC CGACCGACGC GCCGCTGGCC
ATGCCCGTCC AGCCCCTGGC TTTCTGGCAG GGCGCGGCCA AGCCGGTCGC CACGGCCCCC
GCCAATTCCC GCTGGGACAG CCTGACCCTG CGCCGTTGGG GCGTGTTCGT CGCCACCGCG
ATCATGGGCT TCGCCGGGTG GAGAGCCACC TACGACACCA TCGCCCTGGG CGGCGTGACG
CGCCTGGAAG CCGTCTGCAT CACCCTGCTG GCCCCGCTGT TCCTGGCCCT GGCCCTGTGG
TTCTGCACCG CCGTGGTCGG CTTCGTCGTG CTGCTGGGCA AGCCGAAGGA TCCGCTGGGC
ATCGACGATT CCGAGTCCTC GCCGATGCCT CGTCCCAAGG CCCGCACCGC CATACTGATG
CCGGTGCACA ACGAGGACGC CGCCGAGGTG TTCGCCCGCC TGCGCGCCAT GGACGCCTCG
ATCGCCGAAA CCGGCAACAG CCGCGCCTTC GACATCTTCA TCCTCAGCGA CACCCGCGAC
GCCGCCGTGG CCCTGGCCGA GCAAGCCTGC TTCGCCCGCT TCCGCCGCGA GGCCAACAGC
CACGTCTTCT ACCGCAAGCG CACCCAGAAC ACGGGCCGCA AGGCCGGCAA CGTCGCCGAC
TGGGTGGCGC GCTGGGGCGG CGACTACGAG CACATGCTGA TCCTCGACGC CGACAGCCTG
ATGACCGGCG AGGCCATGGT CCGCCTGGCC GACGCCATGG AGCGTCACCC GCGCGTGGGC
CTGATCCAGA CCATGCCGAT GATCATCAAC GGCCAGACGA TCTTCGCCCG CACCCTGCAG
TTCGCCACGC GCCTGTACGG CCGCGTCGCC TGGACGGGCC TGGCCTGGTG GTCGGGTTCG
GAGAGCTCGT TCTGGGGCCA CAACGCCATC GTCCGCACCC GCGCCTTCGC CGAGACCTGC
GGCCTGCCCA GCCTGGACGG TCCCAAGCCG TTCGGCGGCG AGGTGATGAG CCACGACGCC
CTGGAAAGCG CCCTGCTGCG CCGCGGCGGC TGGGGCGTGC ACCTGGCCCC CTATCTGGGC
GGCTCCTATG AGGAGAGCCC CTCCAACCTG CTCGACTTCG CCACCCGCGA TCGCCGCTGG
TGCCGGGGCA ATGTGCAGCA CATCCCGCTG ATCGGCCTGC CCGGCCTGCA CTGGATGAGC
CGCCTGCACC TGGTGATCGG TGTGCTGAGC TACGTGCTCT CGCCCCTGTG GTTCGTGGCC
CTGTCGTCGG GGATCATCTC GCGGATGCTG ATGCCCGAGC TGAAGAAGGC CGCCTTCACC
ATGGCCGACC TGCAGGCCGC GGCCCACGCC CTGATCGACT GGCGCGAGAT CCAGGCCACC
GCCTGGGCGA TGATCATCAC CTTCGTGCTG CTGTTCGGCC CCAAGATCCT GGGCTCGATC
CTGGTCTTCT CGCGCAAGAG CGAGATCAAG GGCTTCGGCG GCCGCCGCCG GATCCTGGCC
GGCCTCGCCG TCGAGATGCT GCTCTCGGCC CTGGTGGCCC CGATGCTGAT GTTCACCCAG
ACCCGCGCCC TGGTCGAGAT CCTGGCCGGC AAGGTCGGCG GCTGGGCCAC CCAGCGCCGC
GACGCCGACA AGGTCACCGG CAAGGAAGCC TTCGCGGCCA TGGGCTGGAT CAGCGTCACC
GGCCTGGTGC TGGCCGTGGC CTTCTGGTTC ACGCCGGACC TGCTGACCGC CACCCTGCCC
ATCCTGGCCG GCCTGATCCT GGCCGTGCCC CTGACCATGC TGGGCGCCCA CAAGATCGCC
GGCCTGGGCG TCAAGGCCAA CGGCCTGTTC ATGACCCCGG AAGAGCGCCG CCCGCCGGCC
ATCGTCCGCG CGGCCCTGGG CCTGGCCTGC GAACCTCCCG CCGCCTGGTC TACCCGCCAA
CGCCACACCG CGATGGTCGC CGAACAGCCG GCGGAACAGA ACGCCGCCTA G
 
Protein sequence
MDLMSERGAF FDLTSSRSVA YDVAFAPSER VEIPTDAPLA MPVQPLAFWQ GAAKPVATAP 
ANSRWDSLTL RRWGVFVATA IMGFAGWRAT YDTIALGGVT RLEAVCITLL APLFLALALW
FCTAVVGFVV LLGKPKDPLG IDDSESSPMP RPKARTAILM PVHNEDAAEV FARLRAMDAS
IAETGNSRAF DIFILSDTRD AAVALAEQAC FARFRREANS HVFYRKRTQN TGRKAGNVAD
WVARWGGDYE HMLILDADSL MTGEAMVRLA DAMERHPRVG LIQTMPMIIN GQTIFARTLQ
FATRLYGRVA WTGLAWWSGS ESSFWGHNAI VRTRAFAETC GLPSLDGPKP FGGEVMSHDA
LESALLRRGG WGVHLAPYLG GSYEESPSNL LDFATRDRRW CRGNVQHIPL IGLPGLHWMS
RLHLVIGVLS YVLSPLWFVA LSSGIISRML MPELKKAAFT MADLQAAAHA LIDWREIQAT
AWAMIITFVL LFGPKILGSI LVFSRKSEIK GFGGRRRILA GLAVEMLLSA LVAPMLMFTQ
TRALVEILAG KVGGWATQRR DADKVTGKEA FAAMGWISVT GLVLAVAFWF TPDLLTATLP
ILAGLILAVP LTMLGAHKIA GLGVKANGLF MTPEERRPPA IVRAALGLAC EPPAAWSTRQ
RHTAMVAEQP AEQNAA