Gene Caul_3563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3563 
SymbolmdoG 
ID5901018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3847059 
End bp3848600 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content71% 
IMG OID641564071 
Productglucan biosynthesis protein G 
Protein accessionYP_001685188 
Protein GI167647525 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCGA GTTCCGTGAA CCGGCGCACG GCGCTCGGCG GGTTGTCGCT GTCGGCGATC 
CTCGCCCTGG CGCCCAGGCT GGCGGCGGCC CAGACGCCGG CCGTCGACGC CGCCTTGAGC
GAGCCGTTCT CGGTCGAGCT TCTGCGCGAC CGGGCCCGCC GCCTGGCCGC CGCGCCCTAC
GCCAAGCCGC CGGCCGGGCC GACGATGACC GGCCTGTCCT ACGACGACTA TTTCCAGATC
CGCTTCCGCA AGGACGCCGA CCTCTGGAAG GATCGCCGGC CGCCATTCCG GGCCGAGTTC
TTCCCGCCGG CCTATCTCTA CCCGCGTCCG CTGCCGATCT ACGAGGTCGC CGCGGGCCTG
GCCCGGCCGA TCGCCTTCGA CCCGGCGATG TTCGACATCC CCGGCGAGCA CGCCGACGCG
GCCAAGGGCT GGAGCGGCTT TTCGGGGTTC CGGCTGCTGT GGCCGCTGAA CGAAGCCGAG
AAGATGGACG AGATCGCGGT GTTCCAGGGC GCGAGCTATT TCCGCAGCCT GGGGCGCGGC
CAGCGCTACG GCCTGTCGGC GCGGGGCCTG GCGATCGGGG CGGGCGAGCC GAACGAGGAG
TTTCCGGACT TCACCGCCTT CTGGCTGGAA CGGCCCCCCA TCGAGAGCGA GGCCATCGAT
GGTGATGCGG TGACGGTCCA TGGCCTGATG GACAGCCCCA GCTGCGCGGG CGCCTACAGC
TTCACGATCC GTCCGGGCGA GGACGTGGTG TTCGACGTCG CCGCCGCCGT CTTTCCCCGG
ACGACCCTGG CCAAGGCCGG CGTCGCGGCG ATGAGCAGCA TGTTCCTGTT CAATGTCGCC
GACGCCTCGC CGGTCGATGA CTTCCGCACC GCCGTGCACG ACAGCGACGG CCTGGCGATC
TGGACGGGGC GGGGAGAGCA CCTGTGGCGA CCGCTGCGGA ACCCCAAGGT CCATCGCCTG
AGCCACTTCC CGGACGACAG CCCCAAGGGC TTTGGCCTGG TGCAGCGGGC GCGGGCGCTG
GAGGACTTCG GCGACCTGCA GGCCCGCTAC GACCTGCGGC CCAGCCTGTG GGTCGAGCCG
CTGGACGCCT GGGGCAAGGG CGCGGTGCAT CTGGCCGAGT TGGCGACCAC CAAGGAAACC
GACGATAACA TCGCCGTCTT CTGGCGGCCC GAACAGTCCT GGGCCGCCGG GTCTCAGGTC
GACCTGCGCT ATCGCCTCTA CTGGGGCCAG GAACGGTTCG CCAGTCCCGA CGCCCGGGTG
TTCCGCACCC GCACGGGCGC CGACGCCCAG GCCGGCCTGC GGCTGTTCAC GGTCGACCTG
CAGGGCGGCG CCCTGGCCGA GGGGCTGGAC GGCGTGGCGC TGGAGATCCA GGCCGACGGC
GGGACGATCG CCTGGAAGGA CCTGTCGCCC TATCCCGACG AAGCCTCGGC CCGGGCCGCC
TTCGGCCTGC GCCCGACCCA GGCCAGCGTG GATCTGTCGC TGCGGCTGGT CCGCGGTGAT
CAGCCGATTT CCGAGACCTG GCGCTATCCC TGGACGGCTT AA
 
Protein sequence
MFSSSVNRRT ALGGLSLSAI LALAPRLAAA QTPAVDAALS EPFSVELLRD RARRLAAAPY 
AKPPAGPTMT GLSYDDYFQI RFRKDADLWK DRRPPFRAEF FPPAYLYPRP LPIYEVAAGL
ARPIAFDPAM FDIPGEHADA AKGWSGFSGF RLLWPLNEAE KMDEIAVFQG ASYFRSLGRG
QRYGLSARGL AIGAGEPNEE FPDFTAFWLE RPPIESEAID GDAVTVHGLM DSPSCAGAYS
FTIRPGEDVV FDVAAAVFPR TTLAKAGVAA MSSMFLFNVA DASPVDDFRT AVHDSDGLAI
WTGRGEHLWR PLRNPKVHRL SHFPDDSPKG FGLVQRARAL EDFGDLQARY DLRPSLWVEP
LDAWGKGAVH LAELATTKET DDNIAVFWRP EQSWAAGSQV DLRYRLYWGQ ERFASPDARV
FRTRTGADAQ AGLRLFTVDL QGGALAEGLD GVALEIQADG GTIAWKDLSP YPDEASARAA
FGLRPTQASV DLSLRLVRGD QPISETWRYP WTA