Gene Caul_4459 details

Gene Information

Locus tag: Caul_4459
Symbol:
ID: 5901920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4826532
End bp: 4828457
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 73%
IMG OID: 641564978
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_001686077
Protein GI: 167648414
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
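
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and is not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The last codon is assumed to be the stop codon, so it adds no residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


print(cds_lengths(4826532, 4828457))  # (1926, 641)
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.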


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCGAG CGGGAAAGAT CGGGGCGCAT GTGGCGCTGG CCTTCGTGGC CATGCTGCTG 
GCTCAACTGC TGACCGCCGG CGTCGAACCC TTCGCCCTCT CCGCCCTGGT CCACGCCGCG
CTGTTCGCCG CCTCGGCCCT GGCCGTCGAG ATGCTGTTCC AGGTCGAGCG CTCGCCCTGG
CGATTCTTCG CGGCCAGCGA CGGTCTGCGC CTGGCCCGGT CGGCCCTGCT GTCAGTGCTG
ACCTTCGCGC TTCTGGCCCG CATGGCCGAC GTGCGCCAGC CCGGCGGCCC GCGCACCCTG
ATCGCCGCCT TCCTGCTGCT GCTGACCCTG TTGGCCGGAC TACGCATCGT TCGCCGCACG
ATCCACGACA AGGTGCTGGT CGAATCCCTG ACCCGCCTGC GTCCGGCCCC CGCCTCGCCG
TCCCTGCCGC GCCTGCTGAT CGTCGGCTCG GCCAGCGAGG CCGAGGCCTT CCTGCGCGCG
CCCCTCGCCC TGGGCGAACG CTACGCCCCG GTCGGCGTGC TGACCCCCGA GGCCCGCGAA
ACCGGCGACG AGTTGGGCGG AGTCTGCATC CTCGGGGTGA TCGACGATTT CGACGCCGTC
ATGGCCCAGC TGCGGGACGG GGGCCTGCAG CCGTCGGCCC TGCTGTTCCT GACCGACGGC
CCGCTCAGCG CCTTCGGGAC CGAACGGCTG GGTCGGCTGA AGACCGAGGG CGTGCGCCTG
CTGCGGCGCC AGGGCCTGGT CGATATGAGC CCCATCGGAG GATCCGCCAG CGCGGCCCTG
CGCGAGATCA GCCTCGAGGA ACTGCTCAGC CGCGCGCCCG TGCGGCTGGA TCCCGAGCCG
GTCCGCGCCC TGGTCGCCGG GCGGCGGGTG CTGGTCACCG GGGCCGGCGG CAGCATCGGC
TCCGAGCTGG CCCGCCAGAT CGCCGCCAGC GGCCCGGCCC ACCTGACCCT GCTGGACTCG
GCCGAGGCCA ACCTGTTCCA CATCGACCGC GAGCTGGGCG AGGCCTGGCC CTCGCTGGCT
CGCCGCGACG TGCTGTGCGA CGTGCGCGAC GCCGACCGCG TGGCCCTGGT GTTCACCGCC
GAGAAGCCCG AGCTGGTCTT CCACGCCGCC GCCCTCAAGC ATGTCACCCT GGTCGAGAAC
CACCCCTGCG AGGGCGTGCG CACCAACGTG CTGGGCAGCC GCAACGTCGC CGCCGCCGCC
AAGGCCTGCG GCGCGGCCCA CCTGGCCCTG ATCTCCACCG ACAAGGCCGT GGCCCCGGCC
AGCGTGATGG GCGCGACCAA ACGGGTGGCC GAGGCCGTGG TGCGGCAATT CGGGGCCAGC
GAGGCGACCC GCGTCAGTGT CGTGCGGTTC GGCAACGTGC TGGGCTCGGC CGGCTCGGTG
GTGCCGATCT TCCAGCACCA GATCGCCCGC GGCGGCCCGG TCACCCTGAC CGACGCCGAG
GTCGAGCGCT ATTTCATGAC CATTCCCGAG GCCGTGCAGC TGGTCCTGCG GGCCGTGGCC
CTGTCGGCCG GCGATCTGGA GCCCCCCACC GGCGTCTTGA CGCTTGAAAT GGGCGAGCCG
GTCAAGATCA TCGACCTGGC CCGGCGGATG ATCGAGCTGC AGGGCCTGAC CCCCAGCCGC
GACATCGAGA TCAAGATCAC GGGCTTGCGG CCCGGCGAAA AGCTCACCGA GGCCCTGGTC
GATGTCAACG AGACCACCCG GCCCAAGGCC GACGGGGTCA CCGAGGCGAC GCCGCTCACG
CCCCTGCCGG TGATCGCCGC CGCCGCCCTG GCCGCGCTGG AAGCCGCCGC CCTGGCCGGC
GATGAGCTGA TGGTCCGTGA GCGGCTGTTC GCCCTGGTCG AGAGCCTGCG CGGCGCGCCC
CCCTCCAATC CGGTTCGGGC CAAGCCAGTT CGGGGCAAGA CGACCCGGGC GCGCACCCAG
GTCTAG
 
Protein sequence
MGRAGKIGAH VALAFVAMLL AQLLTAGVEP FALSALVHAA LFAASALAVE MLFQVERSPW 
RFFAASDGLR LARSALLSVL TFALLARMAD VRQPGGPRTL IAAFLLLLTL LAGLRIVRRT
IHDKVLVESL TRLRPAPASP SLPRLLIVGS ASEAEAFLRA PLALGERYAP VGVLTPEARE
TGDELGGVCI LGVIDDFDAV MAQLRDGGLQ PSALLFLTDG PLSAFGTERL GRLKTEGVRL
LRRQGLVDMS PIGGSASAAL REISLEELLS RAPVRLDPEP VRALVAGRRV LVTGAGGSIG
SELARQIAAS GPAHLTLLDS AEANLFHIDR ELGEAWPSLA RRDVLCDVRD ADRVALVFTA
EKPELVFHAA ALKHVTLVEN HPCEGVRTNV LGSRNVAAAA KACGAAHLAL ISTDKAVAPA
SVMGATKRVA EAVVRQFGAS EATRVSVVRF GNVLGSAGSV VPIFQHQIAR GGPVTLTDAE
VERYFMTIPE AVQLVLRAVA LSAGDLEPPT GVLTLEMGEP VKIIDLARRM IELQGLTPSR
DIEIKITGLR PGEKLTEALV DVNETTRPKA DGVTEATPLT PLPVIAAAAL AALEAAALAG
DELMVRERLF ALVESLRGAP PSNPVRAKPV RGKTTRARTQ V
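
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch using only the Python standard library. The helper names clean_sequence and gc_percent are hypothetical, and the example runs on the first two blocks of the gene sequence only.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("ATGGGTCGAG CGGGAAAGAT")
print(len(first_blocks), gc_percent(first_blocks))  # 20 55.0
```

Applied to the complete gene sequence, the cleaned length can be compared with the Gene Length field and the GC percentage with the GC content field. If the record is internally consistent, both should agree up to rounding.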