Gene Caul_3602 details


Gene Information

Locus tag: Caul_3602
Symbol:
ID: 5901057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3884925
End bp: 3886223
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 70%
IMG OID: 641564112
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_001685227
Protein GI: 167647564
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
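
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3884925
end_bp = 3886223
gene_length_bp = 1299
protein_length_aa = 432

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which encodes no residue.
codon_count = span_bp // 3
coding_codons = codon_count - 1

print(span_bp, span_bp % 3, coding_codons)
```

Under these assumptions the span equals the stated gene length, is a whole number of codons, and leaves one codon for the stop after the stated protein length.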


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGATCGTA TCGCCATCAC CGGCGGCGCG CAGCTGAACG GGATCATCCC GGTGAGCGGC 
GCCAAGAACT CGGCCATCAA GCTGATGGCG GCCAGCCTGC TGACCGACCA GCCGCTGCGC
CTGACCAACA TGCCGCGCCT GGCCGACACC AAGTTCCTGG GCAAGCTGCT CACCCGCCTG
GGCGCCCAGG TCGATGAGCG CGAGGGGCTG GATGGTTCGG AGACGGTGCT GCACGCCGCC
GAGATCACCA GCGGCTTCGC GCCCTACGAC CTAGTCCGCC AGATGCGCGC CTCGTTCAAC
GTGCTGGGTC CGCTGATCGC CCGCACCGGC CAGGCCAAGG TCAGCCTACC CGGCGGCTGC
ACCATCGGCG CGCGTCCCGT GGACCTGCAC CTGCAGGCCC TGGAAGCCCT GGGCGCCAAG
ATCGACCTGC ACGAGGGCTA TGTCTACGCC CAGGCCCCGC GCGGCCTGAA GGGCGCGGAG
ATCACCTTCC CGTTCGTCTC GGTGGGCGCC ACCGAGCACG CCATGCTGGC GGCCGTGCTG
GCCGATGGCG TCACCCACAT CCACAACGCC GCCTGCGAGC CCGAGCTGCT GGACCTGCAG
ATCTGCCTGA ACGCCATGGG CGCCAAGGTG GAAGGGGCGG GCACCCCGAC CATCACCATC
ACCGGCGTCG CCAAGCTGCA CGGCGCGACC CATTCGGTGA TCCCCGACCG CATCGAGATG
GGCACCTACG CCGTGGCCGC GGCCATGGCC GGCGGCGAGG TCCAGCTGAC CCGCGCCCGC
CCGGAACTGA TCGACAGCCT GCTGGTCAAG CTGGAAGAGG CCGGGGCCGG CGTGGTCCGC
ACCGAGGATG GGGTGATCAT CAAGCGCGAC GGTACGCGTC TGAACGCCGT CGACGTCGAG
ACCCAGCCCT ATCCGGGCTT CGCCACCGAC CTGCAGGCCC AGTTCATGGC CCTGATGACC
ACGGCCAAGG GCGAGAGCCG GATCCGCGAG ACGATCTTCG AGAACCGCTT CATGCACGCC
CCCGAGCTGA TGCGCCTGGG CGCCGACATC TCGGTGTCGG GCGGCGAGGC CATTGTGCGC
GGCGTCGACA GGCTGGAAGG CGCCGAGGTG ATGGCCACCG ACCTACGCGC CTCGGTCAGC
CTAGTGATCG CCGGCCTGGT GGCGCGCGGC GAGACCACGG TCAGCCGCAT CTATCACCTG
GACCGCGGCT TCGAGCGGTT GGAAGAAAAG CTGGGCGCCT GCGGAGCCCA GGTGCGCCGG
ATCAAGGGCG ACGCGGAAGG CGGCCCGGAT CATGACTGA
 
Protein sequence
MDRIAITGGA QLNGIIPVSG AKNSAIKLMA ASLLTDQPLR LTNMPRLADT KFLGKLLTRL 
GAQVDEREGL DGSETVLHAA EITSGFAPYD LVRQMRASFN VLGPLIARTG QAKVSLPGGC
TIGARPVDLH LQALEALGAK IDLHEGYVYA QAPRGLKGAE ITFPFVSVGA TEHAMLAAVL
ADGVTHIHNA ACEPELLDLQ ICLNAMGAKV EGAGTPTITI TGVAKLHGAT HSVIPDRIEM
GTYAVAAAMA GGEVQLTRAR PELIDSLLVK LEEAGAGVVR TEDGVIIKRD GTRLNAVDVE
TQPYPGFATD LQAQFMALMT TAKGESRIRE TIFENRFMHA PELMRLGADI SVSGGEAIVR
GVDRLEGAEV MATDLRASVS LVIAGLVARG ETTVSRIYHL DRGFERLEEK LGACGAQVRR
IKGDAEGGPD HD
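
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool the database used: it assumes translation table 11 (as listed in the record), in which the amino acid assignments match the standard code and an alternative start codon such as TTG is read as methionine when it opens the reading frame. The function names translate and gc_fraction are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("TTGGATCGTA TCGCCATCAC CGGCGGCGCG"))  # MDRIAITGGA
```

Passing the full gene sequence (spaces and line breaks included) to translate should give the protein sequence listed here, and gc_fraction on the same input can be compared with the GC content field.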