Gene Caul_4541 details

Gene Information

Locus tag: Caul_4541
Symbol:
ID: 5902002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4914822
End bp: 4916072
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 75%
IMG OID: 641565060
Product: 3-deoxy-D-manno-octulosonic-acid transferase domain-containing protein
Protein accession: YP_001686159
Protein GI: 167648496
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1519] 3-deoxy-D-manno-octulosonic-acid transferase
TIGRFAM ID:
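
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends in a single stop codon. The function and constant names are hypothetical, chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4914822
END_BP = 4916072


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming an unspliced CDS ending in one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1251
print(protein_length(length_bp))  # 416
```

Both results agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields.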


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.200224
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCTCTCT ACCGCGCGGC CACCGGCGCG CTGGAGCCGT TCGCGCCCTT CCTCCTGGAA 
CGCCGCGCCA AGGCCGGCAA GGAGGACCGC GCGCGGCTGA ACGAGCGCCT GGCCCGGCCG
ACCACGCCGC GACCGGACGG TCCGCTGGTC TGGCTGCACG GGGCCAGCGT CGGCGAGAGC
CTGTCGATCC TGCCGCTGGT CGACCGCCTG CGCGCCGAGC GGCCGGACGT CCAGGTACTG
GTCACGTCCG GCACGGTGAC CTCGGCCGAG CTCTTGGCAC GGCGCCTGCC GGCCGGGGCG
ATCCACCAAT ATCTGCCGGT CGACACCCCC CGCGGCGCCC GGCGGTTCCT CGACCACTGG
CGGCCCAGCC TGGCGGTCTT CGTCGAGAGC GAGCTGTGGC CCAACCTGCT GCTGACCGCC
AAGGCGCGCG GCGTGAAGCT GGCCCTGGTC TCGGCCAAGC TGTCGGACAG GAGCTACGCC
CGCTGGCGAG CCCGGCCGTT CGCGGCCCAT GAACTGTTCA GCGGCTTCGA CCTGATCCTG
GCCCAGGACG CCCGCGCCGC CGAGCGTCTG GCCAGCCTGG GCGGCGCGGT GGGCGGCGAG
GCCGACCTGA AGTTCGGCGC CGCGCCCCTG CCCGTCGATG AGGCGGCGCT GACCAGCCTG
CGCGTGCGGC TCAGCGACCG GCCCGTCCTG CTGGCCGCCA GCACCCATCC GGGCGAGGAC
GAGATCGTGC TGCGGGCCTG GGGCGCCCTG GCGAGCCGCC CGCGCCTGGT GGTCGTCCCG
CGCCACCCCG AACGCGGCCC GGCCATCGCC GACCTGGCGC TGGCGACCGG CACCACCGTC
TGCCTGCGCA GCCTGGAGCC GGACGACTCC GCCGACATCA TCGTCGCCGA CACCCTGGGA
GAGCTGGGCC TGTGGTACCG CCTGGCCGAC CTGGCCCTGG TGGCCGGCAG CCTGGTGGCC
GGGATCGGCG GCCACAATCC GCTGGAACCG GCCCGCCTGG CCTGCCCGAT CGTCTCGGGG
CCGCATATCG AGAACTGGCT GACCGCCTAT GCCGACCTGC GGGCCGAGGA CGCCGTGGCC
TTCGCCGACG CCTCGGTGCT GGGCGCGCGC CTGGCCGACC TGCTGGCCGG GCCGGAGATC
ATGCGGCTGC AGGCGGCTCG CGCCCAGGCC TTCGTCGCCC GCCGCGACGC CGAGGCCCGC
GCCGGACTCG ACCGGATCCT GGAGCTTCTC GACGCGGAAG GCGGGGCATG A
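
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC fraction, assuming GC content is simply the share of G and C bases (the record does not say how its GC content figure was derived). The helper names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Drop the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Only the first displayed line of the gene sequence is used here;
# paste the full listing in its place to evaluate the whole gene.
first_line = "TTGGCTCTCT ACCGCGCGGC CACCGGCGCG CTGGAGCCGT TCGCGCCCTT CCTCCTGGAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```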
 
Protein sequence
MALYRAATGA LEPFAPFLLE RRAKAGKEDR ARLNERLARP TTPRPDGPLV WLHGASVGES 
LSILPLVDRL RAERPDVQVL VTSGTVTSAE LLARRLPAGA IHQYLPVDTP RGARRFLDHW
RPSLAVFVES ELWPNLLLTA KARGVKLALV SAKLSDRSYA RWRARPFAAH ELFSGFDLIL
AQDARAAERL ASLGGAVGGE ADLKFGAAPL PVDEAALTSL RVRLSDRPVL LAASTHPGED
EIVLRAWGAL ASRPRLVVVP RHPERGPAIA DLALATGTTV CLRSLEPDDS ADIIVADTLG
ELGLWYRLAD LALVAGSLVA GIGGHNPLEP ARLACPIVSG PHIENWLTAY ADLRAEDAVA
FADASVLGAR LADLLAGPEI MRLQAARAQA FVARRDAEAR AGLDRILELL DAEGGA
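
The gene sequence begins with TTG while the protein begins with M. Under translation table 11, which this record specifies, TTG is a permitted start codon, and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It assumes the input is a complete CDS in frame whose first codon is a valid initiator (it does not verify this), and the function name is hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code; it differs in its set of start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, reading the first codon as methionine."""
    seq = "".join(seq.split()).upper()  # remove display spacing
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues:
        residues[0] = "M"  # assumption: the first codon is a valid initiator
    protein = "".join(residues)
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence listed in this record.
print(translate_cds("TTGGCTCTCT ACCGCGCGGC CACCGGCGCG"))  # MALYRAATGA
```

The output matches the first block of the protein sequence listed in this record.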