Gene Caul_2839 details

Gene Information

Locus tag: Caul_2839
Symbol:
ID: 5900294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3078070
End bp: 3079302
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 70%
IMG OID: 641563334
Product: glycosyl transferase group 1
Protein accession: YP_001684464
Protein GI: 167646801
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
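
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon, which is consistent with the listed values but is not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3078070, 3079302)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```

Both results agree with the Gene Length and Protein Length fields of this record.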


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.000929043
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0434623
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTGA GGTTTGATCC GCTTTCGGCG CTCAGCGAAG CAGGAGAACC CCTGTTCAAG 
GCTGTCCGTC CGGGTGGAAA GCCGGTCAGG CTGGTCGATA CGACCATGCT CTATGCCCCC
CGTAGCGGCG GGGTTCGCCG CTATCTGAAC TCCAAACGAG CCTGGATCGC GGCGAATCGC
CCACAGGTCC GCCACACCCT CGTGGTGCCC GGCCCCCGCG ACGCGCACGA CGGCCATGGA
CGCGTCTCGA TCTACGCCGC GCCGTTGCCC TTCGGCGACG GCTATCGCTG GCCGGTGGTC
AAGAACGCCT GGATGGAGCG ACTGATTCGT CAGCGGCCGG ACATCATCGA GGCCGGCGAT
CCCTATACCC CGGGTCTGGC GGCCCTGAAG GCTGGCGACG CTCTGGGCGT GCCGGTGGTC
GGCTTCTGCC ACACCGACCT GGGCGCCTTG GCGGCCCTGC ACATCGGCGA ATGGGCCGAA
AAGCCCGTAC AGAAGCGCTG GGCGGCGATC TACAGCCAGT TCGACCAGGC CGTCGCCCCC
AGCCAGTTCA TCGCCGGGCG CCTGATCGAG GCCGGGGTCA AGAACGCCAT CGGCCTGCCG
CTGGGCGTCG ACACCGAGAT TTTCCGTCCG GGCCGCGGTG ACCGAGAGGC GCTACGTCGA
CGGCTCGGCC TGACCAGCCG CCATCGCATC CTGGTGTTCG CCGGCCGGCC GGCCAAGGAG
AAGAAGCTCG ACGTGCTGGT CGAGGCCGTG GAGCGGCTGG GCGATCCCTA TGTGCTGCTG
TTTGTCGGCG CGGGGGCGGG GGCGCCGTCC AGCGACCGGG TGATCTGCAT GGACTATCAG
CGCGATCCGC AGGGCCTCGC CGCGGTGCTG GCCGGCTGTG ACGCCTTCGT GCACGCCAAC
GACAACGAGC CGTTCGGCCT GATCGTGCTC GAGGCCATGG CCTGCGGCCT GCCGGTGATC
GGCGTGGCGG CCGGCGGGGT GGCCGAATCG GTCGATGAGA CGGTCGGAGC CCTGGCCACG
GCTTCGGAAG CCCGCGCCTT CGCCGAGGCC GTGGAATCGG TGTTCGCACG CGACGTCATC
GCCCTCGGCC AGGCCGCGCG CCTGCGGGCC GAGCAGCGGC ACGGCTGGGA CCCGGTGTTC
CGCAAGCTTT CGGCGATCTA CGGCCGGCTG ACCGGCTGCG CCGCGTTCGA GGACGCGCCC
GCGCCGGTCG CCGAACCGCC CGGCTGGAAC TAG
 
Protein sequence
MNLRFDPLSA LSEAGEPLFK AVRPGGKPVR LVDTTMLYAP RSGGVRRYLN SKRAWIAANR 
PQVRHTLVVP GPRDAHDGHG RVSIYAAPLP FGDGYRWPVV KNAWMERLIR QRPDIIEAGD
PYTPGLAALK AGDALGVPVV GFCHTDLGAL AALHIGEWAE KPVQKRWAAI YSQFDQAVAP
SQFIAGRLIE AGVKNAIGLP LGVDTEIFRP GRGDREALRR RLGLTSRHRI LVFAGRPAKE
KKLDVLVEAV ERLGDPYVLL FVGAGAGAPS SDRVICMDYQ RDPQGLAAVL AGCDAFVHAN
DNEPFGLIVL EAMACGLPVI GVAAGGVAES VDETVGALAT ASEARAFAEA VESVFARDVI
ALGQAARLRA EQRHGWDPVF RKLSAIYGRL TGCAAFEDAP APVAEPPGWN
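
The gene and protein sequences above are displayed in space-separated blocks of ten. A minimal sketch of how one might strip that grouping, compute GC content, and translate the gene with translation table 11 follows. It is an illustration, not the database's own method; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 bases of the gene are used as input.

```python
# Amino-acid assignments for NCBI translation table 11. They are the same
# as in the standard code; the tables differ only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAATTTGA GGTTTGATCC GCTTTCGGCG"
print(translate(prefix))  # MNLRFDPLSA
```

The translated prefix matches the first ten residues of the protein sequence. Running `gc_fraction` and `translate` on the full gene sequence would allow the GC content and Protein Length fields to be checked in the same way.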