Gene Caul_4805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4805 
Symbol 
ID5902267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5198326 
End bp5199345 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content72% 
IMG OID641565325 
Productglycosyl transferase family protein 
Protein accessionYP_001686423 
Protein GI167648760 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACTC AAAAATCGAC CTTCGACGAC GTGCTGATCG TCATTCCCTG TCTCAACGAG 
GCCCGGCACC TGCCCGGACT GTTGACGGTG CTGGGGCGGG AGGCGCCGGC GGCGCTGATC
GTCGTGGCGG ATGGCGGCAG CACCGACGGC AGCCTCGACA TCGTTCGGGA CTTCGCCGCG
CGCGGCGCCC GAGTCCAGCT AATGGAGAAC CCGCGCCGCA TCCAGAGCGC CGGCGTCAAT
CTGGCCGCTC GCCGGTTCGG GGCCGGTCGC AGCTGGATGA TCCGCGTGGA CGCCCACTGC
GGTTATGGTC CCGGCTTCCT GACCGGGCTG CTGGCGGCCG CGGACCGGAC AGGAGCCACC
TCGGTGGTCG TGCCGATGGC GACGGAGGGC GAGACCTGCT TCCAGAAGGC CTGCGCCGCG
GCCCAGAATT CGGTGCTGGG CACCGGCGGT TCGGCGCACC GGCGCCTGGG CGACGGGCAG
TTCGTCGACC ATGGCCATCA CGCCCTGTTC CGGCTGGAGG CGTTCCTGGC GGCGGGCGGC
TACGACGAGA CCTTCAGCCA CAACGAAGAC GCCGAACTGG ACGCCCGACT GGTCCAGGCG
GGCGCCCGGA TCTGGCTCGA GCCGGCCGCG GCGATCGTCT ACTACCCGCG CCGGACGCCG
GGGGCGCTGT TTCGGCAGTA CATCAAATAC GGCGAGGGGC GGGCGAAGAC CATCCAGCGT
CACCGGCCAA AACTGAAGGT CCGGCAGATG TTGCCCCTGG TCGTGGCCCC GGCGGTGCTG
GTCGCCCTGG CCGGGTTCGC CTGGCCGCCG CTGGCCCTGC CGGCCCTGAT GTGGGCCGCG
CTCTGCCTGG GATTCGGCGT CCTGCTCGGC GTGCGCCAAC GCAGCCCTTG CGCGGCGCTG
GCCGGCGTGG CGGCGATGAT CATGCACTTC GCGTGGTCGG CCGGTTTCCT GCGCCAGATG
CTGCTGGGCC GCCGCCCCGG CGCGACGCCC GTCGCGCTGA GCACGGAGCC CGCCGGATGA
 
Protein sequence
MTTQKSTFDD VLIVIPCLNE ARHLPGLLTV LGREAPAALI VVADGGSTDG SLDIVRDFAA 
RGARVQLMEN PRRIQSAGVN LAARRFGAGR SWMIRVDAHC GYGPGFLTGL LAAADRTGAT
SVVVPMATEG ETCFQKACAA AQNSVLGTGG SAHRRLGDGQ FVDHGHHALF RLEAFLAAGG
YDETFSHNED AELDARLVQA GARIWLEPAA AIVYYPRRTP GALFRQYIKY GEGRAKTIQR
HRPKLKVRQM LPLVVAPAVL VALAGFAWPP LALPALMWAA LCLGFGVLLG VRQRSPCAAL
AGVAAMIMHF AWSAGFLRQM LLGRRPGATP VALSTEPAG