Gene Caul_2084 details


Gene Information

Locus tag: Caul_2084
Symbol:
ID: 5899539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2233104
End bp: 2234366
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 71%
IMG OID: 641562573
Product: glycosyl transferase group 1
Protein accession: YP_001683710
Protein GI: 167646047
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
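
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2233104
end_bp = 2234366

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1263 420"
```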


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACAGCT TGGCTCCGCA TCGCGACGCC TTCGCCACGT CGTACCGACC TGGCTGGGCG 
GTCGCGACCA CCCGGCCAGA CATCGTGTTG GATCTATCCC GCCTGCTGTC CCGGGTGTTT
CACGCCACGC CGACCGGGGT CGACCGGGTC GAGATGGCCT ATGCCCAGAC CCTGTTGAAA
CTGGCTCCCC AGCGGTTGCG CTTCACGGCC ATCCACCCCG CCGGATGGCA CGGGGATCTG
TCCACGGTCG CGGCGCGGCG TTTCCTTGAC GCCACTTTGG AGCAATGGAC TCACGGCGAA
GGCCACCGAA CGATCCTGGC GCGATGCCGG GACGCCCTGG CCGTCGGGCG GCGGCACGCG
CCGGGGTCGG GGGCGTGGCC CGCGGTCTAT CTGCATCTCT CCGCGCGCGG CCTGGAGCGA
ACCAACCTCC TGCGGTCCGC CCAAAGACGC CACCAGGCCA GGTTTGTCCC GTTCGTCCAC
GACCTCATTC CGCTGGAGCA CCCCGAATAC GCACGTCCCG GGGGGATGGC GCTGTATCGG
CGCAAGATCG CCGCCGTGGC CGACCTGGCC GACGCCGTGC TGGTCAATTC CGAGGCCACG
GCGCGCGCCC TGGCCCCCTA TCTGCACGCC GCCGGCCGCG AGATCTCGAT CCACGTCGCG
CCGCTGGCGT CCACCTTCGC CCCGCTCCCC GCGCGGGCCG AGGCGTCGGA CACGCCGCCC
TATTTCGTAG TGCTCGGCAC CATCGAGCCC CGCAAGAACC ATCTGCTGCT GCTGAACGTC
TGGCGACGGA TGGTCGAGAC CCTGGGGCCG TCGGCCACGC CTCGCCTGGT GGTGATCGGC
CGGCGCGGCT GGGAGAACGA GAACGTCCTG GACCTTCTCG ATCGCTGCCC GGCGCTTGAG
GGCGTAGTGA TCGAACGTGG TCGCCTGGCC GACCCGCAGG TTCGCGGCCT GGTGGCCGGC
GCCCGCGCCG TCCTCGCGCC GTCCTTCGCC GAAGGGTTCT GCCTGCCGAT GGTCGAAGCC
CTGGCCCTCA GGACGCCAGT CATCGCCAGC GATCTTGCTG TCCTGCGCCA GACCGGCGGG
GACGCGCCCG ACTATCTCGA TCCCCTCGAC GGCCCCGCCT GGTTCCGGGC GATCCTCGAC
TACGCCCTCC CGGGATCCGT GCCGCGCCGC GCCCAACTGG CCCGCCTTTC CGCCTGGCGT
CCGTCGAGCT GGGAGGACCA TGTCGGCGGG GTGCTCGACT TCCTGGCGGA GACGCAACCA
TGA
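
The GC content reported for this gene can be recomputed from the sequence itself. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper that strips the spacing used in the listing and counts G and C bases. Only the first line of the gene sequence is used as sample input here, so its result need not equal the whole-gene figure.

```python
def gc_percent(dna):
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        return 0.0  # avoid division by zero on empty input
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First line of the gene sequence, copied from the listing above.
first_line = "GTGAACAGCT TGGCTCCGCA TCGCGACGCC TTCGCCACGT CGTACCGACC TGGCTGGGCG"
print(round(gc_percent(first_line), 1))
```

Passing the full 1263 bp sequence instead of `first_line` gives the value to compare against the GC content field.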
 
Protein sequence
MNSLAPHRDA FATSYRPGWA VATTRPDIVL DLSRLLSRVF HATPTGVDRV EMAYAQTLLK 
LAPQRLRFTA IHPAGWHGDL STVAARRFLD ATLEQWTHGE GHRTILARCR DALAVGRRHA
PGSGAWPAVY LHLSARGLER TNLLRSAQRR HQARFVPFVH DLIPLEHPEY ARPGGMALYR
RKIAAVADLA DAVLVNSEAT ARALAPYLHA AGREISIHVA PLASTFAPLP ARAEASDTPP
YFVVLGTIEP RKNHLLLLNV WRRMVETLGP SATPRLVVIG RRGWENENVL DLLDRCPALE
GVVIERGRLA DPQVRGLVAG ARAVLAPSFA EGFCLPMVEA LALRTPVIAS DLAVLRQTGG
DAPDYLDPLD GPAWFRAILD YALPGSVPRR AQLARLSAWR PSSWEDHVGG VLDFLAETQP
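
The protein sequence follows from the gene sequence under translation table 11, in which GTG can act as a start codon and is then read as methionine. The sketch below illustrates this under stated assumptions: `translate_cds` is a hypothetical helper, the codon assignments are the standard ones that table 11 shares with table 1, and the sample input is shortened to the first 30 bases of the gene plus its final TGA stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (third base varies fastest).
AMINO = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)

# Shortened input: first 30 bases of the gene plus the final stop codon.
print(translate_cds("GTGAACAGCT TGGCTCCGCA TCGCGACGCC TGA"))  # prints "MNSLAPHRDA"
```

The output matches the first ten residues of the protein sequence listed above.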