Gene Caul_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2081 
Symbol 
ID5899536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2228539 
End bp2230992 
Gene Length2454 bp 
Protein Length817 aa 
Translation table11 
GC content66% 
IMG OID641562570 
Productglycosyl transferase group 1 
Protein accessionYP_001683707 
Protein GI167646044 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.994259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCTAG GTTTGAAGCT CAAGAAGTCC GCCGGCGTAC CGGTGACCGC TGCGGTCCAC 
CGCCTGATTG CCGCCGCCAA CGCCGCGCGC GCCCGGTCGG ACTGGCGCGC CGCAGCCAAG
CATTACGACG CCGCCCTCAG GCGCGACCCG CACCTTGGGC ACGTCTGGAT CCAACTGGGG
CATGCCCTGA AGGAAAGCGG TGAGTTGTCG GCGGCCGACC GAGCCTATCA TCGTGCTGAA
AGCCTGCGGC CGGACGACGC CGACGCACAC CTGCACCTGG GCCATGTGGC CAAGCTACGC
GGCGATGTCG CGGGCGCGAT TCGCAGCTAT CTGACCGCTG CGCGGCTGGC GCCCAAGGCC
CCCCACGCCA TCGGCGAACT GCACACCCTC ATCGCCAACG GAGCCAATGT GCCAATCGAA
GCCATCCAGG GCCTGATCGA CCTCGAGGAT GACCCGATCA CGCGCTCGCC GCCAATGAGC
AGCGCCATCG CGGCAGCCCA GACCGCGATG ACTGCCCTGG TGACAGCCCT CAAGCAGCAG
GGCGGTCAGC CCGGCGCGCT GGAACGCGCG ACCTCGGCCG CCCACCTGAT CGCCGATCTC
GCCAGTGATC CGGTCTCGTC CGGCTCCCAA GACTCGGGGC CAGCCCTGAT CTTCGACGTC
TCGGACCTGC TGACCTATTT CCGTAACGTA CGTTCGCCAA CGGGCATTCA GCGCGTCCAG
ATCGAGATCA TCCTCAGCAG CCTGCAGTCC GGCAATACGG CGGTTCGCAT CTGCTGTTTC
CTGGAACAAC GCGACGAATG GGTCGAGATC CCCGCCCCGC TTTTCCTGCG GCTGAGTTGG
CTCAGCCTGG GCGACGCCGA GGACGATGGC GGCGAATGGA CCGCCGCCCT GACCCGCACG
CTGTTGCTGC TTAACATGGC GCCGCCGCTT GATTTTCCGA GGGGGGCCTT CCTGATCAAT
CTGGGCACCT CCTGGTGGCT CCAGAACTAT TTCCTGTTCG TGCGGCGGAT CAAACGCGAG
CGCGGTGTCC GTTACATCCC GTTTGTGCAC GACATGATTC CGGTCCTGCA CGGGGAGTTC
TGTCCCAAGG TGCTGACCCA GGACTTCATC TCCTGGGCGA TCGGCGTCTT CGAGCATGCC
GATTTCTTCT TCGTGAACTC GCAGTCCACC CGACGCGACC TGATCAAGGT GGGCGCGTTC
CTGGGGCGCG AAATCGATCC GCTAGCCATC TCGGTCGTGA CGCTGGACGC CGACACACGC
AAGCCCGACG CGCCTGCGCC GCGCGGGAAG ATATTGCGGC GTTGGGGGCT CAACGCCATC
CCCTACGTGC TGTTCGTCTC CACCATCGAG CCACGGAAAA ACCACCTGCG TGTCTTCGAG
GCGTGGATCG CGCTCCTCAA GCGCCATGGC TCGCGCAAGA CGCCCAAACT GGTCTGTGTG
GGCCATCCGG GTTGGCTCAA CGACAGCATA CACGATCAGT TGAACGCCCA TGAGGACCTG
CGCGCTCACG TCCAGGTGTT GCGTTTCGTG TCGGACGCCG ACCTGGCCGA ACTCTACAGC
GGCTGCCTGT TCACCCTCTA TCCCAGCCAT TACGAGGGTT GGGGCCTGCC GGTCACCGAA
TCCCTCTGCT ACGGCAAGGC GCCCTTGGTC GCCAACACCT CGTCCCTGCC CGAGGCCGGA
GGCCGCTTCG CGGTCTATTT CAACCCGGAT TCCACGGTCG AACTGATCGC CGCGTTGGAG
ACCCTGGCCT TCGACCACGA GGCGCGGCGC GCGCGCGAAC GGCTGATTAC GGCCGAGTTC
AAGCCCCGCG GCTGGGCCGT GCTGGCGCGG CAGATGGCTG ACGACCTCGT CGCCTGGGAA
GGCGTCGGTC GCCCCGTCCT CGGCGCCGAG GCGCCGGCCG CGCTTGTGGG CGCCTACCAC
TCGCTCGGCC GCAATCTGAA AACCAGGGTC TGGCCGGGCA TGCGGTCGGG CGAGGTATAT
CGCAGCGGTC CCAATTGGTG GGGTCCCGAC AACTGGGGGT GCTGGACCAA GCCGGGCGGT
TCGACCCTCC GCATGACCGT GCCGCAGCCG GGACCCATAA TCGCCTATCT GCACCTGCAG
GGGCTACCCG CTCAACGCTG CGGTTTTGTC GTCAAGACGA CGGGCGACGC GATCGTGCGA
CACGGCGAGA TCGACCGCGG CCAGCACAAA TGGCTGGCGA TCGAGATCGC CCCCGACGAG
TCCGAACCGC GCACCGTGAC GCTGGAAATC GAGGGAACGG CTTGCGAGAG CCTGGCGAAC
GTCACCGACA ATTCGGACGC CCGCGTGGTC TCGCTCGGTG TGGCGGGCTT CTTCCTTTGC
CGGGCCGATG ATCCGGCCGC CCGGGCCGCT TTCCTTGAAG CGGTAGCGAT CGGGAACATT
CACGATCTCG ACTTCAGTCG GGAGCCCCTC GAATACACCC CCTTGATCTC GTGA
 
Protein sequence
MVLGLKLKKS AGVPVTAAVH RLIAAANAAR ARSDWRAAAK HYDAALRRDP HLGHVWIQLG 
HALKESGELS AADRAYHRAE SLRPDDADAH LHLGHVAKLR GDVAGAIRSY LTAARLAPKA
PHAIGELHTL IANGANVPIE AIQGLIDLED DPITRSPPMS SAIAAAQTAM TALVTALKQQ
GGQPGALERA TSAAHLIADL ASDPVSSGSQ DSGPALIFDV SDLLTYFRNV RSPTGIQRVQ
IEIILSSLQS GNTAVRICCF LEQRDEWVEI PAPLFLRLSW LSLGDAEDDG GEWTAALTRT
LLLLNMAPPL DFPRGAFLIN LGTSWWLQNY FLFVRRIKRE RGVRYIPFVH DMIPVLHGEF
CPKVLTQDFI SWAIGVFEHA DFFFVNSQST RRDLIKVGAF LGREIDPLAI SVVTLDADTR
KPDAPAPRGK ILRRWGLNAI PYVLFVSTIE PRKNHLRVFE AWIALLKRHG SRKTPKLVCV
GHPGWLNDSI HDQLNAHEDL RAHVQVLRFV SDADLAELYS GCLFTLYPSH YEGWGLPVTE
SLCYGKAPLV ANTSSLPEAG GRFAVYFNPD STVELIAALE TLAFDHEARR ARERLITAEF
KPRGWAVLAR QMADDLVAWE GVGRPVLGAE APAALVGAYH SLGRNLKTRV WPGMRSGEVY
RSGPNWWGPD NWGCWTKPGG STLRMTVPQP GPIIAYLHLQ GLPAQRCGFV VKTTGDAIVR
HGEIDRGQHK WLAIEIAPDE SEPRTVTLEI EGTACESLAN VTDNSDARVV SLGVAGFFLC
RADDPAARAA FLEAVAIGNI HDLDFSREPL EYTPLIS