Gene Caul_4941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4941 
Symbol 
ID5902403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5338696 
End bp5339940 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content70% 
IMG OID641565461 
Productglycosyl transferase group 1 
Protein accessionYP_001686559 
Protein GI167648896 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.421428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGC TCAGGGCGTT GGCTTGGCGC GTCGCGCCCG ATCTTTGCCA TGTCATTTCG 
CTCCGCCTCC CCGGCCTGCG CGCCTACTGG TTTCATGGAA GACGCCGGCG GCGCGGGCGG
TCCTTGTCCG AAGGTCCGGT GATCGTCGCG GGATTTCACG GCGCGGTCCT GGGTCTGGGC
GAGGCCGCCC GGGGCACCGT CACGGCGCTG GCCGCGACCG GCATCGAGGC CCAGGCTTGG
GACGTGTCCG CCCAGCTGGG CCACGTGCGC CGCTTCGATA TCGGCGAGGT GGCGACCCCC
CCGCCCGGCC CCGGCACGAT CATCACCCAG ATGAACCCGG CGGAGCTGAT TCGCCTGGTC
AGCGCGACGC GCGGCGCGCC CTTCGAAGGA AAGCGCTCCA TCGGCTACTG GGCCTGGGAA
TTGATGGACA TTCCCGAGGC CTGGAAGCCG GCCTTCCGCT ATGTGGACGA GATCTGGACG
CCGTCGAACT TCTGCGCCGA GGCCATTCGC CGTTCCGCGC CTCGCGACCT GCCGATCAAG
GTCGTCCCTC ATCAGGCTCC CCTGAACCAC GCCGCGCCCA ACCGGGAGCG GTTTGGCCTG
TCGCCGGACC ATGTCGTCGT GCTCTGCGCC TTCGATCTGA GATCCACCCT GGCCCGCAAG
AATCCGCTGG GCGCGCTGGA GGCCTTCCGG ATCGCCGCGG CCAAGGCCAA GCGGCCGGTG
ACCCTGGTGT TCAAGACGGT CGGCGGCGCC GACGCTCCCG ATAGCCTGGC GACGCTGCGC
GCGGCGATCG GCGACACCCC CGACGTGCTC GTGCTGACCG AGTCGTTGAG CATGGGCGCT
CGCGACCAGC TCATGGCCAG CTGCGACATC TTTCTTTCGC TGCACCGATC GGAGGGCTTC
GGCCTGCTGC TGGCCGAGGC CATGGCCGCC GGCAAGGCCG TGGTGGCGAC GGGCTGGTCG
GCCAACATGG ACTTTATGGA CGCGGAGTCG GCGATGCTCG TGCCCTACGC CCTTTGCCCC
GTCCGCGACC CCCAGGGTCT GTACCAAAAA GGCGTCTGGG CCGAGCCCGA CACAGAGGCC
GCCGGCCGGG CCCTCGCGGA ACTGATCAAC AACCCCGATC AACGCGCCGA ACTCGGCGCC
AAGGCCCTGG CCGCCGTCCG CCAACGTCTG AGCCCGCCGG CCATCGCCGC GATCATGCGA
CGGGCCTTTG ACGGGTCGCC CGTCCGCAAG GGGGCCAACG GGTGA
 
Protein sequence
MKQLRALAWR VAPDLCHVIS LRLPGLRAYW FHGRRRRRGR SLSEGPVIVA GFHGAVLGLG 
EAARGTVTAL AATGIEAQAW DVSAQLGHVR RFDIGEVATP PPGPGTIITQ MNPAELIRLV
SATRGAPFEG KRSIGYWAWE LMDIPEAWKP AFRYVDEIWT PSNFCAEAIR RSAPRDLPIK
VVPHQAPLNH AAPNRERFGL SPDHVVVLCA FDLRSTLARK NPLGALEAFR IAAAKAKRPV
TLVFKTVGGA DAPDSLATLR AAIGDTPDVL VLTESLSMGA RDQLMASCDI FLSLHRSEGF
GLLLAEAMAA GKAVVATGWS ANMDFMDAES AMLVPYALCP VRDPQGLYQK GVWAEPDTEA
AGRALAELIN NPDQRAELGA KALAAVRQRL SPPAIAAIMR RAFDGSPVRK GANG