Gene Caul_2119 details


Gene Information

Locus tag: Caul_2119
Symbol:
ID: 5899574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2282486
End bp: 2283748
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 72%
IMG OID: 641562608
Product: glycosyl transferase group 1
Protein accession: YP_001683745
Protein GI: 167646082
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
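
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2282486
end_bp = 2283748
gene_length_bp = 1263
protein_length_aa = 420

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# A coding sequence is read in codons of 3 bp each.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the computed span and protein length agree with the listed Gene Length and Protein Length.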


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0632513
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.128982
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACC AGGCGCGCCT TGGCGAGCCG TTCGGGGAGG GGGGCCGGGG CGAAGCCTCG 
CCGACCCCGC CGGAAGTCGT TGTCGATGTC TCCGGCCTGC TGTTCGGATC CCACCACGAT
ACGCCGACGG GCATAGATCG TGTCGAGATG GCCTACGCGG AAACCCTGCT GCGGCGCCTG
CCCCACCGTG TGAGCTTCGC GGCCCGCTAT CCGGGCGGCG GCTATGGGCG GCTTTCGAAC
GGCGCCGTGG ACACCTTCCT TTCGGCCGTC CGCGACGTCT GGACGGATGG GGACGGCGGG
GGCTCGGTCC GGCGGTGCTG GCGCGTGGCG AAGGCGATGC TCGGCGCGCG GGCGGTCTTC
GCCGGCGCGG CGGCCCCGGG ACCCCGTGTC TATCTGCAAC TGTCGCCCCG GGGGCTGGAG
CGGACGGACC ACTACCGGTC GGTGCTGCGG CGCGAGCAGG CCCGCCTGGT CTTGTTCGTC
CACGATCTGA TCCCGCTCGA GCGCCCTGAG TTCGTGCGGG ACGGCGGCGC CGCGCGGTTC
GCGCGCAAGC TTGAAACCGT CGTCGGTCTG GCGGACGGCC TGCTGGTGAA TTCCCGCGCG
ACGGCGGCGG CGCTAGAGCC GTATCTGGTC CAGGCTCGCC GCGACATCCC GATGCGCGTC
GCGCCGCTGG GCGTTTCGGC CGCTGTGCCG GCTCCGGCCG CGGCGAGACC GGGCAAACCC
TACTTCGTCG CGCTTGGCAC CATCGAGCCG CGCAAGAACC ATCTGCTGCT CCTGCACATC
TGGCGGCGCT GGGTCGAGCG CGAAGGAGCG GCGGCGACGC CGAGCCTGGT GTTGATAGGC
CGGCGCGGCT GGGAGAACGA GAACGTGCTC GATCTCCTCG ATCGCTGCCC GGCCTTGAAA
GACGCCGTGA TCGAGCACGG CCGACTCGGC GACGCCGAGG CGCGGGTCCT CATGCGCGGC
GCGACAGCCG TGCTCTGCCC CTCCTTCGCC GAAGGCTACG GCCTACCGGT GGCCGAAGCG
CTGCAACTGG GTGTCCCTGT CCTGGCCAGT GACATCGCCG CCCACCGCGA GGTCGGCGGC
CATGCGCCAG ATTATCTCGA CCCGCTGGAC GGCCCTGCCT GGGCCGCGGC CGTGCGCGAC
TACGCCCAGC CGGGCTCGGC GCGGCGGCGG CGGCAGTTGG TTCGCCTGGC GGGCTGGAAG
GCCGCGACCT GGGCCGATCA CTTCGAGACC GCGCTCGATC TCATCCAGGA CGTGGCGCGA
TGA
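
The GC content field in the gene information can be recomputed from a sequence written in this space-separated block layout. The following is a minimal sketch, not the database's own method; the function name `gc_content` is hypothetical, and only the first row of the gene sequence is used here as sample input.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in ("G", "C"))
    return gc_count / len(bases)


# First row of the gene sequence, used only as a short sample.
first_row = "ATGATCGACC AGGCGCGCCT TGGCGAGCCG TTCGGGGAGG GGGGCCGGGG CGAAGCCTCG"
print(f"{gc_content(first_row):.0%}")
```

Passing the full gene sequence, with its line breaks, to the same function gives the whole-gene value to compare against the listed GC content.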
 
Protein sequence
MIDQARLGEP FGEGGRGEAS PTPPEVVVDV SGLLFGSHHD TPTGIDRVEM AYAETLLRRL 
PHRVSFAARY PGGGYGRLSN GAVDTFLSAV RDVWTDGDGG GSVRRCWRVA KAMLGARAVF
AGAAAPGPRV YLQLSPRGLE RTDHYRSVLR REQARLVLFV HDLIPLERPE FVRDGGAARF
ARKLETVVGL ADGLLVNSRA TAAALEPYLV QARRDIPMRV APLGVSAAVP APAAARPGKP
YFVALGTIEP RKNHLLLLHI WRRWVEREGA AATPSLVLIG RRGWENENVL DLLDRCPALK
DAVIEHGRLG DAEARVLMRG ATAVLCPSFA EGYGLPVAEA LQLGVPVLAS DIAAHREVGG
HAPDYLDPLD GPAWAAAVRD YAQPGSARRR RQLVRLAGWK AATWADHFET ALDLIQDVAR