Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2119 |
Symbol | |
ID | 5899574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2282486 |
End bp | 2283748 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641562608 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001683745 |
Protein GI | 167646082 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0632513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.128982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACC AGGCGCGCCT TGGCGAGCCG TTCGGGGAGG GGGGCCGGGG CGAAGCCTCG CCGACCCCGC CGGAAGTCGT TGTCGATGTC TCCGGCCTGC TGTTCGGATC CCACCACGAT ACGCCGACGG GCATAGATCG TGTCGAGATG GCCTACGCGG AAACCCTGCT GCGGCGCCTG CCCCACCGTG TGAGCTTCGC GGCCCGCTAT CCGGGCGGCG GCTATGGGCG GCTTTCGAAC GGCGCCGTGG ACACCTTCCT TTCGGCCGTC CGCGACGTCT GGACGGATGG GGACGGCGGG GGCTCGGTCC GGCGGTGCTG GCGCGTGGCG AAGGCGATGC TCGGCGCGCG GGCGGTCTTC GCCGGCGCGG CGGCCCCGGG ACCCCGTGTC TATCTGCAAC TGTCGCCCCG GGGGCTGGAG CGGACGGACC ACTACCGGTC GGTGCTGCGG CGCGAGCAGG CCCGCCTGGT CTTGTTCGTC CACGATCTGA TCCCGCTCGA GCGCCCTGAG TTCGTGCGGG ACGGCGGCGC CGCGCGGTTC GCGCGCAAGC TTGAAACCGT CGTCGGTCTG GCGGACGGCC TGCTGGTGAA TTCCCGCGCG ACGGCGGCGG CGCTAGAGCC GTATCTGGTC CAGGCTCGCC GCGACATCCC GATGCGCGTC GCGCCGCTGG GCGTTTCGGC CGCTGTGCCG GCTCCGGCCG CGGCGAGACC GGGCAAACCC TACTTCGTCG CGCTTGGCAC CATCGAGCCG CGCAAGAACC ATCTGCTGCT CCTGCACATC TGGCGGCGCT GGGTCGAGCG CGAAGGAGCG GCGGCGACGC CGAGCCTGGT GTTGATAGGC CGGCGCGGCT GGGAGAACGA GAACGTGCTC GATCTCCTCG ATCGCTGCCC GGCCTTGAAA GACGCCGTGA TCGAGCACGG CCGACTCGGC GACGCCGAGG CGCGGGTCCT CATGCGCGGC GCGACAGCCG TGCTCTGCCC CTCCTTCGCC GAAGGCTACG GCCTACCGGT GGCCGAAGCG CTGCAACTGG GTGTCCCTGT CCTGGCCAGT GACATCGCCG CCCACCGCGA GGTCGGCGGC CATGCGCCAG ATTATCTCGA CCCGCTGGAC GGCCCTGCCT GGGCCGCGGC CGTGCGCGAC TACGCCCAGC CGGGCTCGGC GCGGCGGCGG CGGCAGTTGG TTCGCCTGGC GGGCTGGAAG GCCGCGACCT GGGCCGATCA CTTCGAGACC GCGCTCGATC TCATCCAGGA CGTGGCGCGA TGA
|
Protein sequence | MIDQARLGEP FGEGGRGEAS PTPPEVVVDV SGLLFGSHHD TPTGIDRVEM AYAETLLRRL PHRVSFAARY PGGGYGRLSN GAVDTFLSAV RDVWTDGDGG GSVRRCWRVA KAMLGARAVF AGAAAPGPRV YLQLSPRGLE RTDHYRSVLR REQARLVLFV HDLIPLERPE FVRDGGAARF ARKLETVVGL ADGLLVNSRA TAAALEPYLV QARRDIPMRV APLGVSAAVP APAAARPGKP YFVALGTIEP RKNHLLLLHI WRRWVEREGA AATPSLVLIG RRGWENENVL DLLDRCPALK DAVIEHGRLG DAEARVLMRG ATAVLCPSFA EGYGLPVAEA LQLGVPVLAS DIAAHREVGG HAPDYLDPLD GPAWAAAVRD YAQPGSARRR RQLVRLAGWK AATWADHFET ALDLIQDVAR
|
| |