Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2085 |
Symbol | |
ID | 5899540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2234403 |
End bp | 2235524 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641562574 |
Product | capsule polysaccharide biosynthesis protein |
Protein accession | YP_001683711 |
Protein GI | 167646048 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.817087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGACG GCGCCCCGCC CCAAGCGGCG ATCGTCGCCT GCGTCGGGGT CGCGGCCTGG AAGCGGGCCT CGGTGGCGCG GCTGATCGCG GCGGACGGCC GAGCAAGCGG ACCGTCCGCG GACCTCGGTC GCCCAGCCTT CCGCGCGCGC GCCGGCCCGG CCCTGCGGCG GGCTCGCGAC CTGGGCGGCG CTGTCGGCGT CTGGCCGTCG CGCGCGCCCG CCGACATCGA AAGCCAGGCC CAGGCGCTGG GCGTGCCCCT GGTCTGCATC GAGGACGGCT TCATCCGCTC GGCGGGCCTG GGCGCGGAGT GCCGCCCGCC GGCCTCGATC GTCCTTGATC GCGCCGGCGT CCATTTCGAC CCCCGCCGCC CGAGCGACCT GGAGTTGCAC CTGACGCATG ACCATTTCGG CGAAACCCTG ACGTCCCGCG CCCAACAGCT GATCGAACGG ATCGTCGCCC TTGGCGTCAC CAAGTACAAT CTGTCGGGCC AGGCGCCGCC ATTCCGCGGC GGCCGGCGCA CCGTTCTCGT TCCCGGCCAG GTCGAGGACG ATCTGTCGGT CAAGCTGGGC GGGGCCGGCG TCGCCGGCAA TCTCGACCTT CTTCGACGGG TCCGTCAGAT CGAGCCGGAC GCGGTTGTGC TCTACCGCCC CCATCCGGAC GTCGAGGCGG GGTATCGCAA GGGCCTGATC CGCGACGCCG ACGCGCTGCG ATACGTCGAT CAGGTCCTCC GCGGCCACGC CCTGCCAGCA CTCCTGACCA GCGTCGATGC GGTCCACGTC CTCACCTCGC TGACCGGTTT CGAGGCCCTG CTGCGGGGGC GGGAGGTCGT CGTCCACGGC CAGCCCTTCT ATGCCGGCTG GGGTCTGACG CGGGACCTGT CCCCGCCCCC GCGCCGGGGA CGGCGGCTCG CCCTGGCCGA GTTGGCCGCC GCCGCCCTGA TCCTCTATCC CCGCTATATC GACCCCATGA CGGGCGAGAC CTGTTCGCCC GAGACCCTTG TCGCTCGCCT GGCCGACCAG CCAGAGCCGC GACCGGGCCT GCTGCCGGCG CTCCGGCGCC TACAGGTCGG CGCGTTCCGA TCCCTCGACA TGGCCGCGGC GACGAGCATG GCCAATGGCT GA
|
Protein sequence | MIDGAPPQAA IVACVGVAAW KRASVARLIA ADGRASGPSA DLGRPAFRAR AGPALRRARD LGGAVGVWPS RAPADIESQA QALGVPLVCI EDGFIRSAGL GAECRPPASI VLDRAGVHFD PRRPSDLELH LTHDHFGETL TSRAQQLIER IVALGVTKYN LSGQAPPFRG GRRTVLVPGQ VEDDLSVKLG GAGVAGNLDL LRRVRQIEPD AVVLYRPHPD VEAGYRKGLI RDADALRYVD QVLRGHALPA LLTSVDAVHV LTSLTGFEAL LRGREVVVHG QPFYAGWGLT RDLSPPPRRG RRLALAELAA AALILYPRYI DPMTGETCSP ETLVARLADQ PEPRPGLLPA LRRLQVGAFR SLDMAAATSM ANG
|
| |