Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2079 |
Symbol | |
ID | 5899534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2226349 |
End bp | 2227533 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641562568 |
Product | polysaccharide export protein |
Protein accession | YP_001683705 |
Protein GI | 167646042 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.238275 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCGT CCCTGTGGCG TTCGCCGTTT CCGCGGTCGT GGAGCCGGGG CGTCCGGCCG CTCCTGGGCT CGGCCCTGGC CTTGGCGCTG GACGCCTGCG CCAGCCTCCC GTCCAGCGGT CCGACGGCGG ACGCCATCAG CGCCGCGCAA CGACGAACCG CCGCTTTCAG CCTCGTGACC ATCGACACGG CCCTGGTGGA GCAGCTTGCC GCGCCGCCGC CGCCGGATCC GTCGCGCCTG GCCGGCCTGG GCGCCGCCGG GGCGGTCGAC GTTCTGGGGC CGGGCGACGT GCTTCAGGTG TCGATCTACG AGGTCGGCGC CGCCCTGTTC TCGGGTCGGT CGGGAGGGGC GATGGCGAGC GCGGCGGGCG CCTTCTCGCC GCCCTCCGGA TCGGCCGAGA CCCTGCCGCC CATCGTCGTT GGCCGCGACG GCGCGATCAA CCTGCCTTGG ATCGGTCGGC TGGCGGCGGC CGGCAAGACG CCCGACGATC TGGCCGCCGA GATCGCCGCG GCGCTTCACG GCAAGTCCCA GGATCCTCAG GTCGTGGTCA GTGTGCGGGA GAACGTGACC AACACCGTCA TGATGACGGG CGAGGTCAAG AAGCCGGGCC GCCTACCCTT GAGCCTCGCC GGCGAGCGCC TGTCGGACGC CATCGCCATG GCCGGCGGCC CGGCAAACGC GGTCCAGGAC AGCGTCGTTC TGCTTAGCCG CGGCGAACTC ACCGTTTCGG CGCCGCTCGG CGTCGTCGTG GCCGGCTCGC CGCAGGACGT GGCGCTTCGC CCGCGCGACC GGATCACCGT GCTCTATCAA CCCCGGACGT TCACCGTCTT CGGGGCCAGC GGGAAGGTGT CGGAGATCCC CTTCCAGAGC CCGCGTGTAT CGCTGGCCGA AGCCATCGCC CGGGCCGGCG GACCGGACGA CAGGCAAGCC GATCCCTCCG CCGTCTTCGT CTTCCGCTAT GCGCAGGCCG CGTCCGACGG TACGCCGCTG ACTGGCGCCA AACCCGTCGC CTACAGGCTC GACCTGCTGC AGGCGCAAAG CTACTTTCTG GCCCAGGGGT TCGAGATGAA ACCGCGCGAC GTGATCTACA TCGCCAACGC CCGCGCCAAT CAGCCCACCA AGTTCATCCA GATCCTCAAC ACCTTCTTCT CGCCAGTCTA CACGGCCAAG GTGCTGGCGC AGTGA
|
Protein sequence | MIPSLWRSPF PRSWSRGVRP LLGSALALAL DACASLPSSG PTADAISAAQ RRTAAFSLVT IDTALVEQLA APPPPDPSRL AGLGAAGAVD VLGPGDVLQV SIYEVGAALF SGRSGGAMAS AAGAFSPPSG SAETLPPIVV GRDGAINLPW IGRLAAAGKT PDDLAAEIAA ALHGKSQDPQ VVVSVRENVT NTVMMTGEVK KPGRLPLSLA GERLSDAIAM AGGPANAVQD SVVLLSRGEL TVSAPLGVVV AGSPQDVALR PRDRITVLYQ PRTFTVFGAS GKVSEIPFQS PRVSLAEAIA RAGGPDDRQA DPSAVFVFRY AQAASDGTPL TGAKPVAYRL DLLQAQSYFL AQGFEMKPRD VIYIANARAN QPTKFIQILN TFFSPVYTAK VLAQ
|
| |