Gene Information |
Locus tag | Caul_2168 |
Symbol | |
ID | 5899623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2352137 |
End bp | 2353858 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641562659 |
Product | TPR repeat-containing protein |
Protein accession | YP_001683794 |
Protein GI | 167646131 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
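The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this), and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2352137
END_BP = 2353858

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the result agrees with the Gene Length (1722 bp) and Protein Length (573 aa) fields.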
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.28547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.066147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGTA TCAAGCTTTT AGCCGCCCTC TTGTCGGCCT CGGCCCTGAC CGCCTGCGCC ACCGCGACGC CGGGTCCGGC CCTGCGCGGC CACGTTTCCA CCGACTCCGG CTGGGGCGAC ACCCGCTCGG CCTACGGCCA GTTCCTCGCC GGGCAATCGG CCCTGCAGAA CGGCTCCAAC GCCCAGGCCG CCGCCTATTT CGACGAGGCG CGCGAGCTGG GCGAGGATCC CGGCGTGCTG ATCGACCGCG CCTTCACCGC GGCCTTGCTG GCCGGCGACG TCCAGCGGGC CGCCGCCATC GCCCCGCGCG GCGGCGAGAC CAACGAGGCC GTCCGCCGCC TGGGCGTCCT GACCCGCGTG GTCGACCTGC TGGCCAGCGG CAAGGGCCAC GACGCCCAGA CCCTGCTCAA GAGCGAGACG ATCGGCTTCC CGCATCGCCC GGCCGCGGTG CTGCTGGCCC CCTGGCTGGC CGCCGCCGCG GGCGACAACA ACGGCGCCGT GGTTCGTCCC GAGCTGCAGG GCGACCGCTT CGTGCAGTAT TTCGGTCAAC TGGGCCAGGC GCGGCTGTAT GAGCGGGCCA GGCGCTATGA CGAGGCCGAG ACCGACTACA AGGCGCTGAT GGCGGCCGAG AACGCCCGCG CCCTGTTCGT CGAGGACTAC GGCGCGTACC TGGAGCGCCG CAAGCGCCAC GACGACGCCG TGGCGCTGTA CGACTCGGCC CTGACCCGCG ATCCGGACGA CCAGGGGCTG CTCAAGGCCC GCGCCCGCGC CGCCGCCCAC GGTCCGGCCC CGGCCATGCC GACCGAGAAG CAGGGCGCCG CCCACGCCCT GGTCGCCTGC GCCGCCACCT TCGCCCAGGA GCGCCAGAGC CAGTTCAGTC TGGCCTATCT GCGCCTGGCG CTTCGGCTGG ACCCCAAGCG CGACGACGCC TGGCTGCTGG TCGGCGACCT GCTGGCCCAG GACGACGACG ACGCCGGCGC GCGCGAGGCC TACGCCAAGG TGCTTCCCGG CTCGCCCCGC TATGTGGCCG CCCAGTCCAA GCTGGCCTGG AGCTATCAGA GCGGCGGCGA CAAGCCCAAG GCCATCGCCC TGGCCCAGCA GGTCGCCGCC GCCGCGCCCA AGGACCGCGA CGCCCAGATC ACCTATGCCG ACCTGCTGCG CGCCAACGAT CGCTGGGCCG AGTCGGCCGC GGCGCTGGAC CCCCTGATCG CGTCGGAGGC CGCCAAGCCC GACTGGCGGC TGCTCTACAT GCGCGGCATC GCCCTGGAGC GCGCCGGCCG CTGGAGCGAG TCCGAGCGCG ACCTGCTGGC CGCCCTCAAG CTCAGCCCGG ACGATCCCGA CCTGCTGAAC TATCTGGGCT ATTCCTGGAT CGATCGCGGC GAGCACCTGC CCGAGGCGAT CGCCATGGTC CAGAAGGCCG TCGAGGCCCG CCCGCAGTCG GGCGCCATGC TCGACTCCCT GGGCTGGGGC TACTACCGCC AGGGCGACTA CAAGACCGCC GTGGCCAAGC TGGAGCAGGC CGTCGAGCTC GAGCCCGGCG ATCCGGACGT CAACGGCCAC CTGGGCGACG CCTATTGGCG CATCGGGCGC AAGGTCGAGG CGCGCTACCA GTGGCAGCGC GTGCTCAGCC TGGAGCCGGA CGAAAAGCAG AAGGCCGAGG CGATGGCCAA GCTGAAGGAC GGGCTGGACG GTCCGCCGGC CAAGGTCGCC TCGGCGGGGT AG
|
Protein sequence | MTRIKLLAAL LSASALTACA TATPGPALRG HVSTDSGWGD TRSAYGQFLA GQSALQNGSN AQAAAYFDEA RELGEDPGVL IDRAFTAALL AGDVQRAAAI APRGGETNEA VRRLGVLTRV VDLLASGKGH DAQTLLKSET IGFPHRPAAV LLAPWLAAAA GDNNGAVVRP ELQGDRFVQY FGQLGQARLY ERARRYDEAE TDYKALMAAE NARALFVEDY GAYLERRKRH DDAVALYDSA LTRDPDDQGL LKARARAAAH GPAPAMPTEK QGAAHALVAC AATFAQERQS QFSLAYLRLA LRLDPKRDDA WLLVGDLLAQ DDDDAGAREA YAKVLPGSPR YVAAQSKLAW SYQSGGDKPK AIALAQQVAA AAPKDRDAQI TYADLLRAND RWAESAAALD PLIASEAAKP DWRLLYMRGI ALERAGRWSE SERDLLAALK LSPDDPDLLN YLGYSWIDRG EHLPEAIAMV QKAVEARPQS GAMLDSLGWG YYRQGDYKTA VAKLEQAVEL EPGDPDVNGH LGDAYWRIGR KVEARYQWQR VLSLEPDEKQ KAEAMAKLKD GLDGPPAKVA SAG
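The sequences above are printed in space-separated blocks of ten characters, so the spaces must be removed before any length or composition check. The following is a minimal sketch of how the Gene Length and GC content fields could be recomputed from the gene sequence. The helper names are hypothetical, and the example string is only the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase the result."""
    return "".join(blocked.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First three blocks of the gene sequence from this record (illustrative only).
prefix = clean_sequence("ATGACGCGTA TCAAGCTTTT AGCCGCCCTC")
print(len(prefix), round(gc_fraction(prefix), 3))
```

Applied to the full gene sequence, the same two helpers give values that can be compared against the Gene Length and GC content fields.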