Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0246 |
Symbol | |
ID | 5897520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 272743 |
End bp | 273981 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641560730 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001681881 |
Protein GI | 167644218 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.81066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCG CAGCCTCGTC CGCCCCCACC AGCCTGATCC GTTCTCTTCC GCCGCGCGCG CTGAAGGCCG CGATCGAGGC GCACACGCGC TACCTGAAGG GGCGCCCGGG CGGGACGCGC GCCAACCTCT CCTACGTCGA TCTCGACAAC GTCAACCTGG AGGGCGTCGA CCTCTCCGAG GCCGATCTGA CCGGCGTTCG CCTGGCCGGC GCGCTGCTGT CACGGGCCAT ACTGGCGCGC GCCACGCTGT ACGGCGCCGA CCTGCGGGAC GCCGACCTGC GTCAGGCCAA CCTCGCCCGC TGCGACCTGC GCGGCGCCTG CCTGCGCGGG GCCAATCTGA GCGGCGCCGA CCTGTCGGGC TGCGACCTGC GCGAGGGCGT CACCGCCACC CAGGATTCGG CCGAGGGCTT CAAGATTCTC GGCCATCGAA CCCGTTCCGG CGAGTTGGAC TACGCCCTGG CGCGCGGGGC CAAGCTGGCC GGCGCCCAGA TGGGCGGGGC CTTCGCCCAG GCCGTCGACC TGACCGACGC CGACCTGACG GGCGTCAGCC TGCAGGGCGC CCGGCTGACC CGGGCGGTGC TGAACGGCGC CAACCTCTCC CAGGCCAATC TCTATAACGC CGACCTGACC GGCGCGTCGC TGCGGCGGGC GGTGCTGACC GGGGCCGACG TCGCCGGCGC GTCCTTCGAC GGCGCGGACC TGGCCGAGGT GCTGCGCGCC CCGCCGCCGA TGATCTATGT CGATGACGCG CCGCTGCACG AAATCCTCGA GGCGCACGAA CTGTTCGTGA CCAGCGACGG CCGCGACGGC GCGACGGCCA AGACCCCCTC GGTGGACTTC CGGCCCCTAA GGCGCCTGAA GGGCCGGCGG CTCAGCGGCC TGTCGGCCCC CGGCGCGATC TTCTTCGGCA TGAACCTGGA AGGCGTCCAG CTGCAGGGCG CCAACCTGGC CGACGCCGAT CTGCGCGGGG TCAATCTGCG GGGCGCCGAC CTGCGGGGCG CGCGGCTGGT GGGGGCGCAA CTGTCGCGGG CCGACCTGAC CGGCGCCAAC CTGGGACCGC TGGCGATCGC CCAGGGCCGC GTCCTGCGCG CCGACCTCAG CCGCGCGGTC CTGCGCGGCG CCGATCTGAC CGGGGCCAGC GCTCGCCGCG TCCGCCTGAT CGACGCCGAC CTGACCCGCT GCAAGCTCGA AGGCTGCGAC CTGACGAGCG CCGAACTGCC CGAGGGCTTC GGGGCGTAG
|
Protein sequence | MSAAASSAPT SLIRSLPPRA LKAAIEAHTR YLKGRPGGTR ANLSYVDLDN VNLEGVDLSE ADLTGVRLAG ALLSRAILAR ATLYGADLRD ADLRQANLAR CDLRGACLRG ANLSGADLSG CDLREGVTAT QDSAEGFKIL GHRTRSGELD YALARGAKLA GAQMGGAFAQ AVDLTDADLT GVSLQGARLT RAVLNGANLS QANLYNADLT GASLRRAVLT GADVAGASFD GADLAEVLRA PPPMIYVDDA PLHEILEAHE LFVTSDGRDG ATAKTPSVDF RPLRRLKGRR LSGLSAPGAI FFGMNLEGVQ LQGANLADAD LRGVNLRGAD LRGARLVGAQ LSRADLTGAN LGPLAIAQGR VLRADLSRAV LRGADLTGAS ARRVRLIDAD LTRCKLEGCD LTSAELPEGF GA
|
| |