Gene Information |
Locus tag | Caul_1211 |
Symbol | |
ID | 5898666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1273555 |
End bp | 1274814 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641561696 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001682839 |
Protein GI | 167645176 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| |
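The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon. Both are assumptions, although they agree with the listed 1260 bp and 419 aa. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record for Caul_1211.
length = gene_length_bp(1273555, 1274814)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```

The same check works for any unspliced CDS. A spliced gene would need its exon lengths summed instead.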
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.645913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCC CGCCCGCGGC AGGCCAAGAG CACAGGCGCC TGACGCAACA CGACGTGGAT CTGATCTGCG CCAAGCATGA CCGCCTATGG TCGGCTCGGC TCGGGGGGGC GCGCGCGGTG TTCGCCTTCT GCGACCTGTC GGGGCTGTCG GTCACGGGCC GCAATCTCTG CGACGCCGAC TTCACCGGCG CCCTGCTGAT CAATTGCGAT ATGCGGCGCG TCAAGCTCGA CAACGCCAGC CTCTACGGCG CCGATCTGCA TGGTTCCGAC CTGACCGACG CCTCGATGCG TCGCTGCGAC CTGCGCGGCT CCAGCCTGCG CGGCGCCAAC CTGACCGGCG CCGACCTGTT CGAGGCCGAC CTGCGCGAAG GTCAGATCGC CGCCGCAGAC CGCAAGGAAG GCTATCGCGT CATCGAACCC ATGCAGCGCG AGGCCCAGGC CCACGGCGCG ATCCTGGCCG GGGCCAATCT GGAACGCTCG CGCCTGTCGG GCATCATCGC CACCAAGGCC GACTTCAGCG ACGCGATCCT CAAGGACGCC AAGCTGGTGC GGGCCAACCT CAAGCAGGCC AATTTCAACG GCGCCAACCT GGCGGGGGCC GATCTGTCGG GCGCCAACCT GACCGGGGCC GACCTGCGCA ACGCCGTCCT GGTGGGCGCC AAGACGATGT CGTGGAACAT CAGCGACACC AATCTGGACG GGGCGCTGAC CGACAAGTCC TCCGGCACCG ACGTGGCGGA CATGCCCTAC GAGCAGATGA TCGCCGACCA CGCCCGCTGG TGCGAGACCG GCGGCGCCGA GGGCAAGCCG TCGGTGTTCG ACAAGGCCGA CCTGCGCAAC CTGAAATCCG TGCGCGGCTT CAACCTGACC GCTTTGTCGG CCAAGGGGGC GGTGTTCTAC GGCCTGGACA TGGAAAGCGT GCAGATGCAG GGCGCCCAGC TGGAGGGCGC GGATCTGCGG TCCTGCAACC TGCGCCGGGC CGACCTGCGC GGGGCGCGGC TGAAGGGGGC CAAGCTGGCC GGATCCGACC TGCGCGAGGC TCAGCTGGGG CCGCTGCTGA TCGGCCGCGA CCGCCTGCTG CCCAGCGACC TGACCGGCGC CCTGCTGACC AACGCCGACC TGGCCCGCGC CGACCTGCGC CAAGCCTGCC TGAGCGGCGC GGACCTGTCG CGCGCCAACT TCACCAACGC GATGCTCAAG GATGTCGACG TCACCGGCGC GATCCGCACC GGCGCGCGCG GCCTGGACGA CGTGATCTAG
|
Protein sequence | MSAPPAAGQE HRRLTQHDVD LICAKHDRLW SARLGGARAV FAFCDLSGLS VTGRNLCDAD FTGALLINCD MRRVKLDNAS LYGADLHGSD LTDASMRRCD LRGSSLRGAN LTGADLFEAD LREGQIAAAD RKEGYRVIEP MQREAQAHGA ILAGANLERS RLSGIIATKA DFSDAILKDA KLVRANLKQA NFNGANLAGA DLSGANLTGA DLRNAVLVGA KTMSWNISDT NLDGALTDKS SGTDVADMPY EQMIADHARW CETGGAEGKP SVFDKADLRN LKSVRGFNLT ALSAKGAVFY GLDMESVQMQ GAQLEGADLR SCNLRRADLR GARLKGAKLA GSDLREAQLG PLLIGRDRLL PSDLTGALLT NADLARADLR QACLSGADLS RANFTNAMLK DVDVTGAIRT GARGLDDVI
|
| |
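The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own method, that strips the whitespace, computes the GC fraction, and translates a coding sequence. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the tables differ in which codons may act as starts). It does not handle alternative start codons such as GTG or TTG, which does not matter here because the gene starts with ATG. All names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGAGCGCCC CGCCCGCGGC AGGCCAAGAG"
print(translate(prefix))  # MSAPPAAGQE
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on the record.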