Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1212 |
Symbol | |
ID | 5898667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1274932 |
End bp | 1276194 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641561697 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001682840 |
Protein GI | 167645177 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.643735 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGG CGCAGACCCT TGCTGGACGG CGAAGGCTCA GCCAGGCCGA ACTGGACATG ATCGTCGCCG CGCACGAGAA ATTCGTCACC GGCAAGCAGG GCGGCAAGCG GGCCTCGCTG CGTTTCATGA ACCTGTCGGG CCTCGACCTG TCCTTCCGCA ATCTGGCCGA CGCCGATTTC TCCGCCTCCA TCCTCGACGG CTGCCGCATG GTCCGCACGC GGCTGGAACG CGCCAACCTG TTCGGCGCCG ACCTGCGCAA GGCCGACCTG CGCCAGGCGG TCCTGATCCG CGCCGACCTG CGCGGCGCCT GCCTGCGCGG GGCCAACCTG TCCCAGGCCG ACCTGACCCA GGCCGATTTC CGCGAAGGCC AGGTGGCCAT TCCGCATCCG CGCAAGGGCC TGGAGACCGT TCGCCACGAG ACCCGCACCG GCGAGGTGGA CGAGGTCAAT TTCTCGGGCG CGACGCTGGA CGGCTCGCAG TTCGCCGGCG TCTCGGCCTT CAAGGCCGAT TTCAGCGACT GCTCGTTGCG TGGAGCCAAG CTTGCGGGCG CCAACCTCAA GGAGGCCAAC CTGACCGGCG CCATCCTGGA TGGGGCCGAT GTCAAGGGCG CCAATCTGGA AGGCGCCAAC TTCACCGGCG CGGTGATGGC CGGCGTCGAC ATCTCCACCG CCCGCACCCA GGGCGCGGCC ATGCAGGGCT GTCTGACGGA CTCGACCGAG CGCGCGCTGT CGCGGGTCGA CGAGATCCTG GAGCGCTGCA TGGGCAACCA GGCCTGGTGC AAGACCGGCG GCAAGGAAGG CGCGCCCGCT CGCCTCGACG GCGAGGACCT GCGTCCGCTC GGCGACCGCC TCAAGGGGCT GCGCCTGACG GCGATGAGCG CCTCGGGGGC CTGCATGATC GGCCTGGACC TGTCCGGCGC CCAACTGCAG GGCGCCAATC TGCAGAACGC CGACCTGCGC TCGGCCAACC TGCGCGGCGC CGACCTGCGC GGGGCCAAGC TGTCGGGCGC GAACCTGACC AAGGCCGACC TGCGACAGGC GTTCCTGTCG CCCTTGCCGC TGGGGCCGGA ACGCAAGACC CTGGTCAATC TGAAGGCGGC GCGCCTGCGC TACGTCCAGT TCCAGGCGGC CGACCTCAGC GAGGCGGTGC TGGACGGCGC CGACCTGCGC GGCGCGGACT TCACCGGCGC GCACCTGGGC AAGGTGAGCC TGCGTGATTG CGACCTGACC CAGGTGCAGG GACTGGAGCT GGTTCCGGGC TGA
|
Protein sequence | MTAAQTLAGR RRLSQAELDM IVAAHEKFVT GKQGGKRASL RFMNLSGLDL SFRNLADADF SASILDGCRM VRTRLERANL FGADLRKADL RQAVLIRADL RGACLRGANL SQADLTQADF REGQVAIPHP RKGLETVRHE TRTGEVDEVN FSGATLDGSQ FAGVSAFKAD FSDCSLRGAK LAGANLKEAN LTGAILDGAD VKGANLEGAN FTGAVMAGVD ISTARTQGAA MQGCLTDSTE RALSRVDEIL ERCMGNQAWC KTGGKEGAPA RLDGEDLRPL GDRLKGLRLT AMSASGACMI GLDLSGAQLQ GANLQNADLR SANLRGADLR GAKLSGANLT KADLRQAFLS PLPLGPERKT LVNLKAARLR YVQFQAADLS EAVLDGADLR GADFTGAHLG KVSLRDCDLT QVQGLELVPG
|
| |