Gene Information |
Locus tag | Caul_2141 |
Symbol | |
ID | 5899596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2316260 |
End bp | 2317282 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641562631 |
Product | LacI family transcription regulator |
Protein accession | YP_001683767 |
Protein GI | 167646104 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0418584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0217452 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA TCCACGATGT GGCGCTGCAA GCGGGCGTGT CGCCAAAGAC CGTCTCGCGG GTGCTGAACG ATCACGAGAG CGTCACCGCC AAGACCCGCG AGCGCGTACG CGGCGCCATG CAGGCCCTCG ACTATCATCC CAACGCCGTG GCGCGCGGCC TGCGCTCGCA CGCCGCCCCG GCCGTCGGCA TCCTGATGGG CGACCCCAGC GGCGGCTACC AGACCCGCAT CCACCACGCC CTGATGGTCG CCTGCCTGCA GAACGGCCGC CACCTGTCGG CCGAGTTGGT CGAGGGCGAC ATGGCCGGCT GGCAGGATCG CATCCGCGCC TTCGTCACCG AGGGCGGGAT CCGCGAGATG ATCCTGCTGC CGCCCGAATG CGACTTCGCC CCGCTCAAGA CGCTGCTGCG CGAGCACGAC GTGCGCTGTG TGCTGATCTC GCCCACCAGC CCCGATTCGC AATCGCCCAG CATCGTGATG GACGACCGCG CCGCCGCGCG CGAGGTGGTC GAGCACCTGT TCAGCCTGGG CCATGAACGG ATCGGCCATA TCGCCGGCCA CCCGGACCAC GCCGCCAGCA CCCTGCGCCG CAATGGCTTC AACGAGGCCT ACGCCGCCGC CGGCAAGCCG CGCCCCGATC CGGCGCTGAT CGTACCCGGC GACTTCACGT TCAAGGGCGG CCTGGCCGGC GCCCAGGCCC TGCTGGACAT GGAAAACCCG CCGACCGCCA TCTTCGCGGC CAATGACGAC ATGGCGGCCG CCACCTGCAT GGAGGCCCAG CGCCGCGGCC TGCGCATTCC CGACGACCTG TCCGTGGTCG GGTTCGACGA CGCGCCGATC GCCGCCGCGA TCTGGCCGTC CCTGACCACG ATCCGCCAGC CCTTCGACCA GATGACCCAG CGGGCCATCA CCGCCCTCGG CGCCTGGAAC GCCAACGCGG CGCTCGGCAA GTCGGCGGCG ACGATCCTGA CAAAGCACAG TCTGGTCGTC CGCGAATCCA CCGGCCCCGT CAGGGCCGGG TAA
|
Protein sequence | MATIHDVALQ AGVSPKTVSR VLNDHESVTA KTRERVRGAM QALDYHPNAV ARGLRSHAAP AVGILMGDPS GGYQTRIHHA LMVACLQNGR HLSAELVEGD MAGWQDRIRA FVTEGGIREM ILLPPECDFA PLKTLLREHD VRCVLISPTS PDSQSPSIVM DDRAAAREVV EHLFSLGHER IGHIAGHPDH AASTLRRNGF NEAYAAAGKP RPDPALIVPG DFTFKGGLAG AQALLDMENP PTAIFAANDD MAAATCMEAQ RRGLRIPDDL SVVGFDDAPI AAAIWPSLTT IRQPFDQMTQ RAITALGAWN ANAALGKSAA TILTKHSLVV RESTGPVRAG
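
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, that the gene sequence is listed 5'→3' on the coding strand (it starts with ATG and ends with TAA), and that the amino acid assignments of translation table 11 match the standard genetic code. Only the first 30 bases of the gene sequence are translated, to keep the example short.

```python
# Values copied from the record above.
START_BP = 2316260
END_BP = 2317282
GENE_LENGTH_BP = 1023
PROTEIN_LENGTH_AA = 340

# Codon table in the conventional TCAG ordering. Assumption: translation
# table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


def translate(dna: str) -> str:
    """Translate a coding-strand sequence codon by codon ('*' marks a stop)."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # First three 10-base blocks of the gene sequence in the record.
    print(translate("ATGGCGACGA TCCACGATGT GGCGCTGCAA"))
```

The translated prefix can be compared with the first block of the protein sequence listed in the record.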