Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2515 |
Symbol | |
ID | 5899970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2729311 |
End bp | 2730432 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641563006 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001684140 |
Protein GI | 167646477 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGACC TGGTCCTTCT CGAAACCCTC GCGCGCGGAG TCGCCGTCGG CGGCTTTGGC GTCACCGGTC TGGCCCTGGT GATGGACCGG CGGCCCACGC CCGTGCGCTG GGTCGGCGGA TTGTTCTTCG CCTGCGCCAT CGCCCATGTG ATCGACGGCT TCAGGCTGGG CGGCGCGGTC CACGCCTTGC CGGTCATCTG GGCGATGTCG GTGGCGACCA CCGGCCTGTT CTGGACCTTG GCCTACGCCC TGTTCGCCGA CGAGCGACGC TTCTCGCCCC ATCGCCTGTG GCCAGCGGCG GGGCTGGTGA TGTTGTGGGC GCTGGCCCGG TTCATGCCCG AGGCGACCTG CCGACCCCTC TGGCTGGTCT TCAACCTGGT CTCCGTTGGC CTGGTGCTGC ACGCCCTGCT GGTGATCTGG CGCGGCTGGC GCGGCGACCT GGTGGTCCAG CGCCGCCGGC TGCGCGGGCC GGTGATGATC GCCGCCGCCG GCTACATCCT GTTGCTGAGC GCGCAGGACG TGGCGTGGGC GACGGGCCTG CCCTGGGTCC ATGCGTCCAG CCTGTTGCAG GCCTGTGTTC TGGCCGCCCT GGCCGTCGCC GGAGCCTCGG CCCTGCTTCG GGCCGAGCCC CTGCTGGTCG AGGCGCCGAT CGCGGCCGGC GGCTCCGCGC CCGCGGCCCC CTCGCCCGCC CTTGATCTCA CGCCCGCCGA TCGTCTCGTC CTGGCCCGGT TGCAGAACGC CATGGACGAA AACGAGGTCT GGCGCGGCGA GGACCTGTCG ATCGTGACCC TCGCGGCCCT GGTCGGCGCG CCCGAGCATC GTCTGCGCCG CCTGATCAAC GGCGTCCTGG GTCATCGCAA CTTCGCCGAC TACGTCAACG GACGACGCAT CGAGGCGGCC AAGACGGCGC TGGCCAACCC CGACCTGGCC CTGAAGTCTG TCTCGACGAT CGCCTACGAC CTGGGATTCG CCTCGCTGGG TCCTTTCAAT CGCGCCTTCC GCGCCGCGAC CGGCGTCACC CCGTCGGCCT GGCGGTCCGC CAAGACGCCT TCCCCGGTAT CGGCCCGCCT GCGGCTGGTC GAAACCGCCG ATCCCGCGTC GAAAGCCGAC AAGTCCGCCT GA
|
Protein sequence | MVDLVLLETL ARGVAVGGFG VTGLALVMDR RPTPVRWVGG LFFACAIAHV IDGFRLGGAV HALPVIWAMS VATTGLFWTL AYALFADERR FSPHRLWPAA GLVMLWALAR FMPEATCRPL WLVFNLVSVG LVLHALLVIW RGWRGDLVVQ RRRLRGPVMI AAAGYILLLS AQDVAWATGL PWVHASSLLQ ACVLAALAVA GASALLRAEP LLVEAPIAAG GSAPAAPSPA LDLTPADRLV LARLQNAMDE NEVWRGEDLS IVTLAALVGA PEHRLRRLIN GVLGHRNFAD YVNGRRIEAA KTALANPDLA LKSVSTIAYD LGFASLGPFN RAFRAATGVT PSAWRSAKTP SPVSARLRLV ETADPASKAD KSA
|
| |