Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2399 |
Symbol | |
ID | 5899854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2608709 |
End bp | 2609680 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562890 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001684024 |
Protein GI | 167646361 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.33789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAAGG CCCTGGAAGA GCTGCGCCGC CTGGCCGCGC GCGCCGAGAA CCGTCGAACC GAGACCGGAA TACCGCGGCT GGCCATGGTG CGGGGCGCGA TACCCGAACA TGAACTGGCG GCCGTCTACG AGCCGATGAT CAACCTGATC CTGCAGGGAT CCAAGTCGAT GACGATCGGC GGCCAGACGC TGCGCTACGA CCCGGCGACC TATTTCGTGA TGTCGGTCGA CCTGCCGGCG ATCGGGGCCG TCCATTCCGC ATCGACCGGG GAGCCCTATC TCGCGGTCAG CTTGACCCTT GAGCCTTCCG TGATCGCCGC GATCCTCGAT CTGGCGCAAC CATCGGCGGG GCAGGGCTCT GGCTTTTCAG TCGCGACGGT GACGCCTGAG CTTCTGGACG CCTGGGTGAG GCTGATGGGG CTCATGGAGC GTCCGCTGGA CATTGCCGTT CTTGGCCCTG GTTATGAGCG AGAAGTCCTC TACCGGGTTC TGCAGGGACC CCAGGGCTGG ATGCTGCGTG ACATCGCCAC GCCCGATACG GCGCTTTCCC GCATCCACGC GGCTGTTCGC TGGATCCGCG AGAATTTCAA CAAGCCCCTA CGGATCGAAG ACCTGGCGCG CCAGGTGGCG ATGAGCGTCT CGGCCTTCCA CAGGCAGTTC AAGACGGTCA CGAACATGAG CCCCCTGCAG TTCCAAAAGA CCATACGCCT GATGCAGGCG CGCCGTTTGA TGACCACGGA GGCGCTGGGC GCCGCCGCCG TCGCAACGCG TGTCGGCTAC GAAAGCGCCT CTCAGTTTAG CCGCGAGTAC GCCCGTCTCT TCGGGCGCTC GCCCGCACGG GACGCCGCCG CTTTGGCGAG CGAGTGGAGA GCCGGGCGTC GCGTCGTGGA TGAGATTGGC TCCAAGCCGC CGCCGACGAC GCCCATCATG CTGATGACGC CGCCGGCGGC GATCCTGGTC GTTAACGACT AG
|
Protein sequence | MEKALEELRR LAARAENRRT ETGIPRLAMV RGAIPEHELA AVYEPMINLI LQGSKSMTIG GQTLRYDPAT YFVMSVDLPA IGAVHSASTG EPYLAVSLTL EPSVIAAILD LAQPSAGQGS GFSVATVTPE LLDAWVRLMG LMERPLDIAV LGPGYEREVL YRVLQGPQGW MLRDIATPDT ALSRIHAAVR WIRENFNKPL RIEDLARQVA MSVSAFHRQF KTVTNMSPLQ FQKTIRLMQA RRLMTTEALG AAAVATRVGY ESASQFSREY ARLFGRSPAR DAAALASEWR AGRRVVDEIG SKPPPTTPIM LMTPPAAILV VND
|
| |