Gene Information |
Locus tag | Caul_4073 |
Symbol | |
ID | 5901535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4412209 |
End bp | 4413726 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641564594 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001685696 |
Protein GI | 167648033 |
COG category | [F] Nucleotide transport and metabolism [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [COG2169] Adenosine deaminase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGAC CCATGGATCT CGACCCCGAC GTTTGCTACC GCGCCATCCA GACCCGCGAC GCCCGCTTCG ACGGCCGGCT GTTCGTGGCG GTCCGGACGA CGGGGATCTA TTGCCGGCCG GTCTGCCCGG CCCGCACGCC GCTGCGCCAG AACGTCACCT TCCACGCCAC CGCCGCCTCG GCCGAGGCGG CGGGCTACCG CGCCTGCCTG CGCTGCCGGC CCGAGACCTC GCCGCAACTG GGGGCCTGGA ACGGCGCGTC CAACACCGTC TCTCGCGCCC TGGCCCTGAT CGAGGCCGGC GCTCTGGATG GCGGCGACGT GGAAGGCCTG GCCGAGCGCG TCGGGGTCGG CGGGCGGCAA CTGCGGCGCC TGTTCCTGCG GCACCTGGGC GCGACGCCGG TCGGGGTGGC CCAAACCCGG CGGGTGCTGC TGGCCAAGCA GCTGATCCAC GAGACCGACC TGCCGATGGG CGAGGTGGCC CTGGCCGCCG GCTTCGGCAG CGTGCGACGC TTCAACGAGA CCTTCCAGCA GCTCTATGAT CGGCCGCCCG CCGCGCTGCG CCGTCGCAAG TCCGCCTCTC CCGTTGGTGA GCCGCCGCCC GGCGAGGCCG TCGCCCTGAC CCTGCGCTAC CGTCCGCCCT ACGACTGGGA CGCCATGCTG GCCTTCCTGG CCCTGCGCGC CATTCCCGGC GTCGAGGTGA TCGAGAGCAA TACCTACCGC CGGGTGATCG CCCTGGACGG CGCGGCCGGG ACCATCGCCG TCAGTCCGAT CGACGGCGAC CGGCTGAGCG TGGCGGTGCG CTTTCCCAAG CTTTCGGCCC TGCCCCGCAT CCTGGCCCGC GTGCGGGGGG TGTTCGACCT GTCGGCCGAC CCGGTCGGGA TCGCGGCGGT GCTGTCGCGC GATCCGGACC TGGCGCGGAT GGTCGGCCTG CGTCCCGGCC TGCGCGTGCC CGGGGCCTGG GACGGGTTCG AGCTGGCGGT GCGGGCGATC CTGGGCCAGC AGATCACCGT CGTTCAGGCC CGCAAGCTGG CCGGCGACCT GGTCGCGGCG CACGGCGAAC CGCTGGCGCA GCCCTGGACC GAGCCCGGCC TGACCCACGC CTTCCCGTCG GCCGAGCGCC TGGCCGCCAC CAATCTCTCA GGCATGAAGA TGCCCGGGGC CCGCATTCGC TGCCTGTCGG CCATGGCCCA GGCCATCGCC GACGCCCCCA ACCTGCTGTC GCCGACCGCC GGCCTGGACG AGATGGTTCG GCGGCTGCGC GCCCTGCCGG GTATCGGCGA ATGGACGGCG CAGTACATCG CCATGCGCCA GCTACGCGAA CCTGACGCCT TCCCCGCCGC CGACGTCGCC CTGATGCGCG CCCTCGCGGA CGTCGACGGC GTTCGTCCGA CAGCGGAGCA ACTTCTGACC CGCGCCGAGG CCTGGCGACC GTGGCGGGCC TACGCCGCCC TGCACCTGTG GGCCTCGCTG GCGGATGAAG GCGCGCCGCC CGTTCGGAAG GTGAAGCGTG CGGCCTGA
|
Protein sequence | MIGPMDLDPD VCYRAIQTRD ARFDGRLFVA VRTTGIYCRP VCPARTPLRQ NVTFHATAAS AEAAGYRACL RCRPETSPQL GAWNGASNTV SRALALIEAG ALDGGDVEGL AERVGVGGRQ LRRLFLRHLG ATPVGVAQTR RVLLAKQLIH ETDLPMGEVA LAAGFGSVRR FNETFQQLYD RPPAALRRRK SASPVGEPPP GEAVALTLRY RPPYDWDAML AFLALRAIPG VEVIESNTYR RVIALDGAAG TIAVSPIDGD RLSVAVRFPK LSALPRILAR VRGVFDLSAD PVGIAAVLSR DPDLARMVGL RPGLRVPGAW DGFELAVRAI LGQQITVVQA RKLAGDLVAA HGEPLAQPWT EPGLTHAFPS AERLAATNLS GMKMPGARIR CLSAMAQAIA DAPNLLSPTA GLDEMVRRLR ALPGIGEWTA QYIAMRQLRE PDAFPAADVA LMRALADVDG VRPTAEQLLT RAEAWRPWRA YAALHLWASL ADEGAPPVRK VKRAA
|
| |
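The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed above but are not stated by the record itself. The function names (`clean_sequence`, `gene_length`, `protein_length`) are hypothetical helpers introduced only for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Caul_4073.
length_bp = gene_length(4412209, 4413726)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's coordinates, this reproduces the listed Gene Length (1518 bp) and Protein Length (505 aa). `clean_sequence` can be applied to the Gene sequence or Protein sequence field before any further processing, since both are displayed in space-separated blocks.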