Gene Information |
Locus tag | Caul_1804 |
Symbol | |
ID | 5899259 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1906513 |
End bp | 1907463 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641562294 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001683431 |
Protein GI | 167645768 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
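The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1906513, 1907463)
print(length)                     # 951
print(protein_length_aa(length))  # 316
```

Both results match the Gene Length (951 bp) and Protein Length (316 aa) fields.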
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0697858 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCCT GGTCGTTCAG CACCGACAGC CATCCTCGCG CCGAACGGGC CGAGGCCTGG CGCGAGGCGA TGAGCCGGCT GGGCCTGCCG ATCGAGGGCT TGTCGGACGC CGAGCCGGCC AGCGCCTCGG TCGTCTGCCT GACCTCTCCC CTGGGCATCG AGTTCGCCCT GGTCGAGGCC GGGGCGCAGG CGATCTCCGG CCGGCTCAGC GGCCAGCCGG CCGCCGTCTG GCTGGCGGTG CTGCTGAGCG GCGAGGCCGC GCTGGTCACG GACGACCTGG CTGAAGAGTT GACGCCTGGC GACATCGCCT ACGGCCCGAC CGGCCAGGCC GCGGCCCTGC GGCTGGAGAC CCGCTGCCGG CTGATGTTCG TCCGCGCCCC CCGCGTGGCC CTGGACCACC GGCTGATCGC GCCCGTCAAC CTGCGGGTGG GGCGGCTGGA ATGCGCGACC GGCGTCGCCC ACATCCTGTC GGGCCTGCTG CGCGCCACGG CCGACGGGCT GGAGGACCTG ACCGTCGACC AGCTTCGTCC CGTCGAGCTG GCCCTGACCG AGTTCCTGGC CATCTGCCTC GTCGAGGGCG GGGCGACGAC CGATATCCTG GGCGCGGGGA GCGGCGCCCC GACCGCCCAC CTGCAGCGTC TCTGCCAGAC CATCGAGACC CTGCTGCCGG ACCCGGACCT GTCCCTGCGC CGGGTGGCCG ACGAGGAGGG GGTCTCGCCC CGCTACGTCC AGAAACTGTT CGCCAGCGCC GACGAGACCT TCAGCCACTA TCTGCGCACC CGGCGCCTGG AACGCTGCCG CACGGACCTG GCCAGTCCGC AGCACGCGCG GCTGTCGATC TCCGAGATCT GCTTCCGCTG GGGCTTCAAC GGTTCGGCCC ACTTCAGCCG GGCGTTCCGC GACCAGTACG GACAGTCGCC CCGCGAATTT CGCCAGAGCG CGGCGGCTTA G
|
Protein sequence | MKAWSFSTDS HPRAERAEAW REAMSRLGLP IEGLSDAEPA SASVVCLTSP LGIEFALVEA GAQAISGRLS GQPAAVWLAV LLSGEAALVT DDLAEELTPG DIAYGPTGQA AALRLETRCR LMFVRAPRVA LDHRLIAPVN LRVGRLECAT GVAHILSGLL RATADGLEDL TVDQLRPVEL ALTEFLAICL VEGGATTDIL GAGSGAPTAH LQRLCQTIET LLPDPDLSLR RVADEEGVSP RYVQKLFASA DETFSHYLRT RRLERCRTDL ASPQHARLSI SEICFRWGFN GSAHFSRAFR DQYGQSPREF RQSAAA
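The sequences above are split into 10-character blocks, so the spaces need to be removed before any length or composition check. The sketch below shows one way to do that and to compute GC content; the helper names are hypothetical, and the input is only the first two blocks of the Gene sequence field, used as a toy example rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input: the first two blocks of the gene sequence, for illustration only.
example = "ATGAAGGCCT GGTCGTTCAG"
seq = clean_sequence(example)
print(len(seq))         # 20
print(gc_percent(seq))  # 55.0
```

Pasting the complete Gene sequence field into `example` would let the result be compared with the Gene Length and GC content fields of the record.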