Gene Information |
Locus tag | Caul_3779 |
Symbol | |
ID | 5901241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4096200 |
End bp | 4097219 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564302 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001685404 |
Protein GI | 167647741 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
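The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a stop codon that is not counted in the protein length (the function name `feature_lengths` is hypothetical, not part of the database):

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(4096200, 4097219)
print(gene_len, protein_len)  # 1020 339
```

Both values agree with the Gene Length and Protein Length rows of the record.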
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0505426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACATTT TCAGCACGGA CGATCTGACG GGCCGCTCAT CTCTCGTCGA GTGGGCAGGG CTGTTCGCGG GGCGCATCGA GGCGATGGAT TTCAAGCCCA GCAGGCCAGA GACGTTCACG GCCCAGCTGG CGGGCAGCGA TCTGGGTCCC CTGCATCTTG CCCGTCTCGC CTGCGCGAAA ACCACGATCG AGCGGGGCGA AGGGCATCTC GCGCACAAGA CTTCTCCGGC CTATTTCCTG ATGCTGCTGA TTGGCGGCAG CGGCGAGATC AGCCACTACG GCAACACGAT CACGCTCGAG GAAGGTGATT TCGTCCTGTG CGACAGCACC GCGCCGCTGA AGATCGCGTT TCCGGATGAC GCCGAGGCGC TCTTCCTGAA GGTCGGGGCG TCGACGCTCA AGGAGCATCT CCCGTCGCCC GAATGCTTTT GCGGACGGGC GCTCCGCGCG GACGAGGGCC TGACGGCGAC GGCGGGGGCC ATGGCGCTCA ATCTGTTTGG CCGCCTCGAA GCGGGGCTGG CGCTGCCCTA TCAGGACCGC GTCGCGCGGC ACCTTCTCGA TATCCTGGCG ATGGCCTACG CCCTGGCCTT CGACACGCCC ACGTCGCGGT CCTCGATCGT CAGCGGCCGG TGCGCGACGG TGAAGCTGTT CATCGAGCAG CATCTACGTG ATCCCGACCT GACGCCGTGC TCGATCGCGG CGCGGATGAA GCTGTCGTCG CGCTATCTGC GGATGATCTT CGCCAGCGAG AACGAGACGG TTTCGGCCTA CATCCTGCGC CGGCGCCTGG AGCAGTGCGC CCGGCAGATC GCCGACCCCG CCTGGCGAGG CCACTCGATG ACCGAGATCG CGTTCGGCTG GGGCTTCAAC AGCGCGCCCC ATTTCACCCG CACCTTCCGC GACCGCTACG GCATGCCGCC CCGCGAATAC CGACGCCTCA AGCTGGACGA GGGGGCCGGG GCTTTCCGGT CCGAATCGGC GCCGCGCCGC GCGGCGCAGC CCGGCGCCCG GGCGGCGTGA
|
Protein sequence | MNIFSTDDLT GRSSLVEWAG LFAGRIEAMD FKPSRPETFT AQLAGSDLGP LHLARLACAK TTIERGEGHL AHKTSPAYFL MLLIGGSGEI SHYGNTITLE EGDFVLCDST APLKIAFPDD AEALFLKVGA STLKEHLPSP ECFCGRALRA DEGLTATAGA MALNLFGRLE AGLALPYQDR VARHLLDILA MAYALAFDTP TSRSSIVSGR CATVKLFIEQ HLRDPDLTPC SIAARMKLSS RYLRMIFASE NETVSAYILR RRLEQCARQI ADPAWRGHSM TEIAFGWGFN SAPHFTRTFR DRYGMPPREY RRLKLDEGAG AFRSESAPRR AAQPGARAA
|
| |
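The gene and protein sequences above are printed in blocks of 10 characters. A minimal sketch of how they relate, assuming the spaces are layout only and that translation table 11 uses the standard amino acid assignments with alternative start codons such as GTG read as Met at the first position. The helper names (`clean`, `gc_percent`, `translate_cds`) are hypothetical and the codon table is built by hand rather than taken from a library:

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block-of-10 spacing and normalise case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for idx in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[idx:idx + 3]
        if idx == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "GTGAACATTT TCAGCACGGA CGATCTGACG"
print(translate_cds(prefix))  # MNIFSTDDLT
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate_cds` would give the values to compare against the GC content and Protein sequence rows.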