Gene Information |
Locus tag | Caul_0624 |
Symbol | |
ID | 5898079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 690822 |
End bp | 691721 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561106 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001682255 |
Protein GI | 167644592 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
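The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 690822
END_BP = 691721
REPORTED_GENE_LENGTH = 900      # bp
REPORTED_PROTEIN_LENGTH = 299   # aa


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print("gene length matches:", computed_gene == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 691721 - 690822 + 1 gives 900 bp, and 900 / 3 - 1 gives 299 aa, which agrees with the table.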
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.401422 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATCGTCC GCAAACACAC GGAGAGGGCT CGATGCGATG CGGCGGGCCT GTTCGAAGCG CCAGCGACCC CGGGCAAGCA TGACGTCGGC GTGGCGGAGT TCCAGCTGCC GACGCCGCCC GTGGATCTCA CCCCCGTCGC GCCGCGGGTC GACGCCGTGA TCGTCAGGCT GCAACTGCGG GACTATCCCC GGCAACGCTA CTGGGAGGAC GGCGTGGCGG CGGCGGTCTG TGACGTGGCG GCCGGCCAGA CGCTGTTCCA CGATCTTCGG CGTGGACCGC GCCTGCTGCT CGACCAACCC TATCACGCCC TGCACTTCCA CATTCCCCGC GCCGCCTTCG ACGCGATCGC CGTCGAGTGC AACGCCGCGC CCATCGGCGA TCTGGATCAC CAGCCGGGGG TCGCCTTCCA CGATTCGACC ATCGCCAACC TCGCCGCCTC GGTGCGGTCC GACACCCGGG AAGGCTCGCA GCGGAGCCAG TTGTTCGCCG ACCACCTGAC CCTGGCCGTG GCCACCCATG TGGCGTCGCG CTACGGCGGC ATGGCGCCGA TGTCGCGCAG CGTGCGCGGC GGGCTGGCGC CCTGGCAGAC GCGGCGCGCC AAGGAGATCC TCAGCGCCAA TCTCGACGGC GGCGTGTCGC TGGCCGAGGT GGCCCGCCAG TGCGGCCTGT CGATCGGTCA CTTCTCGCGC GCGTTCCGCC AGTCGCTGGG CACGACCCCG CACCAGTGGC TGGTCCAGCG GCGGCTCGAC GCGGCCAAGG ACCTGATCCG CTCGTGCCGG ATGCCGCTGT CGACCGTCGC CCTCAGCTGC GGTTTCGCCG ATCAGAGCCA CCTGACCCGG GTGTTCACCC GCGAGGTCGG CGCCAGCCCC GCGGCCTGGC GGCGCGAAGT CCAGCAATAG
Protein sequence | MIVRKHTERA RCDAAGLFEA PATPGKHDVG VAEFQLPTPP VDLTPVAPRV DAVIVRLQLR DYPRQRYWED GVAAAVCDVA AGQTLFHDLR RGPRLLLDQP YHALHFHIPR AAFDAIAVEC NAAPIGDLDH QPGVAFHDST IANLAASVRS DTREGSQRSQ LFADHLTLAV ATHVASRYGG MAPMSRSVRG GLAPWQTRRA KEILSANLDG GVSLAEVARQ CGLSIGHFSR AFRQSLGTTP HQWLVQRRLD AAKDLIRSCR MPLSTVALSC GFADQSHLTR VFTREVGASP AAWRREVQQ
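The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below is one possible way to do that in plain Python, then compute the GC fraction and translate the coding sequence. It assumes that translation table 11 assigns codons to amino acids exactly as the standard genetic code does (the two differ in permitted start codons, which this sketch does not model); all function names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11 shares
# these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Codons not in the table (for example, ones containing N) become 'X'.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGATCGTCC GCAAACACAC GGAGAGGGCT"
print(translate(first_blocks))
```

Applied to the first 30 bases of the gene sequence, `translate` returns `MIVRKHTERA`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be checked in the same way.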