Gene Information |
Locus tag | BBta_5598 |
Symbol | |
ID | 5153891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 5823852 |
End bp | 5824901 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640560341 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001241463 |
Protein GI | 148256878 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.165566 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGC AGATCCGAAC CACATCGTCG TCCTGGATGC GTGGCGTGGC GACCACGCTG AAGGAGCAAG GTCTCGATGC GGCCATGCTG TTTGCGGAAG TCGGCCTGTC GATCGAGGAT CTCGACGATT GCGATCGGCG CTGGCCGACC GAGGCGCTGA GCCGGTTGTG GGTGCTGGCG GCGGAACGGT CAGGCAATCC CGATATCGGG CTGGCGAATG TCGACGCGCC GCGGCCGGAT CATTATGGCG TCGCGGGCTA CGCCATGATG TCGAGCCCCG ACCTGCTGAC CGGGCTGACG CGCCTGATCC GCTATATGTG CCTGGTGAGC GACGCCGTGA CCATTACGCT CGAGCACGGC CACGGCGGGC GCTGGGTACG GACCGACGTG TTCGGCGGCG AATGTCCGAT CCCGCGACAG CGCTACGACT ACGGCGTCGT CAATCTGCTC AATCTCTGCC GGTGGATGCT GGACCGTCCG CTGACGCCGC TCGCCACCCG CTTCTCGCAC GCGGTGCCGC TCTCGATTGC CGCCTACAAC AACGCCTTCC AGTCACCACT GGAATTCGAT GCGCCGTTCA ACGGCTTTCT GGTATCGGAG CAGGATCTCG CCTGCAGGCT CACGACATCA GCCCCCGAGC TGACCGCGAT CCATGATCGC ATCGCCGACG AAGCGCTGGA GCGGCTGGTC AAGACCGATA CGGCCTACCG CGCGCGGGCC GCAATCGCGC GGCTGCTGCC CGATGGCAGC CCGCTGCGCT CGGCGATTGC GGCCGCGCTC GGCCTGAGCG ATCGCACGTT TCAGCGCCGC CTTGCCGACG AGGGCCTGTC CTTCAGCGAT CTCGTCGACG GCACGCGGCG CGAACTCGCG CAGCGTCATC TCGCCGACCC GCGCCTGACG CTGACCGACA TCGGCTATCT GCTCGGCTAT TCCGACCAGA GCACGTTCTT CCGGGCCTCC AACCGATGGT TCGGCGAGTC GCCGGGCGAA TACCGCACGC GCGTGATGAA CGGCCACCGC CAGGACAAAT CGCTCGCGCG CGGGGCCTAG
|
Protein sequence | MAQQIRTTSS SWMRGVATTL KEQGLDAAML FAEVGLSIED LDDCDRRWPT EALSRLWVLA AERSGNPDIG LANVDAPRPD HYGVAGYAMM SSPDLLTGLT RLIRYMCLVS DAVTITLEHG HGGRWVRTDV FGGECPIPRQ RYDYGVVNLL NLCRWMLDRP LTPLATRFSH AVPLSIAAYN NAFQSPLEFD APFNGFLVSE QDLACRLTTS APELTAIHDR IADEALERLV KTDTAYRARA AIARLLPDGS PLRSAIAAAL GLSDRTFQRR LADEGLSFSD LVDGTRRELA QRHLADPRLT LTDIGYLLGY SDQSTFFRAS NRWFGESPGE YRTRVMNGHR QDKSLARGA
|
| |
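
The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon (the gene sequence ends in TAG). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Values copied from the Start bp and End bp rows of the record.
    length = gene_length(5823852, 5824901)
    print(length, protein_length(length))
```

With the coordinates from the record this gives 1050 bp and 349 aa, matching the Gene Length and Protein Length rows. Passing the full Gene sequence string to gc_fraction would allow a similar check against the GC content row.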