Gene Information |
Locus tag | Bind_3528 |
Symbol | |
ID | 6200630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | - |
Start bp | 4007298 |
End bp | 4008236 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641707483 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001834574 |
Protein GI | 182680428 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
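The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Bind_3528.
length_bp = gene_length(4007298, 4008236)
print(length_bp)                  # 939
print(protein_length(length_bp))  # 312
```

Both results match the "Gene Length" and "Protein Length" fields of the record.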
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.680996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.733506 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGCCT CTCGAGTCGA TTTGTTGTCC CAAATGCTCG ACCTCGTACG GCTACACGGG GAATTGGTGT TTTCCGCCGA GTTGAGCCAC CCATGGGCGC TGCGGTTCGA GCCGGGGTCG GCCTATTTCT TCGTGGTCCT GGAAGGAAGC TTGATCGCGC AGGCGGCCGA CGCTCCCCCC GTGAAGGCGG TCGCAGGCGA CCTCGTCATG CTGCCGCGAG GAACCGGCCA TATCCTCGGC GATGGCAGCG ATGCGACTGC GGCAGACGCC GCCGACTTGA TGAGAGAGCA GTTCACGGCA GAACAGCTGG GTCTTCGCCA TGGCGGCAAT GGCGAACAGA CCAGGGTTAT CGCCGGGGCA TTCCATTTCG AGAGCACCGC CGTGCCATGG GTCGTCTCAG CCCTTCCGGC CGTGATCCAT ATTGCAAAGT CGGGCGGCCA GACTGGCGGA TGGCTCGAGG GATTGGCCTA TTTCATGATG ATGGAAGCCC AGGTGGTGCA CCCCGGCTCT TCGGTCATGA TCTCGCGCCT CATCGACGTT CTGATCATCC GCGTCATCCG AACCTGGGCG CAGACCAAGA ACGCCAGCGA CACAGGATGG CTGGGCGCGT TGGGGGACCC CCGCATCAGC CGCGCCCTCA AGGCCATTCA CGACGAACCT TTCCGCAAAT GGAGCGTGGC CGACCTGGCA AACGCGGCCG GAATGTCCAG GTCCAGCTTT GCCGAAAGGT TCTCTTCCCT GGTCAAGGAA GCGCCGTTGT CCTACCAGAA TCGATGGCGG CTCACCCTTG CTCACGGGCT TCTCAGCCAG GCCAATGCCC GTGTCGGCGA TGTCGCCAGG CAGGTGGGTT ACGACTCCGA TGCGGCTTTC AGCCGAGCGT TCAAAGCTCA ATTCGGGATT CCTCCCGCCG GTATTAAATC AGCAGTTTCA CAGTCATAG
|
Protein sequence | MLASRVDLLS QMLDLVRLHG ELVFSAELSH PWALRFEPGS AYFFVVLEGS LIAQAADAPP VKAVAGDLVM LPRGTGHILG DGSDATAADA ADLMREQFTA EQLGLRHGGN GEQTRVIAGA FHFESTAVPW VVSALPAVIH IAKSGGQTGG WLEGLAYFMM MEAQVVHPGS SVMISRLIDV LIIRVIRTWA QTKNASDTGW LGALGDPRIS RALKAIHDEP FRKWSVADLA NAAGMSRSSF AERFSSLVKE APLSYQNRWR LTLAHGLLSQ ANARVGDVAR QVGYDSDAAF SRAFKAQFGI PPAGIKSAVS QS
|
| |
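The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the coding sequence. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAG), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order at each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence in the record.
head = "ATGCTCGCCT CTCGAGTCGA TTTGTTGTCC"
tail = "CAGTCATAG"
print(translate(head))  # MLASRVDLLS
print(translate(tail))  # QS
```

The two printed fragments agree with the first block and the final two residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the "GC content" field.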