Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1196 |
Symbol | |
ID | 4076331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1286849 |
End bp | 1287859 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006502 |
Product | AraC family transcriptional regulator |
Protein accession | YP_613191 |
Protein GI | 99081037 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCG TGACCAATGC TTTTGCGCGC GCCATGGCCC AAGCGGGCGG GATGACGCTG ACCGACGATG GAGAGCTGAT CTCTGGCGGT ACTGTGGTGC GCCGACTTCC CAAACCTGCT GATGGCAAAC TCTCTGAGAC CGATTATTTC GATCTTCTGG ATTGGATCCG GTTCCAGCAA AAGGACGAGA TGGCACTGCT TGCGGCCTAT GCCAAACTGA TCCGGGCGGA TGATATTGGT GTGCTGGGCC TTGCGATGAA AACCGCGCCA ACCCTGCGGG CATCGCTCGA ACGTTTGGAA CGGTATTGGC AGGTTGTCAC GGATACTGCG ATCTATCGAC TGGACACCTC CCAAGATCCG GCGCTCTTGA TCTTTGAGGC GCGCACGGGG CATCATCCTG TGCTGGATTT TCGCAACGAG GGCGCCTTTG CCGGATTGGC ACGGAATATG CGTCTGTTTG TCGAGGGGGA TCTGGTTCTG GACTATGTCA CCTTCAGACA TGCCTGCCGC AGCGATCCAG AGCAGTACCA GGCGCATTTT GGCTGTTCCG TCCGCTTTGA TGCGGAGCAA AACGTCATCG CGCTGCGCAA AGAGATGCTT GATCTGCCAA ACCGGTTGGG CGATGCGGCG GTTTCGGACT TTCTGACCAC GCACTTAGAG ACCGAGATAG GCACGCTACA GGATGAAACC TCGGTACGTG CGGGGCTTTT GCGACTGTTG ACGCCTGCGC TCAGTAACGG GGTGCCGCAG GCCGCAGATG TGGCCCGCGA GATGGGCATG AGCGAGCGCA CGCTCTACCG GCGCCTTGCG GACGAGGGGC TTACGTTTCG CGACGTGCTG ACCGAAGCTC AGTCGTCTCT GGCGCAGGAG CTTTTGAAAG ACAGCCGCAG CTCGATCGCG GAGATCGCCT TTTTGACCGG GTTTTCGGAG CAGAGCACCT TTAGCCGTGC CTTCAAGCGC TGGGTCGGGC AAGCGCCAGC GCAGTTTCGA CAGCAGTTCC CATCGCCCTG A
|
Protein sequence | MPTVTNAFAR AMAQAGGMTL TDDGELISGG TVVRRLPKPA DGKLSETDYF DLLDWIRFQQ KDEMALLAAY AKLIRADDIG VLGLAMKTAP TLRASLERLE RYWQVVTDTA IYRLDTSQDP ALLIFEARTG HHPVLDFRNE GAFAGLARNM RLFVEGDLVL DYVTFRHACR SDPEQYQAHF GCSVRFDAEQ NVIALRKEML DLPNRLGDAA VSDFLTTHLE TEIGTLQDET SVRAGLLRLL TPALSNGVPQ AADVAREMGM SERTLYRRLA DEGLTFRDVL TEAQSSLAQE LLKDSRSSIA EIAFLTGFSE QSTFSRAFKR WVGQAPAQFR QQFPSP
|
| |