Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2695 |
Symbol | |
ID | 4077002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2836696 |
End bp | 2837706 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638008020 |
Product | AraC family transcriptional regulator |
Protein accession | YP_614689 |
Protein GI | 99082535 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.4326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.587426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGAA ACCCCGCCTC TCCCGCTGCA TCCGATCACA GCGATGACAC CCCGCGACTG GCGGTGGAAA TATTTGTGCA GCCGGGATTT TCCCAGCTCG AGCTTTCGTT GATTCTCGCT GTGTTTGAGG CGGCAAACGC AATGGAGACC GGCATCTGGT TCTCCGTTCG CATCACCTCT GACAGCCCGG GCGTGGTGAC AGGCGGCGCG GGCATGATGG TGCGGGCAGA ACCTGCGATT GGTCTTCAGT ATCTTCAGGA TCTGATGTTT GTGGTGGGGG GGCGCAATTG CAGCGGCGGC AGCTGGCTCG CACGGGCGCG CGCAATGCAG AAACTGCGTC GCCCGGTGTT CCTGCTGTCG GATGCAGCAA CCGCCTATAT CCGCAGATGT GCGCCGCTCT CGGGGCCCGC CACCACCCAT TGGCAAGACC TGCGCGCCCT GCGTGAGACC GGCGAATACC CCACGCTCAC CGATAGCCTC GTGGCGGAAA ATGCAGGCAT TCTGACCTCG GCCGGGGGGG GATATACGGC GGAAATGGTG GTGCGTCACC TCTCGCAGAT CCTTGCACCG CAACACTGCG CCGAATTGGC CAGCGTGTTG ATGATCGAAA CCGCTCGGGG TTACAGCGGA GAACAACCCA AAGGGGCCGC GCGCAACACC AATCTTCTGG AGGCGCGGCT GGTGCGCGCT ATGGCGATCA TGGAAGAATG CATCGAATAT CCCCTGTCCA CCGCAGAGGT GGCCGAGCGG GCGGGGATTT CGGTGCGGCA TCTGGAACGC CTGTTTCTGA CCCATCTCAA CACCACACCG GCCAAACACT ACATGCAGCT GCGCCTGAAG CTGGCCAACA AGCTCATCAC CGACACCAAC CTGCCGATTG CAGAGATCGC CTTTGCCAGC GGCTTTGCGT CCTCTACGTC GCTGTCGCGC GCGTATCGGC GTGAATATAA TATGACCCCC TATCAGGTGC GCGCCCGTGA TCGGGCCGGT GCGGGTCTGC GCGCGGACTA G
|
Protein sequence | MDRNPASPAA SDHSDDTPRL AVEIFVQPGF SQLELSLILA VFEAANAMET GIWFSVRITS DSPGVVTGGA GMMVRAEPAI GLQYLQDLMF VVGGRNCSGG SWLARARAMQ KLRRPVFLLS DAATAYIRRC APLSGPATTH WQDLRALRET GEYPTLTDSL VAENAGILTS AGGGYTAEMV VRHLSQILAP QHCAELASVL MIETARGYSG EQPKGAARNT NLLEARLVRA MAIMEECIEY PLSTAEVAER AGISVRHLER LFLTHLNTTP AKHYMQLRLK LANKLITDTN LPIAEIAFAS GFASSTSLSR AYRREYNMTP YQVRARDRAG AGLRAD
|
| |