Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2412 |
Symbol | |
ID | 4076738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2551834 |
End bp | 2552994 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638007734 |
Product | AraC family transcriptional regulator |
Protein accession | YP_614406 |
Protein GI | 99082252 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.462091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGTATG CCTATCTGCG AGGACTTCGG AATGCACGGC TACAGAACAA CAACGACGAA ACATCGACAG TTTTTCGTTC CTGTTCCGTG CCAACGTTCC AAAAGATGCG GAGCGCGTAC GGGAGGCGAG TGGTGAAGGA TGTTTTTGCC CCCAATGGTG GGGTCTTTAT CAACCGGCTG GTCGAATACG TAACCGGTCA GGGGCATGAC TTCGAGAACA TGCTAGCCCA AAAGCTCGCA ATCTCTGAAC CTCTGAGCTC GAGCGCTCAG GTCCTCCCTG TTTATGAACA CTCCTTGGTC TTCGGGCTGG CGGAGAGGGT GTGCAACGAC ACGTCGATTG GCTACGCTTT GGCCTATCAA TGCACGCTGC GTGATGCAGG ACTTGTGGGC TATGCGGTAA GCGCCTCGGA GACTGCTGGT GAGGCGCTGT ACACTCTAAG CCGCCTCAGC AACGTCTTTG AGGTCGTCTC TGCCTCCGCC AGTGGCGGGC TTGTGGATCT GCGCTGGGAC TTTGGATCGC AGGGTAAACT GGATCTGCGT CACTGGAGCG AGTTCATCGC TACCCTCTTG GTTCGTAGCT TAAAAACCCT CTGCACCGGT GCGGTTGCGC CGGTAGGGGT TGAGTTCACC CATACTGCGC CATCCTCCTC AGAGCAGGCG GTCCTGGCCT TTGGTGTCAA ACCAACCTAC CGCGGGCGCC TGAATCGTCT GACCTTTCGC GAACAGGATC TGCGCCAGCC CTTGCGTAGT GCAGATGCAG GCTTGCTGAG GCTCCTGCTG GAGCATGCGG AGCTGTTGCG CCGCCGCCCG GACAGGAACA GCAATGATCT GTCGATCACC GTTGAGCGTC TGATTATGGA CGGCATGTCA GAAGGAGATG CCAGCCTGGC GCAAGTGGCA GAGTCGCTGG ATATGAGCCA GCGCACGCTG TCCCGGAAGC TTGCCAGCGA GGGGACGAGC TTCTTTGCGA TCCTGGAGGG GGTGCGAAAA TCGCTGGCCC TGCGCTACCT CCAGCAAAAC GAGAAATCCC TCTCAGAGAT CTCCTTTGCT TTGGGCTACT CCAGTCTGAG CAGTTTCAAT GACGCCTTCA GGCGTTGGTA CGACCAAAGT CCCGGAAGCT ATCGGAGCGA TGCGCTCAAA GAGGCTGCTG TGAAGTCCTG A
|
Protein sequence | MGYAYLRGLR NARLQNNNDE TSTVFRSCSV PTFQKMRSAY GRRVVKDVFA PNGGVFINRL VEYVTGQGHD FENMLAQKLA ISEPLSSSAQ VLPVYEHSLV FGLAERVCND TSIGYALAYQ CTLRDAGLVG YAVSASETAG EALYTLSRLS NVFEVVSASA SGGLVDLRWD FGSQGKLDLR HWSEFIATLL VRSLKTLCTG AVAPVGVEFT HTAPSSSEQA VLAFGVKPTY RGRLNRLTFR EQDLRQPLRS ADAGLLRLLL EHAELLRRRP DRNSNDLSIT VERLIMDGMS EGDASLAQVA ESLDMSQRTL SRKLASEGTS FFAILEGVRK SLALRYLQQN EKSLSEISFA LGYSSLSSFN DAFRRWYDQS PGSYRSDALK EAAVKS
|
| |