Gene Information |
Locus tag | TM1040_2167 |
Symbol | |
ID | 4076766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2277923 |
End bp | 2279320 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638007489 |
Product | GntR family transcriptional regulator |
Protein accession | YP_614161 |
Protein GI | 99082007 |
COG category | [E] Amino acid transport and metabolism; [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
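As a quick consistency check on the record above, the gene and protein lengths can be derived from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2277923
end_bp = 2279320

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(f"Gene length: {gene_length_bp} bp")
print(f"Protein length: {protein_length_aa} aa")
```

Under these assumptions the derived values agree with the listed Gene Length (1398 bp) and Protein Length (465 aa).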
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.786725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAGCG ATTTGCCCGC CTTTATGCCG GATCGGCGTT CTGATGTGCC GATCTTTGAG CAGATCTGCA CGGCCTTGCG ATCTCGAATT CGCTCTGGGG CTCTTGCCAC CGGGGCGCGG CTCCCGGCAA CGCGGGCGCT TGCAGGGGAT CTGGGGGTGG CGCGTGCGAC GGTCGTGACG GCCTATGAGC AGCTGGTGGC CGAGGGCTAT CTCATGGCGC GGCGCGGATC GGGTTATACC GTCTGTGCCA TGGGCGAGGT GGAGCTTCCG GCGAAAGGGA CGCCCACGCC ACCTGCGGAC CTCAGCGAGC GGGGAGACCT GCTCTTGGAG GCTGGCCAGC CGGACATGCG TCTCTTTCCG CACCGCGCCT GGGCAAAAGC CGTTGCTCGC CTCTGCCGCA CACGTCCAGA AGATATGTTG ATGGGCTGCG GGCGCTTTGG TCATCCGGAC CTGCGCCAGG CCATTGCCGC CCATGTGAAC GATTGGCGCG GGATCGCCGC GCGGCCGGAT CAGGTGCTGG TCACCTCTGG CGCGACCGAG GCGTTGGATG TTTGTCTCGC GGCGCTGGCC TCGAGCACCG GAGCGGTTGG GGTGGAGGAT CCCGGCTATC CCCCGATCCG ACGGTTTGCG CAGGCGCAGG GGCGTCGCAT CCGAGACTTG GCGCTCGATT CCCAAGGGGC CGAAGTCCCT ACAACTGGCG ATGGCACCGA TGTGGTGGTT CTGACGCCGT CGCATCAATA CCCGCTTGGT GGCGCGATGA GTGCGGGCCG GCGACAAGAG TTTGTGCGCT GGGCGGCGCA GTCACAGGGC TGGATCATCG AGGACGATTA CGATTCGGAA TTTCGCTATG CGGGCCGACC GATTCCCGCG CTGGCAGGTG TCGACGGGCA GGGGCGGACA GTCTACATCG GCAGCTTCTC CAAGATTTTC TCGAACAGCT TGCGGATGGG GTACATCGTG GCCCCCGAGG CGCTGGTGGA GCGCCTCGTG CGGGCGATGC GTCGCAAGGG TGCGCGTGCC AGTGCCATGC CCCAAGCGGC TCTGGCGGAG TTTATGCAGC ACGGAGAGTT CTACCGGCAC CTGCGCCGGG TGCGCCGAAT CTATGCGGAA CGGCACCGGA CCCTGATCGC GCGGTTGCGC TCGGATTTCT CTGATGTGGG CCATGTGGAG GATTATCAAG CTGGCATGCA GGTCGTGCTG CATCTGCGTC CCGAAATTTG CGATCAGACC GTGACAACAG CCGCACGCGC AGCCGGTGTG GGTGCGGAGG CGCTCTCAAG CTACAGCCGC CGCGCGGGCG CCTTCAATGG GGTGGTTCTG GGCGCCTGCC TCAATGATCT TGATGAGCAG GCAGAAGCCC TGTCGCGCCT GCGCGCGACG ATTTCCTCAA CAGGATAA
|
Protein sequence | MPSDLPAFMP DRRSDVPIFE QICTALRSRI RSGALATGAR LPATRALAGD LGVARATVVT AYEQLVAEGY LMARRGSGYT VCAMGEVELP AKGTPTPPAD LSERGDLLLE AGQPDMRLFP HRAWAKAVAR LCRTRPEDML MGCGRFGHPD LRQAIAAHVN DWRGIAARPD QVLVTSGATE ALDVCLAALA SSTGAVGVED PGYPPIRRFA QAQGRRIRDL ALDSQGAEVP TTGDGTDVVV LTPSHQYPLG GAMSAGRRQE FVRWAAQSQG WIIEDDYDSE FRYAGRPIPA LAGVDGQGRT VYIGSFSKIF SNSLRMGYIV APEALVERLV RAMRRKGARA SAMPQAALAE FMQHGEFYRH LRRVRRIYAE RHRTLIARLR SDFSDVGHVE DYQAGMQVVL HLRPEICDQT VTTAARAAGV GAEALSSYSR RAGAFNGVVL GACLNDLDEQ AEALSRLRAT ISSTG
|
| |