Gene Information |
Locus tag | TM1040_0367 |
Symbol | |
ID | 4077697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 376581 |
End bp | 377822 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638005662 |
Product | ROK domain-containing protein |
Protein accession | YP_612362 |
Protein GI | 99080208 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
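The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the database record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record does end in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(376581, 377822)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1242 413
```

Under those assumptions the result agrees with the Gene Length (1242 bp) and Protein Length (413 aa) fields.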
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAGGCTC TTGCCTCAAA TGCAGGAGGA GCGCATAGAC CAGTCATGCC AGATCCACGC CCCTACTCCA CCGGCGCAAG CCAGAGTGAG CTTCGTGCTT ACAATGAACG CCTGTTGCTC ACGCTCTTGC ACCAAGGTGG CGCCCTACCC GGCAGTGAAA TGGCACGGCG CACCGGGCTG TCGTCCCAGA CCGTCTCGGT GATCCTGCGC AAGCTCGAAC AGGACGGTCT GGTACTGCGC GGCGAGAGCC AGAAAGGCCG CGTGGGCAAG CCTTCGGTCC CGATGGGCAT CAATCCCGAG GGCCTGTTTT CCTTTGGTGT CAAGATTGGC CGGCGCAGTA CGGATTTGGT GCTCACAGAC TTTCGAGGCG GGCTGCGCGC CGAGCGGCAA CTGCGATACG CCTATCCGCA ACCTGACGAT GTATTTGAGT TTATCAAAAC CGGCATAGAA GACATCAGCG CCGATCTGAG TGCCGAGCAA CAGGCGCGGA TCTGCGGCAT CGGCGTTGCG ACCCCCTTTG ATCTCTGGCG CTGGCATGCA CAAGTCGGCG CCCCAAAGTC CAGCCTCACC GCATGGCGCG CGCTGGATCC GACGATCGAA ATTGCCCGCT TCAGCCCCCT GCGGGTCTTT GTCATCAATG ATGCCACCGC AGCATGCCGA GCCGAGCATA TGTTCGGCGC CAGCAGCCGT TGGCGCGACA GCATCTATTT CTTCATCGCG GCCTTCATTG GCGGTGGCGT GGTGCTCAAT CACTCCGTCT ATGAAGGCGG CATGGGCAAT GCGGGGGCTC TAGGGCCCCT GCCCTCGCAA CGGGCTGATG GCAGCAATTG CCGCCTGCTC GACAGCGCGT CCATCCGCGA GCTCGAGCGG CTTCTGAATG AGGCCGATAT TGATCCACGC GCGCTCTGGG AGCAACCGCA GGATTGGAGT GCCTTTGCCC CGCAGGTGGA TCAATGGCTC GAAGAAACCG CCTGCGCCCT GGCGCGTGCA GCGCTGACCA CCTGCGCGGT CATTGATTTT CAGGCTGTTG TGGTAGATGG GGTACTGCCA AGGTCCATCC GCCAACGCCT GGTCGATCGC ATTCGCGAAG AGCTCCCCAA GCTGGATGCC CGCGGCCTGA TCCTACCCGA CGTCCAGGAG GGCCAGATCG GCCCGCATGC CTCGGCCCTT GGAGCGGCCG CCAAACCCCT GATTGCGCAG TATCTCCTTG ATACTCACGC CGGTTTCACT GCCACAAGCT GA
Protein sequence | MQALASNAGG AHRPVMPDPR PYSTGASQSE LRAYNERLLL TLLHQGGALP GSEMARRTGL SSQTVSVILR KLEQDGLVLR GESQKGRVGK PSVPMGINPE GLFSFGVKIG RRSTDLVLTD FRGGLRAERQ LRYAYPQPDD VFEFIKTGIE DISADLSAEQ QARICGIGVA TPFDLWRWHA QVGAPKSSLT AWRALDPTIE IARFSPLRVF VINDATAACR AEHMFGASSR WRDSIYFFIA AFIGGGVVLN HSVYEGGMGN AGALGPLPSQ RADGSNCRLL DSASIRELER LLNEADIDPR ALWEQPQDWS AFAPQVDQWL EETACALARA ALTTCAVIDF QAVVVDGVLP RSIRQRLVDR IREELPKLDA RGLILPDVQE GQIGPHASAL GAAAKPLIAQ YLLDTHAGFT ATS
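The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(displayed: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(displayed.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two displayed blocks of the gene sequence in this record.
fragment = clean_sequence("ATGCAGGCTC TTGCCTCAAA")
print(fragment)               # ATGCAGGCTCTTGCCTCAAA
print(gc_fraction(fragment))  # 0.5
```

Applying the same two functions to the complete gene sequence would give a value to compare against the GC content field, which the record reports rounded to a whole percent.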