Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2237 |
Symbol | |
ID | 4077304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2350145 |
End bp | 2351275 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638007559 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_614231 |
Protein GI | 99082077 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.246635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGC TAATGTTGGC CACAGCGGCC GCGGCGCTGG CCGCGGGCGG AGCGATGGCA GAGGTCAAAG TCGGGATGAT CACCACGCTT TCGGGCGGCG GTGCGAGCCT CGGCATCGAC ACGCGGGACG GGTTCATGCT GGCGATGGAA GCCGCAGGTC GCGATGATGT CGAAGTGGTC ATCGAGGACG ACCAGCGCAA GCCCGACATC GCCGTGCAGC TTGCCGATAA GATGATCCAG TCCGAAAAGG TCGACGTGAT GACCGGTATC GTCTGGTCCA ACCTTGCAAT GGCCGTGGTG CCTGCGACTA CCGCGCAGGG GCTGTTCTAT CTTTCGACCA ACGCGGCCCC CGCACAGCTG GCGGGCAAAG GCTGCAACGC CAATTATTTC TCGGTCGCCT ACCAGAACGA CAACCTGCAT GAAGGCGCGG GCGCCTATGC AACGCAGGCG GGGTTCAAGA ACACCTTCAT TCTCGCACCG AACTACCCGG CGGGGATCGA CAGCCTCACT GGCTTCAAAC GTTTCTATGA AGGCGATCTC GCAGGGGAGG TCTACACCAA GCTCGGCCAG ACTGATTACG CGGCTGAAAT CGCGCAGATC CGCGCATCCG GCGCCGACAG CGTGTTCTTC TTCCTGCCCG GCGGCATGGG GATTTCCTTC CTGAAGCAAT ATTCTGACAG CGGCGTCGAC CTGCCCGTCG TCGGCCCGGC CTTCAGCTTT GATCAGGGCA TCCTGCAAGC GGTGGGCGAA GCGGCGCTTG GCGTCAAGAA CTCCTCCACC TGGTCCAAGG ATCTGGACAA TGAGGCCAAC GCGGCCTTTG TTGCGGCCTT CCAGCAGAAA TACGACCGTC TGCCGTCGAT CTATGCGGCG CAGGGCTATG ACACCGCAAA CCTGCTGCTG TCGGCCATCG ACAAGGCGGA TGTGAATGAT GACGCAGCCT TTGCTGCGGC CCTCAAGGAG GCCGATTTTG CCTCTGTGCG CGGCGAATTC TCCTTTGCGG CCAACAACCA CCCGATCCAG AACGTCTATG TGCGTGAGGT CATCAAGGAA GGCGACGTCT ACACCAACAA GATCGTCGGC ACCGCTCTTG AGGATCATGC AAACGCCTAT GTGGACGAGT GCAAGATGTA A
|
Protein sequence | MKKLMLATAA AALAAGGAMA EVKVGMITTL SGGGASLGID TRDGFMLAME AAGRDDVEVV IEDDQRKPDI AVQLADKMIQ SEKVDVMTGI VWSNLAMAVV PATTAQGLFY LSTNAAPAQL AGKGCNANYF SVAYQNDNLH EGAGAYATQA GFKNTFILAP NYPAGIDSLT GFKRFYEGDL AGEVYTKLGQ TDYAAEIAQI RASGADSVFF FLPGGMGISF LKQYSDSGVD LPVVGPAFSF DQGILQAVGE AALGVKNSST WSKDLDNEAN AAFVAAFQQK YDRLPSIYAA QGYDTANLLL SAIDKADVND DAAFAAALKE ADFASVRGEF SFAANNHPIQ NVYVREVIKE GDVYTNKIVG TALEDHANAY VDECKM
|
| |