Gene Information |
Locus tag | TM1040_3616 |
Symbol | |
ID | 4075043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 670359 |
End bp | 671495 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005135 |
Product | periplasmic solute binding protein |
Protein accession | YP_611845 |
Protein GI | 99078587 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4531] ABC-type Zn2+ transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
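The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the gene ends in a single stop codon. A minimal sketch of that arithmetic follows; both function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(670359, 671495)
print(length_bp, protein_length(length_bp))  # prints: 1137 378
```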
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0144381 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCAGAA CAGTTTTGTC CGCGCTCTTT CTCAGCCTAT CTGTTGTTCC TGCCCTTGCA GAGACCCCGC GTGTCGTGAC CGACATTGCA CCCGTGCAGG GTCTGGTCGC TCGGGTCATG GATGGCGTCG GCGCGCCAGA TGTTCTGGTT CCGCCCGGAG CGTCCCCCCA TGGGCATAGC CTGAAGCCAT CGGATGCCCG CGCGTTGACT TCTGCGGATG CGGTGTTCTG GATCGGTGAC GAATTATCGC CGTGGCTGCT CGGCTCGCTC AAAGAGCTCG CAGGGGATGC GCATGTGGTG TCCCTACTCG CGGCACCGCA GACGATGCGG CTTGAGTTCC GCGAGGGGGT GGTTTTTGGT GGGGCTGACC ACGATGACCA TGGCCACGAT GATCACGACC ACGATGCCCA CGAAGATCAT GCTCACGACG GTCACGGGCA CGAAGAGCAC GATCATGATG ACCACAAGGG CCATGATGAT CACGGTGCAC ACGACCATGA CGACCATGCA CATGATCAAG ACGCGCATGG TCACGATGAG GATGCGCATG ACGCCCACGA TCACGACTCG CATGAGACCG CTCACGATGA CCATGGTCAT GGTCATGACG ATCACGCTCA CGACGGGGTT GATCCGCATG CCTGGCTGGC ACCTGAAAAC GGCAAGCAGT GGCTGGCCTT GGTTGCCGAT GAGCTGTCCG AGATCGATCC GGCGAATGCG GACACCTATC AGAACAACGC GCGTGCGGGC CAAGCCGAAA TCGACGCAAT TGTTGCTGCC ACAAAGGCAG ATCTCGGCGA AGCCCATGGG CAGTTCGTGG TATTCCACGA TGCATATCAG TACTTTGAGC AAAGCTTCGG GCTTCGTGCT CTCGGTGCAA TTGCTCTTGG AGATGCTTCC GACCCGAGCG TCGCGCGGAT CGCAGAAATG CGCGATGCGG TTGCTGGTCA AGAGGTCTCC TGTGTGTTCT CTGAGCCGCA ATTCAATGCA GGTCTTGTAG ACACCGTCGC TGATGGGCTC GACATTAAGG CCGTTGTGAT CGACCCGCTG GGGACCGAAA TCGCAACTGG GCCGTCGTTC TATACAGATC TGCTGTCCGA GATTTCTGCA GGCTTCAAAA CGTGCCTGAC GCACTGA
|
Protein sequence | MPRTVLSALF LSLSVVPALA ETPRVVTDIA PVQGLVARVM DGVGAPDVLV PPGASPHGHS LKPSDARALT SADAVFWIGD ELSPWLLGSL KELAGDAHVV SLLAAPQTMR LEFREGVVFG GADHDDHGHD DHDHDAHEDH AHDGHGHEEH DHDDHKGHDD HGAHDHDDHA HDQDAHGHDE DAHDAHDHDS HETAHDDHGH GHDDHAHDGV DPHAWLAPEN GKQWLALVAD ELSEIDPANA DTYQNNARAG QAEIDAIVAA TKADLGEAHG QFVVFHDAYQ YFEQSFGLRA LGAIALGDAS DPSVARIAEM RDAVAGQEVS CVFSEPQFNA GLVDTVADGL DIKAVVIDPL GTEIATGPSF YTDLLSEISA GFKTCLTH
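The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted initiation codon and is read as methionine at the first position. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a simplified, hand-written translator, not the database's own pipeline; the helper name `translate_cds` is hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a plus-strand CDS; spaces between 10-base blocks are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    protein = ["M"]  # any accepted start codon is read as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


print(translate_cds("GTGCCCAGAA CAGTTTTGTC CGCGCTCTTT"))  # prints: MPRTVLSALF
```

The output matches the first ten residues of the protein sequence listed above.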