Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3312 |
Symbol | |
ID | 4075717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 321168 |
End bp | 322187 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638004820 |
Product | putative periplasmic solute-binding protein |
Protein accession | YP_611546 |
Protein GI | 99078288 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00706911 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACATA TCGCTCTTAC GGTTCTGGCT GCAACCGCTC TTGCGGCACC GCTGACCGCG TCTTTTGCAT CTTCGGCGCA GGCCGAAGAC GCCATCTGCT ACAATTGCCC CCCGCAATGG GCGGATTGGG CGTCCATGCT GGAGGCCATC GAGACCGAGA TCGGTGTCAG CCTTCCGCAT GACAACAAGA ACTCCGGCCA GACATTTGCA CAGCTTGTGG CTGAAAAAGA CAGCCCCGTG GCAGATGTGG CCTATTACGG TGTGACCACC GGCATCAAGG CGGGCAAGGA AGGTCTGGTC GAGGCGTACA AGCCCGCAGG TTTTGACGAG ATCCCGGAGG GGCTCAAAGA CCCCGAAGGC AAGTGGTTCG CAGTGCATTA CGGCACCATC GGGTTCTTTG TGAATGTGGA CGCCCTTGGC GGCGCGCCCG TCCCGCAGTG CTTTGCAGAC CTGAAAAAGC CTGCCTATCA GGGAATGGTG GGTTATCTGG ATCCCTCGTC GGCCTTTGTC GGATATGCCG GGGCCGTCGC CGTCAACCTT TCCTTTGGGG GCGATCTGCA AGACTTTGAC CCCGCAATCG AGTATTTTTC CGAGCTGGCA GAGAACGCAC CGATCGTGCC CAAGCAGACG TCTTATGCGC GGGTCGTATC GGGAGAGATC CCGATCCTGT TTGATTACGA CTTCAACGCC TATCGCGCGA AATACGAAGA AGACGGAAAT TTTGAATTTG TCCTGCCCTG CGAGGGGTCG GTGCGTGTAC CCTATGTCAT GAGCCTCGTG GGCAATGCGC CTCACGGCGA GACCGGCAAG AAGGTTCTGG ATTTCATTCT CTCTGACAAA GGGCAGGCGA TCTGGACCAA CGCCTATCTG CAGCCCGCGC GCCCGGTTGA GCTGCCTGCT GAGGTGGCGG AGAAATTCCT GCCCGCCAGC GATTATGCCC GTGCACAGGC TGTGAACTAT GCAGAGATGG AAAAGGCGCA GGCCGGTTTT GGCGAACGCT ACCTGAACGA AGTCAAATAA
|
Protein sequence | MKHIALTVLA ATALAAPLTA SFASSAQAED AICYNCPPQW ADWASMLEAI ETEIGVSLPH DNKNSGQTFA QLVAEKDSPV ADVAYYGVTT GIKAGKEGLV EAYKPAGFDE IPEGLKDPEG KWFAVHYGTI GFFVNVDALG GAPVPQCFAD LKKPAYQGMV GYLDPSSAFV GYAGAVAVNL SFGGDLQDFD PAIEYFSELA ENAPIVPKQT SYARVVSGEI PILFDYDFNA YRAKYEEDGN FEFVLPCEGS VRVPYVMSLV GNAPHGETGK KVLDFILSDK GQAIWTNAYL QPARPVELPA EVAEKFLPAS DYARAQAVNY AEMEKAQAGF GERYLNEVK
|
| |