Gene Information |
Locus tag | TM1040_3307 |
Symbol | |
ID | 4075711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 315194 |
End bp | 316543 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004815 |
Product | extracellular solute-binding protein |
Protein accession | YP_611541 |
Protein GI | 99078283 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
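The coordinates and lengths listed above agree with each other if the Start/End positions are read as 1-based and inclusive, and if the gene length is taken to include the stop codon. Both readings are assumptions that happen to fit this record's numbers; the record does not state them. A minimal Python sketch of that consistency check, with illustrative function names:

```python
# Hypothetical helpers for sanity-checking the record; not part of the database.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(315194, 316543)
print(length)                           # 1350
print(expected_protein_length(length))  # 449
```

The strand ("-") does not change the arithmetic; it only means the coding sequence is the reverse complement of the replicon at those positions.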
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0444358 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.775669 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAG TATTGATGTC AGCCGCAGCG ATCACCGCGT TGATGGCAGG AACGGCGAGC GCCCAGGATC TGATTTTTCC GGTCGGCGAA GGCGCCTTCA ACTGGGACAG CTACGCAGAG CTGGAGAAGA TCGACCTGAA CGGTGAGCAG GTCACCGTGT TTGGTCCATG GCTCGGGCCT GACCAAGAGG TTGTTGAAAA CGTACTGGCC TACTTCGCAG CCGCGACCGG TGCGGATGTG CGCTATGCAG GCTCCGACAG CTTTGAGCAG CAGATCGTGG TCGATGCCGA GGCAGGCTCT GCCCCCAATG TTGCTGTTTT CCCGCAGCCC GGTCTGGTGT CTGATATGGC CAAGCGAGGC TTCATCACGC CGCTTGGTGA AGAAACCGCC GACTGGGTGC GCGACAACTA CGCCGCGGGT CAGTCCTGGG TGGATCTCGG AACCTATCCG GGCGCAGATG GCAATGACGG GCTCTTTGGT CTGTTCTACA AGGTCGATGT GAAGTCTCTG GTTTGGTATA ACCCGGAAAA CTTTGAGGAT TTCGGATATG AAACTCCGCA GTCCATGGAA GAGCTGAAGG CGCTGACCGA GCAGATGGTG GCCGATGGCA ACACTCCATG GTGCATCGGC CTGGGATCCG GTGGCGCGAC TGGCTGGCCT GCGACCGACT GGGTCGAGGA CATGATGCTG CGCACGCAGG AACCCGCAGT CTACGACAAA TGGGTCTCCA ATGAGCTGAA GTTCGATGAT CCTGCCGTCA TCGGTGCGAT TGAGGAATTC GGTTGGTTCG CCAAGAACGA TGACTTCGTT TCTGGTGGTG CTGGTGCCGT GGCGTCTACC GACTTCCGCG ATAGCCCCAA AGGTCTCTTT GCCAGCCCGC CGCAGTGCAT GATGCACCGT CAGGCGTCCT TCATTCCGGC CTTCTTCCCA GAAGGCACCG AAATGGGTCT GGATGCTGAT TTCTTCTACT TCCCTGCCTA CGAAGGCAAA GAACTTGGCA ATCCGGTACT GGGCGCGGGC ACCATCTGGT CGATCACCAA TGACAGCCCC GGTGCTCAGG CGCTGATGGA GTTCCTGAAG GCGCCGATCG CTCATGAAGT CTGGATGGCG CAGCAAGGGT TCCTGACCCC GCTGAAGAGC GTCAACACCG ACCTCTATGC CACCGACACG CTGAAGAAGA TGGGCGAGAT TCTTCTCTCT GCAGATACCT TCCGCTTTGA TGCATCCGAT CTGATGCCGG GTGGCGTGGG CGCCGGGTCG TTCTGGACCG GCATGGTGGA TTACGCAGGT GGCAAACCTG CCGAAGAGGT TGCAACCGAG ATCCAGTCCT CCTGGGATGC GCTCAAGTAA
|
Protein sequence | MKRVLMSAAA ITALMAGTAS AQDLIFPVGE GAFNWDSYAE LEKIDLNGEQ VTVFGPWLGP DQEVVENVLA YFAAATGADV RYAGSDSFEQ QIVVDAEAGS APNVAVFPQP GLVSDMAKRG FITPLGEETA DWVRDNYAAG QSWVDLGTYP GADGNDGLFG LFYKVDVKSL VWYNPENFED FGYETPQSME ELKALTEQMV ADGNTPWCIG LGSGGATGWP ATDWVEDMML RTQEPAVYDK WVSNELKFDD PAVIGAIEEF GWFAKNDDFV SGGAGAVAST DFRDSPKGLF ASPPQCMMHR QASFIPAFFP EGTEMGLDAD FFYFPAYEGK ELGNPVLGAG TIWSITNDSP GAQALMEFLK APIAHEVWMA QQGFLTPLKS VNTDLYATDT LKKMGEILLS ADTFRFDASD LMPGGVGAGS FWTGMVDYAG GKPAEEVATE IQSSWDALK
|
| |
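The gene and protein sequences above can be cross-checked against the GC content and translation table fields. The sketch below is one way to do that in plain Python, assuming the sequence is pasted in with its 10-base spacing intact. The helper names are illustrative. The amino acid string is the standard assignment used by NCBI translation table 11, which differs from table 1 only in its permitted start codons. Ambiguous bases such as N are not handled and would raise a KeyError.

```python
# Illustrative helpers; names are hypothetical and not part of the record.
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGAAACGAG TATTGATGTC AGCCGCAGCG"
print(translate(prefix))   # MKRVLMSAAA
print(gc_content(prefix))  # 0.5
```

Running `translate` and `gc_content` on the full gene sequence should reproduce the listed protein sequence and a GC fraction close to the reported 59%, if the record is internally consistent.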