Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3144 |
Symbol | |
ID | 4075016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 123368 |
End bp | 124948 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004647 |
Product | extracellular solute-binding protein |
Protein accession | YP_611380 |
Protein GI | 99078122 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000116165 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACCA CGACTGTATT GACACTGACG TCGCTGCTGC TGGCGTCTGC GGCGCCGCTG AGCGCGGAAA CCCTGCGCTG GGCGCGTTCT GGCGATGCGC TGACGCTGGA TCCGCATGCC CAGAACGAAG GGCCCACGCA TACCGTCCGT CACCAGATGT ATGAGCCGCT CATCATCCGC GACGTGACCG GCGCGTTTGA GCCCGCATTG GCGACCGAAT GGGCACCCAA GGAAGGCGAT CCCAACGTTT GGGTGTTCAA GCTGCGTGAG GGCGTGAAGT TCCACGGCGG CGAGGATTTC ACCGCTGAGG ATGTGGTGTT CTCTTTTGAA CGCGCCAAGC AGGCCAACTC CGACATGAAG GAGCTTATTG GCTCCATCAC CGAGGTGCGC GCTGTTGATG ATCTGACCGT CGAGATCGTC ACCGATGGTC CGAACCCGAT CCTGCCGTCG AACCTCACCA ACCTGTTCAT CATGGACAAG GGCTGGACCG AGGCCAACAA CACCGTGAAC GTGCAGGATT TTGAGGGCGG CGAAATCACC TATGCCACCA CCAATGCCAA CGGCACCGGT CCCTATGTGC TGCAAAGCCG CGAGCCGGAC GTCAAAACCG TGATGACGCT CAACGAGAAC TACTGGGGCA AGGACCAGTT CCCGCTCGAA GTGACCGAGA TCGTCTACAC GCCGATCCAG AATCCCGCGA CCCGCGTGGC AGCGCTCTTG TCGGGTGAGA TCGACTTCCT TCAGGACATG CCGGTGCAGG ATCTTGACCG CGTCAGCGGT GCAGATGGTC TGATGGTGCG CAAGGCGCCG CAGAACCGCG TGATCTTCTT TGGCATGAAC ATGGGTGCCG ATGACATCGA AGCCGACAAC GTTGATGGCA AGAACCCGCT CGCTGATGTG CGCGTGCGCA AGGCGATGTC GATGGCGATC AACCGCGATG CAATCCAGAA GGTCGTTATG CGCGGCCAGT CGCAGCCGGC AGGCATGATC GCGCCGCCGT TTGTCAACGG CTGGACCGAA GAGATGGACT CGGAATCCAA GACAGACATC GAAGGCGCCA AGGCGCTGAT GGCCGAAGCG GGCTACGCGG ATGGCTTCTC GATCCGTCTG GACTGTCCCA ACGACCGTTA CGTCAACGAC GAGCCGATCT GTCAGGCCGC CGTGGGCATG CTGGGTCAGA TCGGGATTAC CGTGAACCTC GACGCCAAAC CCAAGGCGCA GCACTTCCCG CTGATCACCG ATGGCAAGAC CGACTTCTAC ATGCTGGGCT GGGGCGTGCC GACATACGAC TCCGAGTATA TCTTCAACTT CCTCGTGCAT GGTCGTGAGA GCGACATCGG CACCTGGAAC GGCACCGGCT TTGACAATGA CGAGCTGGAC GCGAAGATCA AATCTCTGGC GTCGAACACC GATCTTGAAG CGCGCAACCA GGACATCGCA GATATCTGGC GTGTGGTTCA GGACGAGCAG CTCTATATCC CGATCCACCA TCAGGTGCTG AACTGGGGCA TGTCCGAGAA GGTCGACATC GCTGTCGATC CCGAGGATCA GCCGAAGGTC AAATACTTCA AGATGAACTG A
|
Protein sequence | MKTTTVLTLT SLLLASAAPL SAETLRWARS GDALTLDPHA QNEGPTHTVR HQMYEPLIIR DVTGAFEPAL ATEWAPKEGD PNVWVFKLRE GVKFHGGEDF TAEDVVFSFE RAKQANSDMK ELIGSITEVR AVDDLTVEIV TDGPNPILPS NLTNLFIMDK GWTEANNTVN VQDFEGGEIT YATTNANGTG PYVLQSREPD VKTVMTLNEN YWGKDQFPLE VTEIVYTPIQ NPATRVAALL SGEIDFLQDM PVQDLDRVSG ADGLMVRKAP QNRVIFFGMN MGADDIEADN VDGKNPLADV RVRKAMSMAI NRDAIQKVVM RGQSQPAGMI APPFVNGWTE EMDSESKTDI EGAKALMAEA GYADGFSIRL DCPNDRYVND EPICQAAVGM LGQIGITVNL DAKPKAQHFP LITDGKTDFY MLGWGVPTYD SEYIFNFLVH GRESDIGTWN GTGFDNDELD AKIKSLASNT DLEARNQDIA DIWRVVQDEQ LYIPIHHQVL NWGMSEKVDI AVDPEDQPKV KYFKMN
|
| |