Gene Information |
Locus tag | TM1040_3143 |
Symbol | |
ID | 4075015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 121684 |
End bp | 123258 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638004646 |
Product | extracellular solute-binding protein |
Protein accession | YP_611379 |
Protein GI | 99078121 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
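The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon which is not counted in the protein length; both are assumptions inferred from the numbers in this record, not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from this record.
length_bp = gene_length(121684, 123258)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed Gene Length (1575 bp) and Protein Length (524 aa).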
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0573017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCGTC TGTTATGCAC GGCTGCCTGT GCAACCGCCC TTGTGTCCCA GCTCCCTGCC TCCGCCGTGG CCAAGGAGTT CTCTTGGGCC GTGACCACCG ATCCGCAGAC CATGGATCCC CATGCGGTGA ATTCCTCGCC CGTTCTGGGC TTTTTGAACA ACGTCTACGA AGGTCTTGTG CGGCGCGGCA AGGACATGAG CATCGAGCCT GCGCTGGCCA CCGGCTGGGA GCCGATTGGC GAGGGCGAGG GCTGGCGGTT CTTCCTGCGC GAGGGCGTGA CGTTTCAGGA CGGCAGCGCC TTTGATGCGG AGGATGTGAT CTTTTCCTAT GAGCGGGCCT CGAGCCCTGA ATCCGACACC GCAAGCTGGT TTGCGCCCGT GTCGGATGTG GTCAAGGTCG ATGACTATAC CGTTGATTTT CTGACGAATT CTCCAAACCC GATTTTTCCG GACAGCATCG CGAACTGGAT GATCATGGAC AGCGGCTGGG CCGAGGCCAA TGCCGCCAGC CGCCCCGACA AGGAGAGCGG CAACTACGCG ACGCTCAACG CCAATGGCAC CGGCGCGTTC CGCGTGACCG CACGCGAGCC TGGCCTGCGC ACCGTGCTGG AGCCCTATGA AGGCTGGTGG GGCGAGGCCG AGCACAACAT CACCCGCGCC GAGATGACTC CGATCCAGAA CCCGGCCACC GCCCTTGCGG CGCTGCTCTC GGGGGACGTG GACATGATCA ATCCGGTGCC AATTCAGGAC GTCGAGCGCC TGCAGGGCAA CCCGGATGTG AATGTGGTGC AGGGCATCGA GGCCCGCGTC ATCATGCTGG GCTTTGGCCA TCAGGCGGAG GCGTTGAAAT ACTCTGCCGA GACTGAAGAC AACCCCTTTG GGGACCCGCG CGTGCGCAAG GCCGTGGCCC ATGCGGTCAA TGTGCCTGCA ATCCTGCGCA CCATCATGCG TGGCAACGCC GAGCCGGTGA ACCAGCTGGT GAGCAGCGCC ATGCGCGGCT ACTCCGAGGC GCTGCCGGGC CAGATGGCCT ATGACCCGGA GGCCGCAAAG GCGCTCCTGG CAGAGGCAGG CTATCCCGAC GGGTTTTCCT TTGGTCTGAA GTGCCCCAAC AACCGCTACC TCAATGATGA GGCGGTCTGT CAGGCTGTGA CGGCCATGCT GGCGCAGGTG GGGCTCAAGG CGACTTTGGA TGCGATGCCG GTGCAGAACT ACTGGCCGGA ACTGCGGGCC GGGAACTTTG ACATGTATCT TCTGGGCTGG TCGCCCGGCA CCTTTGATGC GGAGCATCCA ATCCGCTTCC TGGCGGCGAC CCCGAACACG GAGAAGAAGC TCGGTTCCTG GAACTTTGGC GGCTATTCCA ACGCGCGCGT GGACGCGCTG TTGCCTAAGA TCCAGTCCGA GATCGACGAT TCGACCCGGC AGGGGATGCT TGATGAGGTG GCGCAGGTCT TGCAGGACGA GACCGCCTAT GTACCGCTCT ACGTGCAACC GCTGGTCTGG GGGACCCGCA GCAACGTCAC TTTGACCCAA CGGCCAGATA ACTTCTTTAT CCTGCGCTGG GTCTCCGTTA AATAA
|
Protein sequence | MLRLLCTAAC ATALVSQLPA SAVAKEFSWA VTTDPQTMDP HAVNSSPVLG FLNNVYEGLV RRGKDMSIEP ALATGWEPIG EGEGWRFFLR EGVTFQDGSA FDAEDVIFSY ERASSPESDT ASWFAPVSDV VKVDDYTVDF LTNSPNPIFP DSIANWMIMD SGWAEANAAS RPDKESGNYA TLNANGTGAF RVTAREPGLR TVLEPYEGWW GEAEHNITRA EMTPIQNPAT ALAALLSGDV DMINPVPIQD VERLQGNPDV NVVQGIEARV IMLGFGHQAE ALKYSAETED NPFGDPRVRK AVAHAVNVPA ILRTIMRGNA EPVNQLVSSA MRGYSEALPG QMAYDPEAAK ALLAEAGYPD GFSFGLKCPN NRYLNDEAVC QAVTAMLAQV GLKATLDAMP VQNYWPELRA GNFDMYLLGW SPGTFDAEHP IRFLAATPNT EKKLGSWNFG GYSNARVDAL LPKIQSEIDD STRQGMLDEV AQVLQDETAY VPLYVQPLVW GTRSNVTLTQ RPDNFFILRW VSVK
|
| |
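The sequences above are printed in space-separated groups of 10 characters. The sketch below shows one way to strip that grouping and run two simple checks on a nucleotide string: its GC fraction, and whether it ends in a stop codon. The stop codons listed are those of translation table 11 (TAA, TAG, TGA); the function names are hypothetical, and the short strings in the usage lines are only the first and last groups of the gene sequence, not the full record.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def ungroup(seq: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return ungroup(seq)[-3:] in STOP_CODONS_TABLE_11


# First two groups and last two groups of the gene sequence in this record.
print(gc_fraction("ATGCTCCGTC TGTTATGCAC"))
print(ends_with_stop("GTCTCCGTTA AATAA"))
```

Applied to the full gene sequence, `gc_fraction` could be compared against the GC content field, which this record reports as a rounded percentage.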