Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2745 |
Symbol | |
ID | 4077617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2889841 |
End bp | 2891496 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638008070 |
Product | extracellular solute-binding protein |
Protein accession | YP_614739 |
Protein GI | 99082585 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCA CTGCTACCGC CGGGCTCTTG GCCTCGGTTG CGCTCTGGAC CACTGCCGCA CAGGCAGAGG AAACCGTGCT GAGCGCGTTG CCGGAACAGG TCACCGCTTG GGTGGAGAAC TTCAACCCGT TCAACCAGAC CACCGCGGCG CCGTCCGTCA TGCATTTCAT GTACGAGCCG CTGATCATTT TCAACGCGCT CGACGGCGGC AAGCCGATCT ACCGTCTGGC GACCGCATTT GAGTATTCCG ATGATCTGAG TTCCATCACC GTGACCCTGC GCGATGGCGT TCAGTGGTCC GATGGCGAAG CATTCACCGC CGATGATGTG GTCAAATCCT TTGATCTGGC GCTGAGCGAT CCGGCCCTCG ACAGCGTCGG CATGGCGCAG ATGCTCTCTG GTGTCGAGAA ACTCGACGAG ATGACGGTGA AGTTCAACCT CTCCACCCCT TCAAGCCAGG CCATGTACCA GATCGTGCGT GTGCCAATCG TCCCCGAACA CGTCTGGAGC AACGTCTCCG ATCCCGTGAC CTTTACCAAC CCTGATCCCG TGGGGTCCGG TCCGCTGACA GAGATCCGCC GTTTCACGCC GCAGGAATAC ATCCAGTGCC GCAACAACAA TTACTGGGAC GCTGAGAGCC TCAAGGTCGA CTGCATGCGT TTCCCGCAGA TCGCCAACAA CGATCAGGCG CTGGCAGCGG CCGCGAATGG CGAACTGGAC TGGATGGGGT CCTTCCTGCC CGACATCGAC AACACCTTTG TCGCCAAGGA CCCCGAGCAT CACAGCTATT GGCTGCCCGC AGGCTCTCTC GTGGCGTTCT ACATGAACTT CGAGGCCAAA GAAGCCGGCG ACAAAGAAGC CGTGAACAAC GTCGCCTTCC GCCGCGCGGT GTCGATGGCC TTCGACCGCG AAGCGATGGT GGAAATTGCA GGCTATGGCT ATCCGACGAT CAACCAGTAT CCCTCTGGTC TGGGCCGCGC TTATCACGCG TGGAACAACC CCGAAGTCGA GGACAAATTT GGCGCGTTTA CCCAATATGA CATCGAGGGC GCCAAGGCAC AGCTGGCCGA GGCCGGGTTC AAGGACATTG ACGGTGACGG CTTTGTGGAA ACCCCAAGCG GTGAGCAGAT CGACATTGAA GTCATCGTGC CCAACGGCTG GACCGACTGG GTCAACAGCA GCCAGATCGC GGTCGAGGGC CTGAATGCAG CCGGGATCAA GGCCAATGTC TCAACACCTG AATCCGCGAT CTGGACCGAA AAGCTGATCA AGGGCGACTA TGACATGGCG ATCAACTCGG TTCGTGTTGG TGCGACCCCC TTCAACCAGT ATCTGGACTC GCTCCACGAG ATTAATCAGG CCAAGTCGCG TTTTGCCGCG TCGCGGTACT ACAACGAAGA GCTGAGCGAC CTTCTGGATG CCTTCACCCA GACCAGCGAC ACCGACAAGC AGATGGCGAT CATGTCCGAT GTACAAGAGA TCGTCGGTGA AGACATGCCG CTGGCCTATG TGTTCAACAA CCCGCGCTGG TATCAGTACA ACACCAAGCG TTTCGAAGGC TTCTTCAACG CTGACAACCC GGTGGCCAAC CCGGTGGTTC ACAAAACCAA CCCGGCCCGT CTGATCCACC TCCTGAACCT GCGCCCGGTC GAGTAA
|
Protein sequence | MKFTATAGLL ASVALWTTAA QAEETVLSAL PEQVTAWVEN FNPFNQTTAA PSVMHFMYEP LIIFNALDGG KPIYRLATAF EYSDDLSSIT VTLRDGVQWS DGEAFTADDV VKSFDLALSD PALDSVGMAQ MLSGVEKLDE MTVKFNLSTP SSQAMYQIVR VPIVPEHVWS NVSDPVTFTN PDPVGSGPLT EIRRFTPQEY IQCRNNNYWD AESLKVDCMR FPQIANNDQA LAAAANGELD WMGSFLPDID NTFVAKDPEH HSYWLPAGSL VAFYMNFEAK EAGDKEAVNN VAFRRAVSMA FDREAMVEIA GYGYPTINQY PSGLGRAYHA WNNPEVEDKF GAFTQYDIEG AKAQLAEAGF KDIDGDGFVE TPSGEQIDIE VIVPNGWTDW VNSSQIAVEG LNAAGIKANV STPESAIWTE KLIKGDYDMA INSVRVGATP FNQYLDSLHE INQAKSRFAA SRYYNEELSD LLDAFTQTSD TDKQMAIMSD VQEIVGEDMP LAYVFNNPRW YQYNTKRFEG FFNADNPVAN PVVHKTNPAR LIHLLNLRPV E
|
| |