Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3603 |
Symbol | |
ID | 4075030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 652050 |
End bp | 653573 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638005122 |
Product | extracellular solute-binding protein |
Protein accession | YP_611832 |
Protein GI | 99078574 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.721238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGATC TTTCTCTCCC CCGTTCGGCT TTGATTGCCC TGACGCTTGC TTCGACCACC GCGATGCCCG CGCTTGCAGA GAAAGCTGCA GGCACCTTGA ACGTCGCCTT CACCAAAGAG CTCGAGAACG TCGACAGCTA TTTCAATTCA TCGCGCGAAG GCGTGGTGAT GCAGCGCGCG GTCTGGGATG GCCTGATCTA CCGCGATCCC AACACCAACG AATACATCGG CAACCTTGCA ACCAGCTGGG AATGGATCGA CGACACCACG CTGGAATTCA AGCTGCGTGA GGGCGTTACT TTTCACAATG GCGAGCCCTT CAACGCCGAT GACGTCGTCT ACACCGTGAA CTATGTCGCC AACGAGGAAA ATGGCGTCAA AACCCAGCGC AATGTGAACT GGATGAAGTC CGCAGAAAAG ATCGACGACT ACACCGTCCG GATCCACCTC AAGGACAAAT TTCCCGCTGC GATCGAGTTC CTCTCCGGCC CAGTTTCGAT GTATCCCAAT GAGTATTACG CCGAGGCAGG CCCCTCTGGC ATGGGACTGA AGCCCATCGG CACCGGGCCT TACAAGGTGA CAGAAGTGGT TCCGGGCCAG CATTTTGTGC TTGAGGCCAA CGAGACCTAT CACGACAGCC CCAAGGGTCA GCCGGAGATC GCAAAAATCG ACATCCGCAC CATTCCAGAC GTCAATACTC AGATGGCAGA GCTCTTTTCC GGCTCTCTGG ATCTGATCTG GCAGGTGCCC GCGGATCAGG CCGAAAAACT TGCGCAACTG GGCCAGTTCA CCGTCGCCAA TGAATCCACG ATGCGTGTGG GCTACCTGCA AATGGACTCG GCCGGTCGAT CAGGTGAGGA CAACCCGTTT ACCAACGCCA AGGTGCGCGA GGCCGTGAAC TATGCGATCA ACCGTCAGGA ACTGGTCGAT GCCCTGCTCA AGGGCTCCAG CCAAGTTGTC TACACCCCCT GTTTTCCAAG CCAGTTTGGC TGTGTGCAGG ATGTGACCAC GTATGAGTAC AATCCCGAAA AGGCGAAGGA GCTGCTGGCA GAGGCAGGTT ATCCCGACGG GTTCTCGACA GAATTCTATG CCTATCGCGA CCGCCAGTAT GCAGAGGCCA TCGTCTCTTA CCTGAATGCC GTGGGCATCG ATACCGATTT CAAGATGCTA CAGTATTCAG CGCTGCGCGA CCTGAACATG AAGGGCGAAG TGCCGCTGTC GTTCCAAACC TGGGGCAGCT ATTCGATCAA TGACGCGTCT GCGATGGTTA GCCAGTTCTT CAAACACGGC TCGCTCGACA GCACCCGCGA CGATGAAGTG CTCGATTGGC TAAATGTGGC CGACAGCTCC ACCGATCCCG ACGAGCGGAT CGAGTATTAC ACCAAGGCGA TCCAGAAGAT CACAGGCGAG GCCTACTGGG CACCCATGTT CAGCTACAAC ACGAACTATG TCTTCACCAG CGACGTGAGC TACACGCCCA CCGCAGACGA AGTTCTGCGT TTTGTGGACA TGTCCTGGAA CTGA
|
Protein sequence | MFDLSLPRSA LIALTLASTT AMPALAEKAA GTLNVAFTKE LENVDSYFNS SREGVVMQRA VWDGLIYRDP NTNEYIGNLA TSWEWIDDTT LEFKLREGVT FHNGEPFNAD DVVYTVNYVA NEENGVKTQR NVNWMKSAEK IDDYTVRIHL KDKFPAAIEF LSGPVSMYPN EYYAEAGPSG MGLKPIGTGP YKVTEVVPGQ HFVLEANETY HDSPKGQPEI AKIDIRTIPD VNTQMAELFS GSLDLIWQVP ADQAEKLAQL GQFTVANEST MRVGYLQMDS AGRSGEDNPF TNAKVREAVN YAINRQELVD ALLKGSSQVV YTPCFPSQFG CVQDVTTYEY NPEKAKELLA EAGYPDGFST EFYAYRDRQY AEAIVSYLNA VGIDTDFKML QYSALRDLNM KGEVPLSFQT WGSYSINDAS AMVSQFFKHG SLDSTRDDEV LDWLNVADSS TDPDERIEYY TKAIQKITGE AYWAPMFSYN TNYVFTSDVS YTPTADEVLR FVDMSWN
|
| |