Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2686 |
Symbol | |
ID | 4077597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2825147 |
End bp | 2826733 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638008011 |
Product | extracellular solute-binding protein |
Protein accession | YP_614680 |
Protein GI | 99082526 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000382125 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.980281 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTC TAGGGATGAC GCGGCGTGGC GCGATGGCTG CGATGCTTGC GACGACGGCA ATGGCGGGGG TGGCGATGGG CGTGGCGCCT GCCGCAGCGC AGACACCGCC TGGCGTGCTG ATCGTGGGCC AGATCGCAGA GCCAAAAGCG CTGGACCCGG CGGCAGTGAC GGCGGTAAAT GACTTCCGCA TCCTGATGAA CGTCTATGAC GGTCTGGTGC GCTACAAGGA CGGCACGCTC GAGGTCGAAC CCGCGCTGGC GACCGACTGG AGCATCTCCG AAGATGGCAC CGAATATACA TTCACGCTGC GCGAAGGGGT GTCGTTTCAT GACGGCAGCG CCTTTGATGC CGAGGCGGTG GTGTTCAACT TTGAGCGCAT GCTCAATGAG GATCACCCCT ATCACAACAC CGGCCCCTTC CCGCTGGCCT TCTTCTTTTC TGCCGTGGAG AGCGTCGAGG CCGTTGATGA TCTGACGGTG AAATTCAAAC TGAACGCGCC CTATGCGCCG TTCCTGTCGA ATCTCGCTTA TCCTACAGGC CTGATTGTAT CGCCTGAGGC GGTCAAGACC CATGGCGCGG AGTTCGGCCG CAACCCCTCC GGCACCGGTG CTTTCAAATT TGCCGAGTGG CGCTCCAATG AGGCCGTGGT GGTCGAGAAA AATCCCGACT ACTGGGATGG CGCGGCAGAG CTGGACGCGG TGGTCTTTCG CCCGATCACC GATGCCAACA CCCGCACGGC AGAAATGCTG GCAGGTGGCA TTGATCTGAT GGTCGAGGTG CCGCCGGTGG CACTGTCGGA GTTTCAGGGC GATGCTTTCA CCGTGCATGA ACAAGCCGGC CCGCACGTCT GGTTCCTGAT CCTCAACGCC AAGGAAGGCC CCTTTGCCGA CAAGCGCGTC CGCCAAGCGG CGAATTACGC GATCAACAAA TCCGCGATTG TGAACGATGT GCTTGAGGGC ACGGCGGAGG TGGCCGCAGG CCCGACCCCG CCCGCCTTTG CCTGGGCCTA CAATGAAACG CTCGAACCCT ATCCCTATGA CCCCGACAAG GCGCGGGAAC TCCTGGCCGA GGCGGGTGCA GAAGGGGCGG AGCTGACGTT CTATGTGACC GAGGGCGGCT CCGGCATGCT CGACCCTATC GCCATGGGCA CTGCCATTCA GGCGGATCTC AACGCCGTGG GGCTGGATGT GAAGATCGAA ACCTACGAGT GGAACACCTT CCTGGGCGAG GTCAATCCGG GGCTGGAGGG CAAGGCCGAC ATGGCCGAGA TGGCCTGGAT GACCAACGAC CCCGACACGC TCCCCTTCCT GGCGCTGCGC ACCGAAGCCT GGCCTGACAA GGGCGGCTTC AACTCCGGCT ATTATTCCAA CCCGAAGGTG GATGAGCTGT TGGAAGCGGC CCGCGTTGCG ACCGATCAGG ACGAGCGCGC CAAGCTTTAT CAGGAGATGC AGACCATCGT GCAGGAAGAT GCGCCTTGGG TCTTTGTCGC CAACTGGAAG CAGAATGCAG TGACCTCGGA TCGGGTGGGC GATTTTGCCC TGCAGCCCTC GTTCTTCCTG CTGCTCGATG ATGTGACCAA GAACTGA
|
Protein sequence | MKLLGMTRRG AMAAMLATTA MAGVAMGVAP AAAQTPPGVL IVGQIAEPKA LDPAAVTAVN DFRILMNVYD GLVRYKDGTL EVEPALATDW SISEDGTEYT FTLREGVSFH DGSAFDAEAV VFNFERMLNE DHPYHNTGPF PLAFFFSAVE SVEAVDDLTV KFKLNAPYAP FLSNLAYPTG LIVSPEAVKT HGAEFGRNPS GTGAFKFAEW RSNEAVVVEK NPDYWDGAAE LDAVVFRPIT DANTRTAEML AGGIDLMVEV PPVALSEFQG DAFTVHEQAG PHVWFLILNA KEGPFADKRV RQAANYAINK SAIVNDVLEG TAEVAAGPTP PAFAWAYNET LEPYPYDPDK ARELLAEAGA EGAELTFYVT EGGSGMLDPI AMGTAIQADL NAVGLDVKIE TYEWNTFLGE VNPGLEGKAD MAEMAWMTND PDTLPFLALR TEAWPDKGGF NSGYYSNPKV DELLEAARVA TDQDERAKLY QEMQTIVQED APWVFVANWK QNAVTSDRVG DFALQPSFFL LLDDVTKN
|
| |