Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3437 |
Symbol | |
ID | 4075611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 461879 |
End bp | 463357 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638004946 |
Product | extracellular solute-binding protein |
Protein accession | YP_611671 |
Protein GI | 99078413 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.58601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.852102 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTCA AAAAACCCAT TCTGGCGGCC GTCTCTGCGC TCGCCCTTCT GGCGGGGCCT GCGCTGGCCA AGGACACCGT GACCTACGCC ACCCAGCTGG AGCCGCCGCA TCTCGATCCC ACTGGCGGCG CGGCGCAAGC CATCGATACG GTGGTGTATC TCAATATCTT CGAGGGCCTC ACGCGCTTTA CCCCGGATGG GGCTGTGGTG CCGCTTCTTG CGAAATCCTG GGAGATTTCC GAGGACGGTC GGACTTATAC GTTCACTCTG CAAGAGGGCG TCACCTTTCA CGACGGCAGC ACGCTTGATG CGCAGGATGT GAAATTCTCG CTCGATCGCG CCCGCGCCGA GGACAGCACC AACGCCCAGA AGGCGCTCTT TGCGGACATT GCGGACGTCA CAGTGAGTGA TGCGCAAACC GTGGTGGTGA CGCTCTCCGA GCCCAACGGC AATTTCCTCT TCAACCTCGC CTGGGGCGAT GCGGTGATCG TGGCCGAAGA GTCGGTCGAG ACCCTGAAAA CTGCTCCTGT GGGCACCGGC CCCTACCGCT TTGGCGAATG GGTGCAGGGA GACCGGGTAG AGATGGTGCG CAACCCGAAT TACTGGGGCG AGATCCCCGA GCTGACGGGC GCCACGATCA AGTTCATCTC CGACCCCACC GCCGCCTTTG CCGCGATGAT GGCCGAAGAC ATTGACGCCT TTGATAATTT CCCGGCGCCA GAGAATATGA TCCAGTTCGA GGCCGATCCG CGTTTTCAGG TGATCGTTGG CTCCACCGAA GGCGAGACGA TCCTGTCGAC GAACAACGCC CAGGCACCCT TTGACAACCC CAAAGTGCGT CAGGCGCTGG CCCATGCGAT TGATCGTCAG GCCATCGTGG ATGGTGCGAT GTTTGGCTAT GGCACGCCGA TTGGCACCCA TTTTGCGCCG CATAACCCGG CCTATGTGGA TCTCACCGGT CAGTCCGATT TTGACCCCGA CAAAGCCCGC GCGCTTCTGG CCGAAGCGGG CCTTGCAGAT GGGTTCACCA CCACGCTGCA CCTGCCGCCC CCCGCCTATG CCCGTCGCGG TGGCGAGATC GTGGCCGCGC AGCTGGCCCA GGTGGGTATC ACCGCCGAGA TCATCAATGT GGAATGGGCG CAGTGGCTTG AGACCGTGTT CAAAGGCAAG ACCTATGGTC TCACGATCGT CAGCCACACC GAGCCGATGG ACATCGGGAT CTATGGCCGT CCGGATTATT ACTTCCAGTA TGACAATCCG GAGTTCCAGG GCGTGATGAG CCGCCTCAAC GCGACCACCG ACCCGGACCA GCGTACGGCG CTCCTGCAGG ACGCCCAGCG CATGATCGCG GATGACTATG TTAACGGCTA CCTGTTCCAG CTGGCAAAGC TCGGCGTTGC CAAAGCCGGC CTTGAGGGCA TCTGGGCCAA TGCTCCGGCC GCTGCCATCG AAATCGGGGC GCTCAGCTGG GCTGAGTAA
|
Protein sequence | MNFKKPILAA VSALALLAGP ALAKDTVTYA TQLEPPHLDP TGGAAQAIDT VVYLNIFEGL TRFTPDGAVV PLLAKSWEIS EDGRTYTFTL QEGVTFHDGS TLDAQDVKFS LDRARAEDST NAQKALFADI ADVTVSDAQT VVVTLSEPNG NFLFNLAWGD AVIVAEESVE TLKTAPVGTG PYRFGEWVQG DRVEMVRNPN YWGEIPELTG ATIKFISDPT AAFAAMMAED IDAFDNFPAP ENMIQFEADP RFQVIVGSTE GETILSTNNA QAPFDNPKVR QALAHAIDRQ AIVDGAMFGY GTPIGTHFAP HNPAYVDLTG QSDFDPDKAR ALLAEAGLAD GFTTTLHLPP PAYARRGGEI VAAQLAQVGI TAEIINVEWA QWLETVFKGK TYGLTIVSHT EPMDIGIYGR PDYYFQYDNP EFQGVMSRLN ATTDPDQRTA LLQDAQRMIA DDYVNGYLFQ LAKLGVAKAG LEGIWANAPA AAIEIGALSW AE
|
| |