Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2713 |
Symbol | |
ID | 4077020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2857581 |
End bp | 2859170 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638008038 |
Product | extracellular solute-binding protein |
Protein accession | YP_614707 |
Protein GI | 99082553 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACACC TGAAGACAAC TCTGCTCGCC AGCGCCTTGA TGCTGCCGCT TGCAGCCCCT GCCGTACTGG CTGATACGCC CGAAGGCGTG CTCGTGGTTG CACAGAACAT CGACGATGTC GTCGCCATCG ACCCGGCGCA GGCCTATGAG TTCACCTCCG GCGAGCTCGT GACCAACCTC TATGACCGTC TGGTGCAATA CGATGCCGAA GACACCACCG TTCTGGCCGC AGGTCTGGCC TCGGAATGGG TCACCGATGC GGATGCCAAG ACCATCACTT TCACCCTGCG CGATGGGGCG ACCTTTGCCT CTGGCAATCC CGTGACCGCA GAAGATGTGG TCTATTCCTT CTCCCGCGTG GTGAAGCTGA ACCTGACCCC GGCGTTCATC CTGACTCAAC TGGGCTGGAC GGCAGATAAC ATCGGCGAAA TGGTCACGGG CGAGGGCAAC ACCGTCACCG TGAAATACGC CGGCGACTTC TCTCCGGCGT TTGTACTGAA CGTCCTGGCG GCACGTCCTG CCTCCATCGT CGACAGCAAG CTGGTGCAGG AAAACGAAGT TGACGGCGAC ATGGGCAATG CCTGGCTCAA CGCCAATGCC GCCGGTTCTG GCCCCTTCAC GCTGCAACGC TATGCGGCGG GTCAGATGGT GCGCATGCAG GCCAATCCGA CCTATTTCAA CGGCGCGCCC AAGATCGACA GCGTGATCAT CCGCCATGTG GCCGAAAGCG CGACCCAGCA GCTTTTGCTG GAGCAGGGCG ACGTGGATCT GGCCCGCAAC ATGACACCCG ATCAGGTGGC TTCCCTTGAA AGCGGCGAGA TCAAGGTCGA GACATACCCG CAGGCGGCTG TGCATTTCCT GTCGTTCAAC CAGAAGACCG AGAGCCTTAC GCCCCCCGCC GTTTGGGAAG CCGCGCGCTA TCTGGTGGAC TACAAGGGCA TGACCGAGAC CATCATCAAA GGTCAGATGG AAGTCCACCA GGCGTTCTGG CCCAAGGGCT TCCCCGGTTC CTATGACGAA ACGCCGTTCT CTTATAACCC GGAAAAAGCC AAGAGCATTC TGTCCGAGGC CGGGATCGAG ACCCCGATCA CCGTGTCGCT CGACGTGATC AACGCCGCGC CCTTTACCGA CATGGCGCAA TCGTTGCAGG CGAGCTTTGC CGATGCGGGC ATCAACTTTG AGATCCTGCC CGGCACCGGC AGCCAGGTCA TCACCAAGTA CCGCGAGCGC AGCCATGAGG CGATGCTGCT GTACTGGGGC CCGGACTTCA TGGATCCGCA CTCCAACGCC AAGGCCTTCG CCTATAACTC CAACAACGCA GACGACTCCT ATGCCGCCAC AACCACATGG CGCAATGCAT GGGCCGTGCC GGATGCGCTC AACGAGAAAA CCATGGCGGC TCTGACCGAG AGCGACGCCG AGGCCCGTCT CAACATGTAT CGCGAGCTGC AAAAAGAAGT GCAGGCCGAG TCGCCCATCG TGATCATGTT CCAGGCCGCC TATCAGGTTG CCATGAACGA GGCCGTTTCT GGCTATGTGA ACGGCGCCAC CTCGGATTTT GTCTTCTACC GTCTGGTTGA AAAACAGTAA
|
Protein sequence | MKHLKTTLLA SALMLPLAAP AVLADTPEGV LVVAQNIDDV VAIDPAQAYE FTSGELVTNL YDRLVQYDAE DTTVLAAGLA SEWVTDADAK TITFTLRDGA TFASGNPVTA EDVVYSFSRV VKLNLTPAFI LTQLGWTADN IGEMVTGEGN TVTVKYAGDF SPAFVLNVLA ARPASIVDSK LVQENEVDGD MGNAWLNANA AGSGPFTLQR YAAGQMVRMQ ANPTYFNGAP KIDSVIIRHV AESATQQLLL EQGDVDLARN MTPDQVASLE SGEIKVETYP QAAVHFLSFN QKTESLTPPA VWEAARYLVD YKGMTETIIK GQMEVHQAFW PKGFPGSYDE TPFSYNPEKA KSILSEAGIE TPITVSLDVI NAAPFTDMAQ SLQASFADAG INFEILPGTG SQVITKYRER SHEAMLLYWG PDFMDPHSNA KAFAYNSNNA DDSYAATTTW RNAWAVPDAL NEKTMAALTE SDAEARLNMY RELQKEVQAE SPIVIMFQAA YQVAMNEAVS GYVNGATSDF VFYRLVEKQ
|
| |