Gene Information |
Locus tag | TM1040_3153 |
Symbol | |
ID | 4075323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 133822 |
End bp | 135075 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638004656 |
Product | extracellular solute-binding protein |
Protein accession | YP_611389 |
Protein GI | 99078131 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000182033 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGTA AGTTTATGAT GGCCGCGCTG ACGGGCACTG CCCTGGTGGC CACTTCCGCA CTGGCCGAGG ATGTCACCCT CACTGTCGAA AGCTGGCGCA ATGACGACCT GACGCTCTGG CAGGACAAGA TCATCCCCGC GTTCGAAGCC GCAAACCCCG GCATCAAGGT GAAATTCACC CCCAGCGCGC CGACCGAATA CAACGCGGTC CTGAACTCCA AGCTGGACGC AGGCTCTGCT GGTGATCTGA TCACCTGCCG CCCGTTTGAC GCCTCGCTTG CGCTCTATGA GGCGGGCCAC CTCGCCGCGC TGGATGATAT GGACGCGATG AGCAACTTCT CTGACGTCGC CAAATCCGCA TGGCAGACCG ACGATGGCTC CGCGAGCTTC TGTGTGCCGA TGGCCTCCGT GATCCACGGC TTTATCTACA ACAAAGAGGC CTTCGAAGAG CTCGGCCTTG AGGTTCCGAC CACCGAAGAC GAATTCTTTG CCGCGCTTGA GACCATCAAG GAAGACGGCA GCTATATCCC GATGGCGATG GGCACCAACG ACCAGTGGGA AGCCGCCACC ATGGGCTATA ACAACATCGG CCCGAACTAC TGGAAAGGCG AAGAAGGCCG TCGCGCCCTG ATCGCGGGCG AGCAGAAGCT CACCGACGAA CAATGGGTTG CCCCCTATGC GACCCTCGCC AAATGGGCGG ATTATCTGGG CGACGGCTAT GAGGCGCAGA CCTATCCTGA CAGCCAGAAC CTCTTCACGC TGGGCCGCGC GGCGATCTAT CCGGCAGGCA GCTGGGAAAT TTCTGGCTTC AACGCGCAAG CCGATTTTGA AATGGGCGCC TTCAAGGCTC CGGTCAAATC CGCAGGCGAC ACCTGCTATA TCTCGGACCA CACCGACATT GGTATTGGCA TGAACGCCTC CACCGAGCAC CCCGAAGCCG CCAAGGCCTT CCTCGCCTGG GTCGCATCGC CCGAGTTCGC GGACATCTTC GGCAACGCTC TGCCGGGCTT CTTCCCGCTC TCCAATGCGC CGGTTGAGCT CGAAGATCCG CTGGCCAAGG AATTTGTAAG CTGGCGTGGC GAGTGCGAGA GCACCATCCG CTCCACCTAC CAGATCCTGT CGCGCGGCAC GCCGAACCTC GAAAACGAGA CCTGGGGCGC ATCCGTTGCC GCAATCAAAG GCACCGAAAC GCCCGAAGCT CTGGGCGAAA AACTCCAGTC GGGTCTCGCA ACCTGGTACG AACCGCAACA GTAA
|
Protein sequence | MKSKFMMAAL TGTALVATSA LAEDVTLTVE SWRNDDLTLW QDKIIPAFEA ANPGIKVKFT PSAPTEYNAV LNSKLDAGSA GDLITCRPFD ASLALYEAGH LAALDDMDAM SNFSDVAKSA WQTDDGSASF CVPMASVIHG FIYNKEAFEE LGLEVPTTED EFFAALETIK EDGSYIPMAM GTNDQWEAAT MGYNNIGPNY WKGEEGRRAL IAGEQKLTDE QWVAPYATLA KWADYLGDGY EAQTYPDSQN LFTLGRAAIY PAGSWEISGF NAQADFEMGA FKAPVKSAGD TCYISDHTDI GIGMNASTEH PEAAKAFLAW VASPEFADIF GNALPGFFPL SNAPVELEDP LAKEFVSWRG ECESTIRSTY QILSRGTPNL ENETWGASVA AIKGTETPEA LGEKLQSGLA TWYEPQQ
|
| |
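The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated by the record itself):
    - coordinates are 1-based and inclusive;
    - the final codon is a stop codon, excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for locus tag TM1040_3153.
gene_len, protein_len = expected_lengths(133822, 135075)
print(gene_len, protein_len)  # 1254 417
```

Under these assumptions the result agrees with the listed Gene Length (1254 bp) and Protein Length (417 aa).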