Gene Information |
Locus tag | TM1040_1493 |
Symbol | |
ID | 4077049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1598143 |
End bp | 1599129 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006806 |
Product | ABC transporter, periplasmic substrate-binding protein |
Protein accession | YP_613488 |
Protein GI | 99081334 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.140791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACG TGCTAACAGC CGCCGCCATG GCACTGATGG CAACCTCTGC TGCGCAGGCC GCAGATGAGG TGAAGCTGCA ACTGAAATGG GTCACCCAGG CGCAGTTTGC AGGCTACTAT GTGGCGCTGG ACAAAGGGTT TTACGGGGAA GAAGAGCTGG ATGTGACCAT TCTTCCCGGT GGCCCCGACA TCGCGCCCAC GCAGGTGATC GCAGGGGGCG GCGCGGATGT GACCGTGGAA TGGATGCCCG CAGCTCTGGC TGCGCGGGAA AAGGGGCTGC CGCTGGTCAA CATTGCCCAG CCGTTCAAAT CTTCTGGCAT GATGCTCACC TGCTGGAAAG ACACCGGCAT CTCTGCACCC AAGGATCTCG CGGATCGCAC TCTTGGGGTC TGGTTCTTCG GCAATGAGTT CCCCTTCATG AGCTGGATGG GCAAGCTCGG TATCTCGACA GAGGGCAAGG GTCCCGAAGG CGTAGAGGTT TTGAAACAGG GCTTTAACGT CGATCCTCTG CTGCAGCGGC AGGCCGATTG CATCTCCACC ATGACCTATA ACGAATATTG GCAGGTGATT GATGCGGGGG TCTCGGCTGA TGAGCTTGTG ACCTTCAAAT ACGAGGACCA AGGCGTCGCA ACGCTCGAGG ATGGCCTCTA TGTGCTCGAG GACAACCTCT CCGATCCGGC CTTCGTCGAC AAAATGGAGC GCTTTGTCCG AGCGTCTATG AAGGGTTGGA AATACGCCGA AGAGAACCCG GAGGAGGCCG CAGAAATCGT GCTCGACAAT GATGCCTCTG GTGCCCAGAC CGAAACCCAC CAAAAGCGGA TGATGGGCGA GATTGCCAAA CTCACTTCCG GCAGCAATGG CGCGCTTGAC GTGGCAGACT ACGAGCGCAC GGTGCAGACC CTGCTGAGCG GCGGCTCTGA CCCGGTCATC ACCAAGGCGC CTGAAGGGGC GTGGACCCAT GCGATCACCG ATGCGGCGCT GAACTGA
Protein sequence | MKNVLTAAAM ALMATSAAQA ADEVKLQLKW VTQAQFAGYY VALDKGFYGE EELDVTILPG GPDIAPTQVI AGGGADVTVE WMPAALAARE KGLPLVNIAQ PFKSSGMMLT CWKDTGISAP KDLADRTLGV WFFGNEFPFM SWMGKLGIST EGKGPEGVEV LKQGFNVDPL LQRQADCIST MTYNEYWQVI DAGVSADELV TFKYEDQGVA TLEDGLYVLE DNLSDPAFVD KMERFVRASM KGWKYAEENP EEAAEIVLDN DASGAQTETH QKRMMGEIAK LTSGSNGALD VADYERTVQT LLSGGSDPVI TKAPEGAWTH AITDAALN
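The length fields in this record can be cross-checked against the coordinates and the sequences. The following is a minimal sketch, not part of the source database; it assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence above ends in TGA). The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = strip_blocks(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    n = gene_length(1598143, 1599129)      # Start bp and End bp from this record
    print(n, expected_protein_length(n))   # prints: 987 328
```

Under these assumptions the coordinates reproduce the Gene Length (987 bp) and Protein Length (328 aa) fields. Passing the full Gene sequence value to gc_fraction gives a figure that can be compared with the GC content field.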