Gene Information |
Locus tag | Tgr7_2097 |
Symbol | |
ID | 7318200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2222620 |
End bp | 2223780 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643616991 |
Product | UDP-sulfoquinovose synthase |
Protein accession | YP_002514164 |
Protein GI | 220935265 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
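The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 2222620
END_BP = 2223780
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1161
print(expected_protein_length(length_bp))  # 386
```

Both results match the Gene Length and Protein Length rows of the record.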
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAGGTTC TTGTTCTGGG TGGTGACGGA TTTTGCGGCT GGCCGACTTC CTTGTATCTC TCGGCGGAAG GCTACGAGGT ATCCATAGTC GACAACCTGT CGAGAAGAAA GATAGATATC GAACTTGAGT GCGACTCCCT GACGCCAGTA GCTCCAATGG GTGAACGCAT CAGGGCATGG AAAGAGGTGT CAGGTAAGAC CATCGAGTTC CATAACCTTA ATGTCGCGAG GAATTACCAA AGGCTGCTCG ATCTACTGGT ATCGTGGAAA CCAGATGCCG TAGTGCATTT CGCTGAACAG CGAGCTGCGC CCTATTCCAT GAAAGGGTCG CATGGAAAGC GGTATACCGT TGATAACAAT ATCAATGCAA CGCATAACCT CCTTGCTGCT ATTGTCGAAT CTGGGCTTGA CATTCACGTC GTACACCTGG GAACGATGGG TGTTTATGGA TATGGAACGG CAGGGATGAA GATACCCGAA GGCTATGTCA CGGTTCAGGT GCAGACCGAT GCCGGAGGTG TGGTAGAGAA GGAAATCCTT TATCCAGCTG ATCCCGGTAG CATTTACCAT ATGACCAAAA CCCAGGATCA GCTGATGTTT TATTTTTACA ACAAGAACGA TGACATCAGG GTGAGGGACC TTCATCAGGG TATTGTCTGG GGCACCCAGA CGAGCCAGAC GAGATTGGAT GAGCGCCTTA TCAATCGATT TGATTATGAC GGTGATTATG GAACTGTTCT TAATAGATTT CTGATGCAGG CGGCAATAGG CTATCCGTTG ACGGTTCACG GCACAGGCGG CCAGACGCGC GCCTTTATTC ATATTCAGGA CACTGTGCAT TGTGTTCTGT TAGCAATTCA GAATCCGCCT GAGAAGGGTG ACCGCGTACA CATCCTGAAT CAGATGACGG AAACCCATCG TGTTCGGGAG TTGGCGAAAT TGGTGTCGGA TAAGACTGGC GCAGAGATTA ATTTCGTGTC GAATCCTCGT AACGAAGAAG AGGAAAATGA CCTGCATGTT ATGAATGATA GGTTTCTGGG CTTGGGACTT CAGCCAATCA CGCTGAATGA AGGTCTACTT GAAGAGGTAA CGCAAATTGC CAAGAAATAT GCACACAGGT GTGACAAGGA AAAAATTCCA TGTGTTTCTA TGTGGAAATA A
Protein sequence | MKVLVLGGDG FCGWPTSLYL SAEGYEVSIV DNLSRRKIDI ELECDSLTPV APMGERIRAW KEVSGKTIEF HNLNVARNYQ RLLDLLVSWK PDAVVHFAEQ RAAPYSMKGS HGKRYTVDNN INATHNLLAA IVESGLDIHV VHLGTMGVYG YGTAGMKIPE GYVTVQVQTD AGGVVEKEIL YPADPGSIYH MTKTQDQLMF YFYNKNDDIR VRDLHQGIVW GTQTSQTRLD ERLINRFDYD GDYGTVLNRF LMQAAIGYPL TVHGTGGQTR AFIHIQDTVH CVLLAIQNPP EKGDRVHILN QMTETHRVRE LAKLVSDKTG AEINFVSNPR NEEEENDLHV MNDRFLGLGL QPITLNEGLL EEVTQIAKKY AHRCDKEKIP CVSMWK
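The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate DNA into protein. It rests on two assumptions: the listed gene sequence is already in coding orientation (it begins with ATG and ends with TAA), and translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The function names are hypothetical, and the demonstration uses only the first three blocks of the gene sequence.

```python
from itertools import product

# Codon table in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = clean_sequence("ATGAAGGTTC TTGTTCTGGG TGGTGACGGA")
print(gc_percent(fragment))  # 50.0
print(translate(fragment))   # MKVLVLGGDG
```

The translated fragment matches the first block of the protein sequence in the record. Passing the full gene sequence to `clean_sequence` and then to `gc_percent` or `translate` should allow the GC content and Protein sequence rows to be compared in the same way.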