Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_0436 |
Symbol | |
ID | 5752153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | + |
Start bp | 500726 |
End bp | 501766 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641286701 |
Product | hypothetical protein |
Protein accession | YP_001552877 |
Protein GI | 160873561 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR00661] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.574558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00210413 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGAATAC TCTACGGAGT TCAAGGCACA GGGAATGGCC ACTTAAGTCG TGCACGCGTC ATGGCTAAAG CACTGATGAA ACAAGATATT GAAGTGGACT TTTTATTTTC GGGGCGTAAG CCTGAACAGT TTTTCGATAT GGAATGTTTT GGCGATTATC GCGTACAAGC GGGCATGACG TTTGCGACTC ACTCTGGGCG CGTGAATGTG CCGCAAACAG TTCGGCAGAA TCTGTCACTA TCGTTATTTA AAGATATTAA AGCCTTAGAT CTCAGTTGTT ACGATCTCGT GCTCAACGAC TTTGAACCAG TATCGGCTTG GGCGGCGCGG CGCCAAGGCG TGCCATCCAT TGGTATTAGT CATCAGGCGG CGTTGACTCA TCCTGTGCCG AAACTCGGTA GCACTTGGTT TAATGAAATG CTGCTCAATA ACTTTGCCCC TGTGGATGTG GCGTTGGGTT GTCATTGGCA CCACTTTGGT TTTCCTATCT TGCCGCCTTT TGTCGAAGTG GATGCCAGCC CATTCGAACA CACGCATCAA ATCTTAGTGT ATTTGCCATT TGAGGATGCG GATGTGATTG CTCGTTTCCT GTCGCATTTC AGTGACTATC AGTTTTTGGT GTACCACAGC CAACAGCCTA AGGGACAAGT TGCCGAACAC ATTAAGTGGC ACGGCTTTAA TCGTGAGGGC TTTAAGCAAC ATCTTGCCAG CTGCGGCGGT GTGATAGGCA ATGCGGGCTT TGAGCTTGCC AGCGAAGCAT TAACCTTAGG TAAAAAGTTG TTAGTGAAAC CACTGATTGG CCAGTTTGAG CAATTATCGA ATGTGGCGGC GCTGCAACTC TTGGCGGCCG CAGATAGCAT GATGAGTCTC GATGTGAACG TTGTGAAGCG CTGGTTGAAA ACCGCATCAC CTAATCCGAT TGCCTATCCT CAGGTTGGGG ATGCGCTGGT TAAATGGATT GATGGCGGCG ATTGGCACGA TAGTCAGCCG CTATGTAAAG AGCTGTGGAG CCAAGTTACG CTGCCTGATA CCTGGCGTTA A
|
Protein sequence | MRILYGVQGT GNGHLSRARV MAKALMKQDI EVDFLFSGRK PEQFFDMECF GDYRVQAGMT FATHSGRVNV PQTVRQNLSL SLFKDIKALD LSCYDLVLND FEPVSAWAAR RQGVPSIGIS HQAALTHPVP KLGSTWFNEM LLNNFAPVDV ALGCHWHHFG FPILPPFVEV DASPFEHTHQ ILVYLPFEDA DVIARFLSHF SDYQFLVYHS QQPKGQVAEH IKWHGFNREG FKQHLASCGG VIGNAGFELA SEALTLGKKL LVKPLIGQFE QLSNVAALQL LAAADSMMSL DVNVVKRWLK TASPNPIAYP QVGDALVKWI DGGDWHDSQP LCKELWSQVT LPDTWR
|
| |