Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_4217 |
Symbol | |
ID | 5756048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 4991555 |
End bp | 4992460 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641290573 |
Product | DNA binding domain-containing protein |
Protein accession | YP_001556635 |
Protein GI | 160877319 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1910] Periplasmic molybdate-binding protein/domain |
TIGRFAM ID | [TIGR01764] DNA binding domain, excisionase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00203903 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.426085 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCTG CCAGTGAATT GGTTTACATG AGCGCGAAGC AAGTGGCCGA GTATTTAGAT CTTAACGAGA AAAAAGTCTA CGCCATGGCC AACGACAGAA TTCTCCCCGC CACTAAAATC ACCGGTAAAT GGCTATTCCC GAAAGTGCTA ATCGACCGTT GGGTGATGGA TTCGTGTCAC AGTGGCATGC TTACCGACCG TTTGTTGATC ACCGGTAGTG ACGATCCACT CTTATCTATG CTGGTGGCGC GCTTGATGGC ACAAGTCGGT AGTCGTGAGT TGATAAGCTA CAGCGCGACA GGTTCACGCT TAGGATTAGA GTTACTCGCT AAAGGTTATG CCGATGTGTG TACCTTACAC TGGGGCAGCA TGGAGGATCG CAATATCCGT CATCCAGCCT TACTTAAAGG GTATAACAAT CATCAACAAT GGATCATGGT GCACGGTTAC TCCCGTCAAC AAGGGTTGAT CATGCGTGCC GATATGCACC ACAGATGCCA AGAGGAAGAT AAAGTCGTGA ACTTACCTTG GCGTTGGGTG AGTCGTCAGG GCGGCGCGGG TAGCCAGCAA CATTTAGAAC ATTGGTTGTT AAAGCAAGGC GCTCGCTTAG ATCAGCTAAA TGTCGTGCTG ACGGCCTATA GTGAACGCGA GCTGGCAGGT TATATCGCCC GTGGTGATGC CGATATAGGT TTTGGCTGTC AATCTGTGGC ATTGGAGAGT GGTTTGAGTT TCGTGCCACT GATTAAAGAG TCCTTCGATT TCGTTATGCC GCAAAGCATT TACTTCCGTC GTCAGCTTCA ACAACTCTTT ACTATGTTGG CGAGCGGCCA CTCGAGGCAA ATGGCGGCGC TACTGGGTGG CTATGATCTT ACCGACTGCG GACAATTACT CTGGAGTGCG AGCTAA
|
Protein sequence | MTSASELVYM SAKQVAEYLD LNEKKVYAMA NDRILPATKI TGKWLFPKVL IDRWVMDSCH SGMLTDRLLI TGSDDPLLSM LVARLMAQVG SRELISYSAT GSRLGLELLA KGYADVCTLH WGSMEDRNIR HPALLKGYNN HQQWIMVHGY SRQQGLIMRA DMHHRCQEED KVVNLPWRWV SRQGGAGSQQ HLEHWLLKQG ARLDQLNVVL TAYSERELAG YIARGDADIG FGCQSVALES GLSFVPLIKE SFDFVMPQSI YFRRQLQQLF TMLASGHSRQ MAALLGGYDL TDCGQLLWSA S
|
| |