Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SYO3AOP1_1606 |
Symbol | |
ID | 6332187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfurihydrogenibium sp. YO3AOP1 |
Kingdom | Bacteria |
Replicon accession | NC_010730 |
Strand | + |
Start bp | 1661638 |
End bp | 1663728 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642657881 |
Product | TonB-dependent receptor |
Protein accession | YP_001931758 |
Protein GI | 188997507 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000000872297 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC TTGCATCAAT GGTTTTATTA TCTACTTACG TGATGTTAAG TTATTCACAT GCAGTAGAAG GCATTAGAAT TGAAAAAGTT ACAGTTGAAG AGAAAAAAGC TGAGGAAGAA GTAAGCACAA AAAAAGAACT TACACAAGAA GAGGCTAAAG TTACAAGACA GATTGACCTT GGAGAGATTT TATCAGAGTT TTATCCAGAA GTTTGGTACA TGAGAAAGGC AGGAATTGCA AATGATTTAT ACATTAGAGG ATTTGCAAAA GATAACATTA ATGTATTGAT CGATGGTTCT AAAATCTATG GTGCATGTCC AAACAGAATG GACCCCCCTT CTTTTCACGT TTCATCTCCT CAGATTGAGA ATATAACCAT AAAAGAAGGT CCATTTGATG TTGAAAATGC AGGTTCTCTT GCTGCGGTTG TAAACGTAAA AACAAAAGAC CCTAAGGAAG GAATCGGGGG AGAGATAGGG GGAACCTATG GAAGTTGGAG CTATAGGACT GGCTACGTAT GGGGAAATGT TGGAAATAAG TTTATCAAGG TTTTAGTTGG TGCATCAAAT CAATACTCTA AGCCTTATGA AAGCGGAGAG GGTAAAAAGG TTACAGAGTA TGCAGTCTAT TCTACACCAG CTGGACCAAC AGCAGGAAAG TATAAATCCC AATACATTAA CGACAAAGCA TTTAATATAG ACAGAGTATG GACAAAGTTA TTGATTACTC CAAACGATAA TGCAGAGATT AAACTTTCTT ATGCTTTTGA AAATGCAAGA AATGTATTAT ATCCGGCTTT AAAAATGGAT GCTTTGTATG ACAGAACAAA CAGATTTAAC ACAGATTTTA AACTTAAAGA TATTGGTCTT AATTTTAGCA TCTACTATAA TGAAGTAAAA CATGATATGA GAGATTCTTT TAGAACTACA GCAGGAAATT ATCCATATTC TATGAAAACT TATGCTGAGT CAAAAATGTT TGGAACTAAA CTTTCAAAAG ATTTAAATCT TTTTGGACTT AACATGACAG TTGGGATTGA TACATATTTA AGAAATTGGA AAGCGGATAA TATTCAAAAT GCATTTACAT CGACACCTAA CTACAACGAT GGTATGATAC CGGATGTTGA TATTAAAGAC ATTGGATTGT TTGTAAAAAG CAGTAAAGAT ATAGAAAAAT GGACTATCTC CGGCGGCTTA AGATACGATT ATACGTATTC AAAAGCAGAC CCAAATTCTA TGACAGCTAA TACTAACAGA GATTTATTTA TTAAAAAAAA TGGCTACAAA TTCTCTAACA CAGATAACTA CGTATCAGGT TATGCTATTG CAAAATACAA TATCAATAAA AAAGATAATG TATACGTAGG ATTTGGTAGC ACTGTAAGAG TTCCAGACCC AGAGGAGAGA TTTATATCTT TAAGCGGTAT GGCACCATGG TATGGTAATC CAGATTTAAA GCCAACAAGA AATAATGAAA TAGATGCAGG GTTTGAAGCA TTTCCAATGA ATAATGTTGG AATTAAAGGC AATATCTTTT ATAGCATGCT GCAGGATTAT ATTTATTTAC GTCAGTATTC AGATAGTGGC AAAACTTACA TAACCTATGA AAACATAAAC GCTGCGATTT ATGGTGGAGA TATTAACGGA TTTGTTGCTA TAAACGACAA AATCTCAACA GAGCTTGGGC TTGCTTATCA AATTGGTAAA AAAACAGACA AAAAAGGATT ATACACAGAC AGCGATTTAG CAGAAATTCC ACCATTAAAA GCAAGATTGG CTGTTAAATA CGATGATAGA ACTTTCTACG GACTAATAGA AGGTATATAC TCTGCAAAAC AATCAAAAGT TGACTCAGAT TTAAAAGAGC AAGAAACCGG CAGCTGGTTT ATAGTAAACG TTAAAGCAGG ATATACATAT GCAAATAGAC TATTTGTTGG TGTTGGTGTT GATAACGTAT TTGATAAATT CTACTATAAC TATCTATCTT ATGTAAGAGA CCCATTTAGA GGAATTTCTT TATCAGGAAA TCCTGTCAAA ATCCCAGAAC CCGGAAGATT TATTTACGCA AACGTTTCTT ACAGATTTTA A
|
Protein sequence | MKKLASMVLL STYVMLSYSH AVEGIRIEKV TVEEKKAEEE VSTKKELTQE EAKVTRQIDL GEILSEFYPE VWYMRKAGIA NDLYIRGFAK DNINVLIDGS KIYGACPNRM DPPSFHVSSP QIENITIKEG PFDVENAGSL AAVVNVKTKD PKEGIGGEIG GTYGSWSYRT GYVWGNVGNK FIKVLVGASN QYSKPYESGE GKKVTEYAVY STPAGPTAGK YKSQYINDKA FNIDRVWTKL LITPNDNAEI KLSYAFENAR NVLYPALKMD ALYDRTNRFN TDFKLKDIGL NFSIYYNEVK HDMRDSFRTT AGNYPYSMKT YAESKMFGTK LSKDLNLFGL NMTVGIDTYL RNWKADNIQN AFTSTPNYND GMIPDVDIKD IGLFVKSSKD IEKWTISGGL RYDYTYSKAD PNSMTANTNR DLFIKKNGYK FSNTDNYVSG YAIAKYNINK KDNVYVGFGS TVRVPDPEER FISLSGMAPW YGNPDLKPTR NNEIDAGFEA FPMNNVGIKG NIFYSMLQDY IYLRQYSDSG KTYITYENIN AAIYGGDING FVAINDKIST ELGLAYQIGK KTDKKGLYTD SDLAEIPPLK ARLAVKYDDR TFYGLIEGIY SAKQSKVDSD LKEQETGSWF IVNVKAGYTY ANRLFVGVGV DNVFDKFYYN YLSYVRDPFR GISLSGNPVK IPEPGRFIYA NVSYRF
|
| |