Gene Information |
Locus tag | Slin_3392 |
Symbol | |
ID | 8727145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4099447 |
End bp | 4100877 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_003388199 |
Protein GI | 284038269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
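The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not appear in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Slin_3392.
length_bp = gene_length(4099447, 4100877)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the computed values agree with the Gene Length (1431 bp) and Protein Length (476 aa) fields of the record.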
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.428431 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.278741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTACCT CGTTATCGGC CTTTACGGCC TTTGGAACGT TACTGTTCAT TTTTGTTTTA GCTGTGTCCA TCTCTTTTCA GTCAATGGCG CAGAACGGGC AGATCGTTTA CAAAAACTAC TGTGCCGGAT GTCATGGGGC CCAATTGCAA GGCAGCGCAG GGCCCGCCCT CATTAAAAAG GAATGGAAAC ACGGGGGCGA CCGCAACTCA TTGCTCAAAA CCATCCGGAA TGGCGTTCCA TCGACCGAGA TGATTAAGTT TGAAGGCGTT CTTTCGGCTA AAGAGATCGA GGCCGTCACT GATTTTATCC TGAATGCCCA AACTTCGCCG GAGCTGGTCA AAAAGACCGA CTTACCCCTT CGCGTCGATA CAAAGCTCTA CAAGCTCAGG ATTGAAAAAC TGGTGACGGA GGGGTTGACC AACCCCTGGG GTATCGAGTT TATCGATGCT AACCATGCGC TTGTTACCGG CAAGAACGGG GAGCTCTTTT CGTTGGTCAA TGGTAAACTC GACAATCAGA AAATTACCGG TCTACCCAAG ACGTACGGCA CAGACCTTGT TGGTGGGATG ATGGATCTGG CGCTCGATCC CGAGTACAAA ACCAATGGCT GGATTTACAT TGGATTTAGT TATAACCCGC AAAATAGCCC GGACAAGAAG ACGCCGGGCA TGACAAAGAT CGTCCGGGGA AAGGTCCGTG AACATCAATG GGTGGACGAA CAGACCTTGT TTCAGGTCCC CGACTCGCTG CTGGTGACCG GCGGTACGCG TTGGGGGTGC CGTTTTCTGT TCGACAAACA GGGGCACCTC TTTTTTACCA TCGGGGATAT GAATCGCGCC GGCGATTCCC AGATTTTGTC GAGACCGTCC GGCAAGATTT ATCGAATTAA TCCCGACGGG TCTATTCCGA AAGATAATCC GCTGTATGGC AAAGCGAACT ACTTACAGGC CATTTACAGC TGGGGCAACC GCAATGCACA GGGCCTCGCC CAACACCCGC AGACCGGCGT CATCTACGCG AGTGAACACG GGCCGCAGGG GGGCGACGAA CTGAATATTC TAAAAAAGGG CGCTAATTAC GGATGGCCGG TGATTACTTA TGGTATCGAC TACAACGGTA GCATTATCAC CACCGAAACC CAACGGGAGG GAATGGAACA GCCTATTACC TACTGGACGC CTTCCATCGC CGTCAGTGCC ATTGAATTTG TGACCGGTAA CCGCTTTCCG CGCTGGCAGA ATAACTTGCT GGTAACCGCG CTCAAGTTTG AAGAAATCCG CCGACTGGTG GTCAGGGACG ATAAGGTGAC GGAACAGGAG GTGCTTCTGA AAGGCTATGG CCGGGTACGG GACTTGAAGA TCGGTCCCGA CGGTGCCCTG TACGTGCTGA CCAACAGCCC GGATGCCCTT TTACGAATTA CGCCCGAATA A
|
Protein sequence | MRTSLSAFTA FGTLLFIFVL AVSISFQSMA QNGQIVYKNY CAGCHGAQLQ GSAGPALIKK EWKHGGDRNS LLKTIRNGVP STEMIKFEGV LSAKEIEAVT DFILNAQTSP ELVKKTDLPL RVDTKLYKLR IEKLVTEGLT NPWGIEFIDA NHALVTGKNG ELFSLVNGKL DNQKITGLPK TYGTDLVGGM MDLALDPEYK TNGWIYIGFS YNPQNSPDKK TPGMTKIVRG KVREHQWVDE QTLFQVPDSL LVTGGTRWGC RFLFDKQGHL FFTIGDMNRA GDSQILSRPS GKIYRINPDG SIPKDNPLYG KANYLQAIYS WGNRNAQGLA QHPQTGVIYA SEHGPQGGDE LNILKKGANY GWPVITYGID YNGSIITTET QREGMEQPIT YWTPSIAVSA IEFVTGNRFP RWQNNLLVTA LKFEEIRRLV VRDDKVTEQE VLLKGYGRVR DLKIGPDGAL YVLTNSPDAL LRITPE
|
| |
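The sequence fields above can be processed with a few lines of code. The following is a minimal sketch, not the database's own tooling: it strips the whitespace that separates the 10-base blocks, translates a coding sequence, and computes the GC fraction. It assumes that the displayed gene sequence is already in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; it differs only in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def reverse_complement(dna: str) -> str:
    """Reverse complement, needed only when starting from the opposite strand."""
    return clean(dna).translate(str.maketrans("ACGT", "TGCA"))[::-1]


# First three blocks of the gene sequence from the record.
print(translate("ATGCGTACCT CGTTATCGGC CTTTACGGCC"))  # MRTSLSAFTA
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.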