Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_1572 |
Symbol | |
ID | 8725306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 1897440 |
End bp | 1900676 |
Gene Length | 3237 bp |
Protein Length | 1078 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | glycosyl hydrolase, BNR repeat-containing protein |
Protein accession | YP_003386420 |
Protein GI | 284036490 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0452612 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAATA GAAAATTTAC ATTTAGTATA ATCGCTTTGG CAATTGTCTG TTCATCTGTC GATCTAGCCT TTGCCCAGCG GAAAAGTAAA CCAGTTGCTA AATTAACATC GGCACAATCG ACTGACAGCC TCATCAACGG CTTGCGGTGG CGTAACATCG GCCCCTTTCG CGGTGGGCGG TCACTGGCTG TTTCCGGGCA TTCGAGCCAG CCGCTTACGT ATTATTTTGG TGCTACGGGC GGTGGCGTCT GGAAAACGGT CGATGGCGGG GCCAACTGGT TCATGGTCTC GGATAGTACC TTTAAATCCA GCTCAGTAGG TGCAGTTGCC GTGGCCCCTT CCGATCCGAA CACGATTTAC GTGGGCATGG GCGAAGCCGA TATCCGCAGC AACATTGCCA ATGGCGATGG AGTCTATAAA TCGACCGACG CGGGTAAAAC CTGGAAACAT ATTGGTCTGA AAATGGCCGA TGCGGTCGCT AGTATTGAGG TTCATCCGAC CAATCCCGAT GTGGCGTATG TGGCCGCTTT GGGGAATCCG TTTGCGCCTA ACAAAGAGCG GGGCCTGTTT CGTACCACCG ATGGCGGTAA AAGCTGGAAG GCCATTCTGA CAAAAAACGA TAGTACCGGT GCTATTGTCG TGAAACTCGA TCCTAACAAC CCGTCGATCG TTTACGCGTC GATGTGGCAG GCGTACCGCA ATAGCTATAT GATGAGCAGT GGCGGTCCCG GTTGCGGTTT GTATAAATCG ACCGACGGGG GCGACACCTG GACTAACCTG AGCACCAAAC CCGGTATGCC CAAAGGGCTG TTGGGCAAAA TTGGCATTAC CGTTTCGCCC GCCAACTCGA ACCGGCTTTA TGCCATGGTC GAGAACGCTA AGGGTGGTTT ATACCGCTCC GACGATGCCG GTGAATCTTG GCAGTTGATC AACGAAGACA AGAATCTCTG GCAGCGGCCC TGGTATTATA TGATGCTAGC CGCCGATCCC CAAGACGAAA ACGGCTTGAT CGTACTGAAC GTGAATGCCC TGAAATCGTA TGATGGCGGT AAGACATTTT CGACCATTGG CGTACACCAC GGCGATACGC ATGACATTTG GTGGAACCCC AAAAATCCGC AGAACTTCAT CATTGCCGAC GATGGTGGTG CCGAAGTGAC CTATAATGGT GGGGCAACCT TTTCCGATAT TGATATTCCA ACCTCCCAGT TTTACCACGT AGCCGTTGAT AATGATTTTC CCTACAACCT CTACGGAGCC CAGCAGGATA ACTCTTCTAT CCGGATTGCC AGCCGCACTA CCGAGTATTC CATTGGCAAA TCGGCCTGGT ACCCGGTATC GGGTGGTGAG TCTGGTTATA TCACACCCGA CCCGAACAAT CCGAACGTGA CCTACGGCGG TAGCTATGAT GGCCTGATTA CCCGCTACGA TAAAGCGACG GATCAGAATC AGGTTATCAA TGTGTATCCA GAGTACTTCA TGGGAGCCCC TTCATCAGCG CGGAAGTACC GTTTTCAGTG GACGTACCCC ATCGTTTTCT CGCCCCACGA CAGTAAAACG CTGTACATCA CCTCGCAGTA CCTGCACCGG AGTACGGACA ATGGCCATTC GTGGCAGGAC ATCAGCCCCG ATCTAACCCG TAATGACCCC AAAACGCAGG GCGATACGGG CGGCCCGATC ACCAAAGACA ATACGGGGGC CGAAACCTTC CCCACGATCT TCACCTTTGC CGAATCGCCG GTAGAAAAAA ACATCTTCTG GGCCGGTTCC GACGATGGGC TGATGCACAT TAGTCGGGAC GGCGGCAAAA ACTGGCAGAA TATCACCCCC CCGGTGTCCA TGCTGCCCGA AATGGCAATC ATGAGCATGG TTCACCCGTC TGACCATACG GCGGGTAAGG CCTATCTGGC GGCCAAGCGG TACATGTCCG GCGACCGTAA ACCCTATATG TTCAAAACGA CCGATTATGG CAAAACCTGG ACGAGCATCA CGGCGGGTAT TCCATCGGAC GAGCATTGCC ATGTAGTGCG CGAAGACCCG AATAAACCTG GTCTGTTATA CGCTGGAACC GAGCGTGGGG TGTATGTATC CTTCAACGAT GGCGGTTCGT GGGAGAAGTT AAGCATGAAC CTGCCCGTTA CGCCTGTGCG TGATTTGCAA ATTCAGAAGC GCGAAAAAGA TCTGGTCATT GCCACTCACG GGCTGGCATT CTGGATCATG GACGACATTA CACCGCTGCA TGAACTGATG GATGTGAAAC AGGGCGGTAA ATCGGTGGCT CACATGCATT TGTTCAAACC TCGCCATGCC TATCGCATGG AAGGCGGTGC CGGAAACAGC CGTCGGGGTC GTGCTGCGCC ATCTGACGAA GGCGAAAATG CTCAAAATGG CGTCATTACG CGTTATTATC TGAAAAGCAA GCCGACCAAA GAACTACGGC TTATCTACAT GACAACCGCT GGTGATACCA TCAGTGCCTA CTCCAGCACG AAGGATAAAA AAGGGCAACC GCTCAAAATT GCGAAGGAGT TTTATCAGGA CGAAACCGCA ACCCGCCCCG GCTCCATTCC CGCCAGTCCG GGAATGAACG TGTTCGTTTG GGATATGCGT TACCCCGATG CCACCGCCGT TGACGGCATC AACGTGATGT GGACGGGTCG CGGAACGGGT GCTAAAGTCA TTCCAGTAAT GTATAAAGTA CGGATGTTGC TGGGCGATTC GCTGATCTCG GAGCAACCCA TAGAAATCCG GAAAGACCCT CGCTTAGCGA TAACAGCCGC CGAGTACCAG GAACAGTTTG ACCTGTTGCA GAAAATCAAC GGAAAACTGT CCGAAACGCA CAAAGCGATC AACCAGCTTC GGCAGATTCG CACCCAGATC AACGGGTATA CGAGCGGGGT AAAGGAACCG AAAGTAGCCG AGAAATTTAA AAACGCGGCT AAGCCCATTC TGGACGAACT GGACAAGATC GAATCGACGC TGATGCAGCC GAAGTCGAAA GCACCGCAGG ACGCGCTGGC GTATCCAATC CGATTGAACG ACAAAATTGC AGGTGTGGCC TCGGTGGTTT CCTCGGCCGA TACCAAGCCA ACCAAGTCGT CTTACACCGC ATACGATGAC CTGTCCAAAC AAATTGACAC GGCACTGACC AAGCTGAAAG AAGTCATCAA CACCCAGGTG CCGGGCTTCA ACAAAATGGT AACCGAACAG CAGGTACCCG CTATTATTTT GAATTGA
|
Protein sequence | MVNRKFTFSI IALAIVCSSV DLAFAQRKSK PVAKLTSAQS TDSLINGLRW RNIGPFRGGR SLAVSGHSSQ PLTYYFGATG GGVWKTVDGG ANWFMVSDST FKSSSVGAVA VAPSDPNTIY VGMGEADIRS NIANGDGVYK STDAGKTWKH IGLKMADAVA SIEVHPTNPD VAYVAALGNP FAPNKERGLF RTTDGGKSWK AILTKNDSTG AIVVKLDPNN PSIVYASMWQ AYRNSYMMSS GGPGCGLYKS TDGGDTWTNL STKPGMPKGL LGKIGITVSP ANSNRLYAMV ENAKGGLYRS DDAGESWQLI NEDKNLWQRP WYYMMLAADP QDENGLIVLN VNALKSYDGG KTFSTIGVHH GDTHDIWWNP KNPQNFIIAD DGGAEVTYNG GATFSDIDIP TSQFYHVAVD NDFPYNLYGA QQDNSSIRIA SRTTEYSIGK SAWYPVSGGE SGYITPDPNN PNVTYGGSYD GLITRYDKAT DQNQVINVYP EYFMGAPSSA RKYRFQWTYP IVFSPHDSKT LYITSQYLHR STDNGHSWQD ISPDLTRNDP KTQGDTGGPI TKDNTGAETF PTIFTFAESP VEKNIFWAGS DDGLMHISRD GGKNWQNITP PVSMLPEMAI MSMVHPSDHT AGKAYLAAKR YMSGDRKPYM FKTTDYGKTW TSITAGIPSD EHCHVVREDP NKPGLLYAGT ERGVYVSFND GGSWEKLSMN LPVTPVRDLQ IQKREKDLVI ATHGLAFWIM DDITPLHELM DVKQGGKSVA HMHLFKPRHA YRMEGGAGNS RRGRAAPSDE GENAQNGVIT RYYLKSKPTK ELRLIYMTTA GDTISAYSST KDKKGQPLKI AKEFYQDETA TRPGSIPASP GMNVFVWDMR YPDATAVDGI NVMWTGRGTG AKVIPVMYKV RMLLGDSLIS EQPIEIRKDP RLAITAAEYQ EQFDLLQKIN GKLSETHKAI NQLRQIRTQI NGYTSGVKEP KVAEKFKNAA KPILDELDKI ESTLMQPKSK APQDALAYPI RLNDKIAGVA SVVSSADTKP TKSSYTAYDD LSKQIDTALT KLKEVINTQV PGFNKMVTEQ QVPAIILN
|
| |