Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0231 |
Symbol | |
ID | 8723959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 309231 |
End bp | 310736 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | protein of unknown function DUF404 |
Protein accession | YP_003385095 |
Protein GI | 284035165 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0467865 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC AAAACGCCCG TCAATCACAA TCGCAAACAC TCAATGGTAT GACTCAGTCG CAGGGTCAGG CCAATGCTAA TTTTTCGTTC AGCGATTACC AGACTGAAAA CTTTTTTGAT GAGATGTTCG CCAGTGAGAT GCAGGTGAGA GCAGGCTACG CTCCTTTCCA GCAGCGCGTT GAGCAGCTTA CCCGCGAGGA TCTTATTGGG CGACAGCATG CAGCCGAACG GGCACTCATG AGCATGGGCA TCACCTTCAA CGTTTACTCG GAAGGTGAAG GCACCGAGCG GATTATGCCC ATCGACATTA TCCCCCGTAT TATCGAATCG GCCGAGTGGG ACCGGCTCGA AGCGGGCCTC ATCCAGCGTA TTAAAGCCAT TAATATGTTT CTGGACGACG TCTACAACGA TCAGAATATT CTGAACGACG GCGTTGTTCC CCGCGACCTT ATCGAATCCA GCAAGTCGTT TTTGCCGGGC TGCTTAGGTG TAAAACCGCC CAAAGGCATC TGGTGCCACA TTACCGGCAC CGACCTGATC CGGGGCGAAG ACGGTACCAT GATGGTGCTT GAAGATAACC TTCGTTGCCC ATCGGGGGTA TCGTACATGC TCGAAAATCG CGAACTCAAT AAGCAAACCT TCCCCGATGT GCTGGCCCAG ACGGGCGTTC GGCCGGTTTC GGATTACCCA ACGCGACTGT TGCAGATGTT GCAGTACATT GCCGACCGGC CCAACCCAAC CGTAGTAGTC CTAACGCCGG GTATCTATAA CTCCGCTTAT TTCGAGCATT CGTATCTGGC TCAGCAGATG GGCGTCGAAC TGGTCGAAGC GCGTGATCTA GTTGTATCGG GTGGTTACGT AAAAATGCGC ACGACCAAAG GCTTTCAGAT CGTCGACGTG ATCTACCGCC GTATTGATGA TACATTCCTG GACCCCAAAG CCTTCAATCC CGATTCGATG ATTGGCGTAC CGGGCATTTT CGAGGTGTAC AAAAAAGGTC GTGTTGCGCT GGCCAACGCC CCCGGAACCG GTGTTGCCGA TGATAAAGTG ATTTACGCTT ACGTACCCCG CATCATTAAA TATTACATGG GCGAAGAAGC TATCATTCCC AACGTAAAAA CGTATATCTG CCGCGAAGAG GAGGACTGCG CTTACGTCAT GGAAAATATT GAAAAACTGG TGGTTAAGGA AGCCAATGAA GCGGGCGGTT ATGGTATGCT CATCGGCCCG AAGGCGACGC CGGAAGAACA CGAATTATTC CGTCAGAAGA TCAAGGACAA TCCCCGGAAT TACATCGCCC AGCCAACCAT TTCGCTGTCA CGCGTGCCCT GCATTGTGGG CGACCATGCC GAAGGCCGAC ACGTTGACCT TCGGCCGTAT ATTCTCTACG GCGACGGCGT CAACGTCATT CCCGGCGGCC TTACCCGCGT AGCCCTGCGC AAAGGCTCCC TCGTGGTCAA CTCCTCACAG GGCGGTGGCG GTAAAGACAC ATGGGTGTTG TATTAG
|
Protein sequence | MKKQNARQSQ SQTLNGMTQS QGQANANFSF SDYQTENFFD EMFASEMQVR AGYAPFQQRV EQLTREDLIG RQHAAERALM SMGITFNVYS EGEGTERIMP IDIIPRIIES AEWDRLEAGL IQRIKAINMF LDDVYNDQNI LNDGVVPRDL IESSKSFLPG CLGVKPPKGI WCHITGTDLI RGEDGTMMVL EDNLRCPSGV SYMLENRELN KQTFPDVLAQ TGVRPVSDYP TRLLQMLQYI ADRPNPTVVV LTPGIYNSAY FEHSYLAQQM GVELVEARDL VVSGGYVKMR TTKGFQIVDV IYRRIDDTFL DPKAFNPDSM IGVPGIFEVY KKGRVALANA PGTGVADDKV IYAYVPRIIK YYMGEEAIIP NVKTYICREE EDCAYVMENI EKLVVKEANE AGGYGMLIGP KATPEEHELF RQKIKDNPRN YIAQPTISLS RVPCIVGDHA EGRHVDLRPY ILYGDGVNVI PGGLTRVALR KGSLVVNSSQ GGGGKDTWVL Y
|
| |