Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4996 |
Symbol | |
ID | 8728760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 6084962 |
End bp | 6086482 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003389773 |
Protein GI | 284039843 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.317023 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGGAT TTACTAAATG GAGTGTTGTC ACCGGCATCA TCGGCTGTGT GCTGATTTGT ATCGGCATGG CTTCCCGGCC CGCTCAGCAA CCACCCAACA TCGTTTACAT TCTGGCCGAT GACTTAGGAT ATGGCGACGT GTCGGTCTAT AACCCGGCGG GAAAGATTGC CACGCCCAAT ATTGACAAAC TGGCCGCGCA GGGTATGCGC TTTACCGATG CGCACTCGCC TTCGGGTGTG TGTACGCCTA CCCGGTACTC CCTGCTGACG GGTCGTTATC CATGGCGTAG TCGTTTACCC GTGGGCGTTT TGCGCGGCTA CAGCCGAACA TTGATCGAAG CGGATCGCCC GACGGTGGCC TCATTGCTGA AAGGTAATGG GTATCAAACG GCGGTCATCG GCAAATGGCA TCTGGGTCTC GACTGGGTTC CAAAAAAAGG CAGTGAGTCG TTGCTGGCGT CGGCGGAGTA TGGCATCCAA TCGGAGATGG ACCCGGCGGT GATCGATTTC TCGCAGAATC CAGCGCATGG GCCTAATACA ATAGGGTTCG ATTATTCGTA CGTGTTGCCC GCTTCGCTCG ATATGCCGCC TTACTGTTAC CTGGAAAATC ATAAACTGAC CGAGTTACCC ACTGGCTACA CTAAAGGTAA TAAAATAGAG TCGGGCTACG CGGGTCCTTT CTGGCGCGAA GGCAGTATGG CTCCTTCCTT CGATTTTCAT GGCGTACTAC CCCGATTTGT TGAGGAAGCG GTTGGTTTTC TGAACCGACA AACGGCAAAA AAACCATTCT TTCTGTATTT GCCACTGGCG GCCCCGCACA CGCCCTGGAT GCCGACTAAA GACTATACGG GCAAATCGAA AGCGGGTGAG TACGGCGATT TCGTGCAGCA GGTCGATGCA ACAGTGGGGG AGGTGTTGGC GGCTCTCGAA AAAACGGGAC TGGCTGGCAA TACACTCGTT GTTTTTACCA GTGATAACGG ACCGTATTGG CGGGATGATT ACGTGAAGCG TTTCGACCAC AGGGCCGCTG GCGGGTTCCG GGGGATGAAA GGCGATGCGT TCGAAGGGGG GCACCGCATT CCGTTTATCG TCCGCTGGCC GGGTAAAGTG AAAGCTGGAA CGGTGAGCCA GGCCACCACA ACGCTGGCTA ATCTGACCGC TACATGCAGG GAAATTCTGG GTAAGACTAA CCCCAACCAG GATGATAGTT ACAGTATACT ATCGGTGCTT GCGGGGAAAA CCAGGGATGT ACCGAACCAA CCGGCTGTCG TGCATAGTTC ATCAATCGGC TTTTTCGCCA TTCGGAAAGG AGATTGGAAA CTAATCGAAG GGCTGGGGTC GGGCGGTTTT ACGGAACCCA AAGAAATTAA GCCTAAAGCA GGAGAGCCCG TCGGGCAGTT GTACAACCTC GCCACCGATC AGCTGGAAAC CACCAACATG TACCAGCAAC ATCCCGAAAA AGTAAAGGAA TTGACGGATT TGCTGGCGAA AATTAAAGAG GGAAAAGAAC AGTATAAGTA G
|
Protein sequence | MPGFTKWSVV TGIIGCVLIC IGMASRPAQQ PPNIVYILAD DLGYGDVSVY NPAGKIATPN IDKLAAQGMR FTDAHSPSGV CTPTRYSLLT GRYPWRSRLP VGVLRGYSRT LIEADRPTVA SLLKGNGYQT AVIGKWHLGL DWVPKKGSES LLASAEYGIQ SEMDPAVIDF SQNPAHGPNT IGFDYSYVLP ASLDMPPYCY LENHKLTELP TGYTKGNKIE SGYAGPFWRE GSMAPSFDFH GVLPRFVEEA VGFLNRQTAK KPFFLYLPLA APHTPWMPTK DYTGKSKAGE YGDFVQQVDA TVGEVLAALE KTGLAGNTLV VFTSDNGPYW RDDYVKRFDH RAAGGFRGMK GDAFEGGHRI PFIVRWPGKV KAGTVSQATT TLANLTATCR EILGKTNPNQ DDSYSILSVL AGKTRDVPNQ PAVVHSSSIG FFAIRKGDWK LIEGLGSGGF TEPKEIKPKA GEPVGQLYNL ATDQLETTNM YQQHPEKVKE LTDLLAKIKE GKEQYK
|
| |