Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_4179 |
Symbol | |
ID | 8727938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 5031006 |
End bp | 5032550 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003388964 |
Protein GI | 284039034 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.10978 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACGG GTCTGAAACC ATTATTGAGT AGTATTCTGG TATTGCTTTG TGTAGTATAT ACTGTGTTCG ACGCCGTTGA ACCGACCCGT TTAACCAGCA GTTCCGGGCA GAAAACGAGG GCTGCGGACA GTCGGCCAAA CATTGTTTTG ATCGTTGCCG ATGATCATGG CCGGGAGGTT TTAGGCTGCT ATGGCGCATT GGCTATCAAG ACGCCACACA TCGATCAGTT AGCCGCCGAC GGGGTTCGCT TTTCCAATGC GTTTTGTACA ACGGCCAGTT GCAGTCCCAG CCGCTCTGTA TTGTTGACGG GTTTGCAGAA TCATACCAAT GGAATGTATG GCCTTGAACA TCAGGAACAC CACTTCGCTT CCTTCGATAC CGTACGGTCG TTACCCGTTC TGCTTGAAAG AGCGGGATAC CGCACCGCCC GAATTGGGAA ACTGCACGTA GCGCCGGAGA AGGTGTACCA TTTTCAACAG GTACTCAAAG GTGGTGGAGT AAACGATCCG GCATCTATCG GCCGTAGCCC GGTCGAAATG GCCCGCTTCT GTTATCCTTT TCTGGAGGCC ACAACGCATA CCAGCGCACC AACGAACCAA CCGAACACAG CTCAACCCTT CTTTCTTTAC TTTGCCACGG ATGATCCGCA CCGCAGCAAC ACGGTGGCCA CCGATGGATC GCCGGTGTTT GATGGTACTA AACCAAATGT ATTCGGGAAT CGGCCAGGGG GATATCCGCA GGTGGGCGAC CATTTCTATC AGCCTCGGGA TGTACGCGTA CCGGCTTACT TACCCGACAC AAAAGCGTGC CGGGCCGAAC TGGCACAGTA TTATGAAGCC ATCAGCCGAC TGGATGCGGG CGTTGGCCGA CTGATCGACT ACCTGAAAGA CACCGGGCAG TACGATAATA CCCTGATTGT CTATCTATCC GATAATGGCG CGCCTTTTCC GGGAGCTAAA ACAACCTTGT ATGAACCGGG TATGCGGTTA CCGTGCATTG TTAAATTGCC GAAACCAAAG AAACGGGGAT TTGTCCAGGA TGCGATGATT TCCTGGGCCG ATATAACACC TACACTGCTG GATTTTGCCG GTGTCCGGCC CAGAAATTCA CCAAAGCTAG GGCGATCCTT CAAGGATATT ATCGAGCAGG AACAGGTAAC GGGTTGGGAT GAAGTATATG CCTCGCACTC GCTGCACGAA GTGACCATGT ATTACCCCAT GCGAGTGGTA CGGGAACGTC GGTATAAACT GATTTATAAC ATTGCTTATC AACTGCCGTT TCCTATGGCG CTCGACTTAT ACCACTCCTT TACGTGGCAG GATGTGCTCC GCACGAAGCA GAAATTGTAC GGCAAACGAA CGGTGAACAC CTATCTGCAT CGCCCGCGGT TCGAATTATA CGATCTGCAA ACGGACCCGG ATGAAGTGAA AAATCTGGCC GTCAATCCCC AATTTAAAGC GGTACTGGCC CGGATGCAAG CCCGGCTCAA ACGGTTTCAG CAGCAAACCC GCGACCCGTG GATGAGCAAA TGGAATGTTG AGTGA
|
Protein sequence | MVTGLKPLLS SILVLLCVVY TVFDAVEPTR LTSSSGQKTR AADSRPNIVL IVADDHGREV LGCYGALAIK TPHIDQLAAD GVRFSNAFCT TASCSPSRSV LLTGLQNHTN GMYGLEHQEH HFASFDTVRS LPVLLERAGY RTARIGKLHV APEKVYHFQQ VLKGGGVNDP ASIGRSPVEM ARFCYPFLEA TTHTSAPTNQ PNTAQPFFLY FATDDPHRSN TVATDGSPVF DGTKPNVFGN RPGGYPQVGD HFYQPRDVRV PAYLPDTKAC RAELAQYYEA ISRLDAGVGR LIDYLKDTGQ YDNTLIVYLS DNGAPFPGAK TTLYEPGMRL PCIVKLPKPK KRGFVQDAMI SWADITPTLL DFAGVRPRNS PKLGRSFKDI IEQEQVTGWD EVYASHSLHE VTMYYPMRVV RERRYKLIYN IAYQLPFPMA LDLYHSFTWQ DVLRTKQKLY GKRTVNTYLH RPRFELYDLQ TDPDEVKNLA VNPQFKAVLA RMQARLKRFQ QQTRDPWMSK WNVE
|
| |