Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3375 |
Symbol | |
ID | 8727128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4076680 |
End bp | 4078395 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003388182 |
Protein GI | 284038252 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.590124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG GTTTATTCGT ACTGGTAGTA CCGTTGATGG CACTGGTATC TATACTGATA CTGGGCGGAT TCGAGTGGAA ACAAGCAACA TCGCCAAGCC ATAAACAAGC ACCCAGACCC AACATTATTG TCATCATGGC CGATGATATG GGCTACTCAG ATCTGGGCTG CTACGGGGGC GAGATCCACA CACCAAATAT TGACTACCTG GCGAACAACG GCATTCGCTA CACGCAATTT TACAATACAT CGCGCTGCTG TCCAACCCGG GCGTCGTTGC TTACCGGCCT CTACAATCAT CAGGCGGGTA TCGGCAAAAT GACGGATGCC GAAGACGAGC CGGGGTATCG CGGCCATTTG ACGGAGAACA CTGTTACGCT GGCCGAAGTC CTCAAATCGG CAGGCTATCA GACGGGTATG ACCGGTAAGT GGCACGTTTC CAATACCAAT GTGCAAAAGA ATCCGCAGGA ACAGCTCGAC TGGCTGAACC ATAAGAAAGA CTATGGCGAT TTTGCGCCTA TCAGCCAGTA CCCAACCAGC CGGGGGTTCG ATAAATACTT TGGTAACATC TGGGGTGTGG TCGACTTCTT CGACCCGTTC AGTCTGGTGA GCGGTACCAA ACCGGTTAAG GAGGTGCCGA AGAACTATTA CCATACCGAC GCCATTAGCG ATACGACGGT GGCCTACATT AAATCCTTCG CCAAAACATC GTCGCCATTT TTTATCTACG TGGCCGAAAC CGCCCCGCAC TGGCCCCTGA TGGCCTTGCC TGAAGATATT GCGAAGTACA AGGATACATA CAAACCCGGT TGGGAAGCTA TTCGGAAAGC CCGCTACCGG AAAATGAGCA AGCTGGGGTT GATCGATTCG ACCAAAACGA AGCTCTCCAA ACGCTGGCAG GATAATCTGA CCTGGGCCAA CAACCCCGAT AAGGATTGGG ATGCCCGGGC AATGGCCGTT CATGCCGCCA TGATCGACCG GATGGACCAG GGAATCGGTC GTATGATCAA GACCCTGCGA GAAACGGGAC AGTTGGATAA TACGCTAATC CTGTTTTTGT CCGACAATGG GGCCAGCCCG GAGAACTGTG CGGCCTACGG TCCCGGCTTC GACCGCCCCA ACGAAACCCG CGATGGCCGT AAAATCGTGT ACGACTTGAA AAAACAGGTT CTACCCGGTG CCCAAACATC CTACGCATCC ATTGGGCAGC GGTGGGCCAA TGTGGCCAAC ACGCCTTATG CCTTCTGGAA AGCAGAATCG TATGAAGGCG GCATTCGTAC CCCGCTGGTT GCCTTCTGGC CAAAGGGAAT AACAGCCCAA AAAGGCAGTT ACAGTACGCA GGTAGGGCAC GTGATGGATT TTATGAAGAC GTTCCTCGAC CTGACCGGCG CTGCGTATCC CGCCACGTTT AAGGGACACA CCATTACCCC AACAACGGGC GTCAGCCTAC TGCCTTCCTT CAGTGGAAAG GCCTCCATTG GGCACGAGAC CTTGTTCAAC GAGCATTTTG GGGCTCGCTA CGCCCGTTCG GGCAACTGGA AACTGGTGTC GTCGAGCCGA GACAGCACCT GGAGTCTATT CAATCTGGCC ACCGATAAAT CGGAAACGCA GGATCTGGCA GCCAGATACC CCGAAAAAGT TCGTCAGCTT CAGGGCTTAT GGCAGCAGTG GGCCAGTGCG CATCAAGTAT TCCCGAAACC CGGCAGAAAG AACTAG
|
Protein sequence | MKKGLFVLVV PLMALVSILI LGGFEWKQAT SPSHKQAPRP NIIVIMADDM GYSDLGCYGG EIHTPNIDYL ANNGIRYTQF YNTSRCCPTR ASLLTGLYNH QAGIGKMTDA EDEPGYRGHL TENTVTLAEV LKSAGYQTGM TGKWHVSNTN VQKNPQEQLD WLNHKKDYGD FAPISQYPTS RGFDKYFGNI WGVVDFFDPF SLVSGTKPVK EVPKNYYHTD AISDTTVAYI KSFAKTSSPF FIYVAETAPH WPLMALPEDI AKYKDTYKPG WEAIRKARYR KMSKLGLIDS TKTKLSKRWQ DNLTWANNPD KDWDARAMAV HAAMIDRMDQ GIGRMIKTLR ETGQLDNTLI LFLSDNGASP ENCAAYGPGF DRPNETRDGR KIVYDLKKQV LPGAQTSYAS IGQRWANVAN TPYAFWKAES YEGGIRTPLV AFWPKGITAQ KGSYSTQVGH VMDFMKTFLD LTGAAYPATF KGHTITPTTG VSLLPSFSGK ASIGHETLFN EHFGARYARS GNWKLVSSSR DSTWSLFNLA TDKSETQDLA ARYPEKVRQL QGLWQQWASA HQVFPKPGRK N
|
| |