Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_4036 |
Symbol | |
ID | 8885237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4304941 |
End bp | 4306320 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003512781 |
Protein GI | 291301503 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0188862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.130927 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCGCC GACCGAACGT CATCGTGATC CTGTCCGACG ACCACGGCTA CGCCGACCGC TCCGCGCTCG GCGTCCACGA CAACGTCCAC ACCCCGGCCC TCGACCGGCT GGCCGCCGAG GGCGTGTCCT GCGACAACGC CTACGTCGCC GCGCCGATCT GCAGCCCGTC GCGGGCCGGG CTCATGTCGG GGCGCTACCC GTTGTCGTTC GGCACCACGT GGTTCGACAA CTCGCGGCTC CCCGACGACT CACCGACGCT CGCCGAGCGC TTCAAGGAGC GCGGCTACAC CACCGGCTAC TTCGGCAAAG TCCACTACGG ACCCGAGCAG CTCGGTGACC ACGCCTGCCC ACCGCACCAC GGCTTCGACG AGACCCGGTA CGGCCTGGCC GGACAGTCCC AGGGGCGGCT GCACTACCTG CGGCACTCGC GCGCCGAATA CGAGGCGCGC GGCGAGGCGG GCTGGCGGAT GGGCACCCAG CCGCTGCTGG AGGGCGACGA CGAGTACGAG ACCGAGGACT TCCTGACCTG GGATCTCGGT CAGCGGGCCC GCGACTTCGT CACAGGCCAC GCCGGGGACG CGAACCCGTT CTTCCTGATG CTGGCCTTCA ACGCCGTCCA CAACTTCTGC TGGCAGCTGC CCGAGGCCGA ACGGCGTAAA CGCGGCCTGT CCGAGTACCA CGACTGGGAC CCCGAGACCC GAAGCTACTT CGACTGGTAC GACGACGTCG TGGCGCCCAA TCTGGACAAA GGCCGGGAGT ACTATCTGGC GCAACTGGAA CTGATGGACG CCGAGATCGG TCGTCTGATG GACACCGTGG ACGCCAACGG TCTGCGCGAG GACACCATCG TGGTCTACCT GACCGACAAC GGCGGCTCGC ACTGCAACCA CGGCGACAAC ACGCCGCTGG CCGGTTCCAA GTACACGCTG TTCGAGGGCG GGATCCGGGT GCCGTTCCTG GTGCGCTGGC CCGGCGGCGG CGTCCCCGCT GGCGAACACC GCGACGGCTT GATCTCCGCG TTGGACCTGT ACCCGAGCCT GCTGGCCGCC GCCGGGGGAG ATCCCGGCGA CGGCCACGGC GTCGACCAGT GGGCCATGCT GCGCGGCGAG ACCGACGCGG GACATGAAGC GCTGCACTGG GACTGCGGTT TCCAGTACGC GACCAGAAGC GGGGCCTGGA AGCTGCGCTA CGCCGACGGC GAGTCCGACG AGGTGCGGGG GCTGCTTCAG TACGAGCACA CCGACCTGGG CGCCGGACTG TTCCTGTACA ACCTGGATGA TGACCCCGCC GAGACCCGCA ACCTCGCCGA CGCTCACCCC GACAAGCTCG CGGAGCTACA GCGGCTGCGG CACGACTGGC GCGCGACGAT GCTGAGCTGA
|
Protein sequence | MSRRPNVIVI LSDDHGYADR SALGVHDNVH TPALDRLAAE GVSCDNAYVA APICSPSRAG LMSGRYPLSF GTTWFDNSRL PDDSPTLAER FKERGYTTGY FGKVHYGPEQ LGDHACPPHH GFDETRYGLA GQSQGRLHYL RHSRAEYEAR GEAGWRMGTQ PLLEGDDEYE TEDFLTWDLG QRARDFVTGH AGDANPFFLM LAFNAVHNFC WQLPEAERRK RGLSEYHDWD PETRSYFDWY DDVVAPNLDK GREYYLAQLE LMDAEIGRLM DTVDANGLRE DTIVVYLTDN GGSHCNHGDN TPLAGSKYTL FEGGIRVPFL VRWPGGGVPA GEHRDGLISA LDLYPSLLAA AGGDPGDGHG VDQWAMLRGE TDAGHEALHW DCGFQYATRS GAWKLRYADG ESDEVRGLLQ YEHTDLGAGL FLYNLDDDPA ETRNLADAHP DKLAELQRLR HDWRATMLS
|
| |