Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_7372 |
Symbol | |
ID | 8670692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 8135572 |
End bp | 8137113 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | N-acetylglucosamine-6-sulfatase |
Protein accession | YP_003342801 |
Protein GI | 271968605 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.834841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTGATG CTGTCCGCAA GGCCCTGTGC CTGATCTTGC TGTCCCTCCT CTGCGCCTCA GGAGTCACTC CCGTCGCGGC GCAGGCCGCC GGCCGGCCCA ACATCGTCCT GATCCTCGTC GATGACCTGG ACGATGGGGA TCTACAGAAG TTTCCGAATA TCTGGAATCA ACTCGTCCGT GCCGGCACCA GTTTCGATCG GTTTTTCGTG ACGAACTCCT GGTGTTGCCC GTCGCGTTCG TCGATCCTGC GTTCCCAATA CGTCCACAGC CACGGCGTTC TGACTAATAC GGCGCCTGAA GGCGGTTTCA CGAAGTTCCA TGATTCCGAT TTGGAGCGTT CCACTCTCGG TACGTGGATG AAATCGGCCG GCTATCGCAC CGGACTGATG GGCAAATACC TCAACCACTA TCCCGGCGGC TCCGCCGAAC CCACGTACGT GCCGCCGGGC TGGGACGAGT GGGACGTACC GGTACGCAAG CTCTACGAGG AGTACGGCTA CACGCTCAAC GAGAACGGCG TGCTGACGAC CCGCGGCTCC GCGCCGGAGG ACTACCTGAC GGACGTGCTC AGCCAGAAGG CCGGGGCGTT CGTCTCCCAG AGCCCCGATC CGTTCTTCCT GTACCTCGCT CCGACGGCGC CGCACAACCC GGCCAACCAC GCGCCCCGGT ACGCCAACGC CTTCGCCGAC GCCATCGCCC CCCGGACCCC GTCCTTCGAC CAGGCGGACG TGAGCGCCGA GCCCCGCTGG CTGAGCTCGC GCCCCAGGTT CAGCGCCAGG ACGATCGAGA AAATCGACGA GCGCTACCGT CGCAGGCTCC GGTCCATGCT CGGCGTGGAC GACATGGTCG GCGCGCTGAT CAAGACCCTG AGGGATACCG GCAAACTAGA CGATACGTAC ATATTTTTCG CATCTGATAA CGGGTTCCAT CTGGGGCAGC ACCGGCTCGC CCAGGGCAAG ACGACCCCGT TCGACGAGTC GATCAAGGTG CCCCTGGTGG TCCGCGGCCC GGGTGTCCGG CCCGGCGGCG TCAACGGCGA TATGTCGGCC AACGTGGACC TCGCGCCGAC CTTCGCCGAG CTCGCCGGGG CGCAGCTCCC CGACTTCGCC GAGGGCCGGT CCCTGCTCCC GCTCCTGCGC GGCCAGACGC CCTCCCCGTG GCGGCAGAAC GTCCTGCTCG AGTTCTACCG CCCGACCAGC GAGAAATCGG CCCGCCAGAC CCCGGTCCCC GCCTACCAGG GCATGCGGAC CGCCCAGAAC ACCTTCGTCC GCTACTCGAC CGGCGAATAC CAGCTCTACG ATCTCGTCAG GGATCCGCAC CAGCTCCACA ACCTCGCCGC CAGGGTCGCG CCCGCCGTGA TCGCCCAGTT CAACCAGCAA CTCGACGCCC TGGCGGCCTG TTCCGGCGCC ACCTGCCGTT CGGCCGACTC CGTCCGGCCG CCCCCGTTCG TGGGGCCGCC CTCACCCGTC GGCCCGACCT CGGTCCTCAC TCCGAGCCTG GTCCTCACCC CGGCGTCGGT CATCGGCGCC CGGAGGCCCT GA
|
Protein sequence | MLDAVRKALC LILLSLLCAS GVTPVAAQAA GRPNIVLILV DDLDDGDLQK FPNIWNQLVR AGTSFDRFFV TNSWCCPSRS SILRSQYVHS HGVLTNTAPE GGFTKFHDSD LERSTLGTWM KSAGYRTGLM GKYLNHYPGG SAEPTYVPPG WDEWDVPVRK LYEEYGYTLN ENGVLTTRGS APEDYLTDVL SQKAGAFVSQ SPDPFFLYLA PTAPHNPANH APRYANAFAD AIAPRTPSFD QADVSAEPRW LSSRPRFSAR TIEKIDERYR RRLRSMLGVD DMVGALIKTL RDTGKLDDTY IFFASDNGFH LGQHRLAQGK TTPFDESIKV PLVVRGPGVR PGGVNGDMSA NVDLAPTFAE LAGAQLPDFA EGRSLLPLLR GQTPSPWRQN VLLEFYRPTS EKSARQTPVP AYQGMRTAQN TFVRYSTGEY QLYDLVRDPH QLHNLAARVA PAVIAQFNQQ LDALAACSGA TCRSADSVRP PPFVGPPSPV GPTSVLTPSL VLTPASVIGA RRP
|
| |