Gene Sros_7372 details

Gene Information

Locus tag: Sros_7372
Symbol:
ID: 8670692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 8135572
End bp: 8137113
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: N-acetylglucosamine-6-sulfatase
Protein accession: YP_003342801
Protein GI: 271968605
COG category:
COG ID:
TIGRFAM ID:
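
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; function names are illustrative only.
START_BP = 8135572
END_BP = 8137113


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1542 513
```

Both values match the Gene Length and Protein Length fields reported in the record.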


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.834841
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTTGATG CTGTCCGCAA GGCCCTGTGC CTGATCTTGC TGTCCCTCCT CTGCGCCTCA 
GGAGTCACTC CCGTCGCGGC GCAGGCCGCC GGCCGGCCCA ACATCGTCCT GATCCTCGTC
GATGACCTGG ACGATGGGGA TCTACAGAAG TTTCCGAATA TCTGGAATCA ACTCGTCCGT
GCCGGCACCA GTTTCGATCG GTTTTTCGTG ACGAACTCCT GGTGTTGCCC GTCGCGTTCG
TCGATCCTGC GTTCCCAATA CGTCCACAGC CACGGCGTTC TGACTAATAC GGCGCCTGAA
GGCGGTTTCA CGAAGTTCCA TGATTCCGAT TTGGAGCGTT CCACTCTCGG TACGTGGATG
AAATCGGCCG GCTATCGCAC CGGACTGATG GGCAAATACC TCAACCACTA TCCCGGCGGC
TCCGCCGAAC CCACGTACGT GCCGCCGGGC TGGGACGAGT GGGACGTACC GGTACGCAAG
CTCTACGAGG AGTACGGCTA CACGCTCAAC GAGAACGGCG TGCTGACGAC CCGCGGCTCC
GCGCCGGAGG ACTACCTGAC GGACGTGCTC AGCCAGAAGG CCGGGGCGTT CGTCTCCCAG
AGCCCCGATC CGTTCTTCCT GTACCTCGCT CCGACGGCGC CGCACAACCC GGCCAACCAC
GCGCCCCGGT ACGCCAACGC CTTCGCCGAC GCCATCGCCC CCCGGACCCC GTCCTTCGAC
CAGGCGGACG TGAGCGCCGA GCCCCGCTGG CTGAGCTCGC GCCCCAGGTT CAGCGCCAGG
ACGATCGAGA AAATCGACGA GCGCTACCGT CGCAGGCTCC GGTCCATGCT CGGCGTGGAC
GACATGGTCG GCGCGCTGAT CAAGACCCTG AGGGATACCG GCAAACTAGA CGATACGTAC
ATATTTTTCG CATCTGATAA CGGGTTCCAT CTGGGGCAGC ACCGGCTCGC CCAGGGCAAG
ACGACCCCGT TCGACGAGTC GATCAAGGTG CCCCTGGTGG TCCGCGGCCC GGGTGTCCGG
CCCGGCGGCG TCAACGGCGA TATGTCGGCC AACGTGGACC TCGCGCCGAC CTTCGCCGAG
CTCGCCGGGG CGCAGCTCCC CGACTTCGCC GAGGGCCGGT CCCTGCTCCC GCTCCTGCGC
GGCCAGACGC CCTCCCCGTG GCGGCAGAAC GTCCTGCTCG AGTTCTACCG CCCGACCAGC
GAGAAATCGG CCCGCCAGAC CCCGGTCCCC GCCTACCAGG GCATGCGGAC CGCCCAGAAC
ACCTTCGTCC GCTACTCGAC CGGCGAATAC CAGCTCTACG ATCTCGTCAG GGATCCGCAC
CAGCTCCACA ACCTCGCCGC CAGGGTCGCG CCCGCCGTGA TCGCCCAGTT CAACCAGCAA
CTCGACGCCC TGGCGGCCTG TTCCGGCGCC ACCTGCCGTT CGGCCGACTC CGTCCGGCCG
CCCCCGTTCG TGGGGCCGCC CTCACCCGTC GGCCCGACCT CGGTCCTCAC TCCGAGCCTG
GTCCTCACCC CGGCGTCGGT CATCGGCGCC CGGAGGCCCT GA
 
Protein sequence
MLDAVRKALC LILLSLLCAS GVTPVAAQAA GRPNIVLILV DDLDDGDLQK FPNIWNQLVR 
AGTSFDRFFV TNSWCCPSRS SILRSQYVHS HGVLTNTAPE GGFTKFHDSD LERSTLGTWM
KSAGYRTGLM GKYLNHYPGG SAEPTYVPPG WDEWDVPVRK LYEEYGYTLN ENGVLTTRGS
APEDYLTDVL SQKAGAFVSQ SPDPFFLYLA PTAPHNPANH APRYANAFAD AIAPRTPSFD
QADVSAEPRW LSSRPRFSAR TIEKIDERYR RRLRSMLGVD DMVGALIKTL RDTGKLDDTY
IFFASDNGFH LGQHRLAQGK TTPFDESIKV PLVVRGPGVR PGGVNGDMSA NVDLAPTFAE
LAGAQLPDFA EGRSLLPLLR GQTPSPWRQN VLLEFYRPTS EKSARQTPVP AYQGMRTAQN
TFVRYSTGEY QLYDLVRDPH QLHNLAARVA PAVIAQFNQQ LDALAACSGA TCRSADSVRP
PPFVGPPSPV GPTSVLTPSL VLTPASVIGA RRP
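
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. The sketch below shows one way to strip that spacing and compute a GC fraction such as the one reported in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed fraction applies to that line alone and not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, as sample input.
first_line = (
    "GTGCTTGATG CTGTCCGCAA GGCCCTGTGC "
    "CTGATCTTGC TGTCCCTCCT CTGCGCCTCA"
)

print(len(clean_sequence(first_line)))  # number of bases on that line
print(gc_fraction(first_line))          # GC fraction of that line only
```

Passing the full gene sequence to `gc_fraction` would give the whole-gene value to compare against the GC content field.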