Gene Sros_9002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9002 
Symbol 
ID8672344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9950646 
End bp9952283 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content74% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003344376 
Protein GI271970180 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.5335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACC TTCTGCCGCG TCCCGCTGTC TTTGCGCCTG CCGACGGCTC GTTCCCGCTC 
TCTCCCGGCA CCCCCGTCTC CGGTCCCGCC GAGCTGGTGG ACGGGGTACG GCTCGCGCTC
GCCGTGCTTG ATCCGCGGCC GGCGGAGGCG GGAACGATCG TGGTGGAGCG GGACGCGGCG
CTGGACGCGG AGGCCTACAC GCTGGAGGTG ACGGCCACCG GCGTGCGGAT CACGGCCGGA
GACCGCGCGG GGGCCTTCTA CGCCGGCCAG ACCCTGCGGC AGCTGCTCCC CCATGGGGCG
TTCCGGACGG TCGCCCAGGC GGGCTGGACC GTTCCGTGCG GCCGGGTGGA GGACGCGCCC
CGGTTCTCCT GGCGCGGCGT CCACCTGGAC GTGGCGCGGC ACTTCCTGCC CAAGCGCGAG
GTGCTGCGGA TGGTCGACCT CATGGCGGCG CACAAGCTGA ACCGGCTCCA TCTCCACCTG
GTGGACGACC AGGGATGGCG GGTCGAGAGC CGGGTCGCCC CCAGGCTGCA CGAGGTGGCC
TCGCACCGGC CGCGCACGAT CACCAGCCAC CACAAGGACG ACCCGGTCTA CGACGAGGTG
CCGCACGGCG GCTACTACAC GCTGGACGAC CTCGCGGAGA TCGCCGCCTA CGCGCGGGCC
AGGGCCGTGA CCGTCGTGCC CGAGATCGAC GTGCCGGGAC ACGCCTCGGC GATCCTCGCG
GCCTACCCCT CGCTCGACGC GCGGGCCACC GGCGGCCGGG AGCCCGAGCC CTTCCCCGTG
CTGGACCGGT GGGGCATCTC CCCCGCGATC CTGTCCCCGC TCCCCCCGAC GGTCGACTTC
CTGACCTCGG TGATCGACGA GATCCGCGGC GCGCTGGGCG AGACGCCTTA TGTGCACCTC
GGCGGGGACG AGTGCGTGCT CGACGACTGG GCTGCTTCAG CGGAGATCGT CGCGTTCCAG
GAGGAGCTCG GCCTGGAGAG CCTGAGCGGC CTGCACGCCT GGTTCCTGCG CCGGCTGGCG
GACCTGCTGG CCGAGCGGGG CAGCCGGGCG ATCGTCTGGG ACGAGGCGTT CGTCAGCGGC
ATGCTGCGGC CCGACACGAT CGTGATGCCG TGGCGCGGGC CGGGCGTGGC CCGGCGCGCC
GCCGAGGCCG GGCACGACGT GGTGCAGACA CCGGTCTTCC CGCTGTACTT CGACTACGCC
GAGACCTCCT CGGAGGAGGA GCCGCTCGCC ATCGGCGACG CGATCACCGT GTCGGACGTC
GCGACCTTCG CTCCCGCACC GGAGTCGTGG ACGGCCGAGC AGCGGGAGCA CGTGCTCGGG
GCGCAGTTCC AGCTGTGGAG CGAGCGCCTG CCCGACGGCC GGGCGGTGGA CTACCGCGCC
TGGCCGCGAG GCTGCGCGCT GGCCGAGGTC GTCTGGTCCG GCTCGGCCGG GCCCGGATTC
GGGGAGCGGC TCGAAGGGCA CCTGGGCCGC CTGGACGCGC TGGGGGTCGA GTACCGCCCG
CCGGCAGGCC CGCGCCCCTG GCAGCTCGGC GGCACCGGGC GCCGCCGCCA CCGCCCGGGG
GTCGTCAAGG TGGACGAGGT CATGGGGCAC CTGGAGGAGA TGACCCACCT CGCCGACTCC
ACCCGGCCGA GCATGTGA
 
Protein sequence
MTDLLPRPAV FAPADGSFPL SPGTPVSGPA ELVDGVRLAL AVLDPRPAEA GTIVVERDAA 
LDAEAYTLEV TATGVRITAG DRAGAFYAGQ TLRQLLPHGA FRTVAQAGWT VPCGRVEDAP
RFSWRGVHLD VARHFLPKRE VLRMVDLMAA HKLNRLHLHL VDDQGWRVES RVAPRLHEVA
SHRPRTITSH HKDDPVYDEV PHGGYYTLDD LAEIAAYARA RAVTVVPEID VPGHASAILA
AYPSLDARAT GGREPEPFPV LDRWGISPAI LSPLPPTVDF LTSVIDEIRG ALGETPYVHL
GGDECVLDDW AASAEIVAFQ EELGLESLSG LHAWFLRRLA DLLAERGSRA IVWDEAFVSG
MLRPDTIVMP WRGPGVARRA AEAGHDVVQT PVFPLYFDYA ETSSEEEPLA IGDAITVSDV
ATFAPAPESW TAEQREHVLG AQFQLWSERL PDGRAVDYRA WPRGCALAEV VWSGSAGPGF
GERLEGHLGR LDALGVEYRP PAGPRPWQLG GTGRRRHRPG VVKVDEVMGH LEEMTHLADS
TRPSM