Gene Sros_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2191 
Symbol 
ID8665473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2356051 
End bp2357505 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content74% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003337917 
Protein GI271963721 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGT TCATCGCGGA CCCCAGCGAT CGGGGGCTGC GTCGTCTCGC GGCCGGCACG 
TTGCTCGTCG CCTTCCGGGG CACCACCGCC CCCGAATGGG TCCTCCGGGA CCTGGAGAAC
GGTCTTGGCG GCGTCACCCT CTTCGGCTTC AACGTCGCCG ATCCCGCCCA GCTCTCCGGC
CTCACCGCCC GCCTGCGCGA GGCCGGTGAA CCGGTCATCT CTCTGGACGA GGAGGGCGGC
GACGTCACCC GGCTCGACTA CCACGTGGGC AGCCCCTACC CGGGCAACGC CGCCCTCGGC
GCCGTCGACG ACGTCGAGCT GACCCGGCGC GTCTACCGGT CCATCGGTGA CGACCTGGCC
CGCTGTGGCG TCAACCTGGA CATGGCGCCG TCCGCCGACG TGAACACCGC CGACGACAAC
CCGGTGATCG GCACCCGATC CTTCGGGCCG GACACCGCGC TGGTCGCCCG GCACACCGTC
GCCGCCGTCC ACGGCCTGCA GGCGGCGGGC GTGGCCGCCT GCGTCAAGCA CTTCCCCGGT
CACGGCGCCA CCCGCCAGGA CTCCCACCTG GAGATCCCGC TCGTCGACGC CTCGCTCGAA
CTGCTGCGGG AACGCGAGCT CGTGCCCTTC CGCGCCGCGA TCGACGCCGG GACCAGGTCC
ATCATGACCG CGCATGTCCG GGTCCCCGCC CTGACCGGCA CGACCCCCGC CACGCTGTCG
GCCGCCGCCC TGACCGTCCT GCTCCGCGGC GAGCTGGGCT ACGACGGCGT GATCGTCTCC
GACGCCCTCG ACATGAAGGC CGTCACCGAC ACCTACGGCC TGGCCGGCGG CTCGGTGCTC
TCGCTCGCCG CCGGGACCGA CCTGCTCTGC CTCGGCCCAC TGCCGACCGA GGACGACGTC
CGCCGGATCA TCACCGAGAT CGTCGCCGCG GTCGGGGACG GCCGGCTGCC GCTGGCCCGC
CTGGAGGCCG CCGCCGAGCG GGTGGCCCGC CTGCGCGCCT GGTTCGGCAC GTCCCAGACG
GGCCAGGCGG AGAAGAACGT CATCGGCCTG GCCGCCGCCC GCCGCGCGGT CAGCCTCACC
GGCTCGGCGT CCCCGCTGGT CGACCCGCTG GTGGTCGAGG TGGACACCCC GCCGACCATC
GCGGTCGGCG ACGTGCCGTG GGGCTTCGCC CCCCTGCTGC CGCAGGCGGA GGTGGTCCGG
GTCAAGCCCG AGACGGCGGA CGTCCCCGGC ATCCTGGAGC GTGCCACCGG CCGTTCCCTG
GTCGTCGTGG TCAAGGACGC CCACCGCTAC GAGGCCAGCA AGTCGGTGGT GTCGGCGCTG
CTGGCCGCCC GCCCCGACGC CACGGTGGTC GAGATGGGCC TGCCGATCTG GCGCCCCGAG
GGCGTCACCT ACCTCGCCAC CTACGGCGCC GCCCGCGCCA ACGCCCAGGC CGCCGCCGAA
CTGCTCGGCG TCTGA
 
Protein sequence
MSEFIADPSD RGLRRLAAGT LLVAFRGTTA PEWVLRDLEN GLGGVTLFGF NVADPAQLSG 
LTARLREAGE PVISLDEEGG DVTRLDYHVG SPYPGNAALG AVDDVELTRR VYRSIGDDLA
RCGVNLDMAP SADVNTADDN PVIGTRSFGP DTALVARHTV AAVHGLQAAG VAACVKHFPG
HGATRQDSHL EIPLVDASLE LLRERELVPF RAAIDAGTRS IMTAHVRVPA LTGTTPATLS
AAALTVLLRG ELGYDGVIVS DALDMKAVTD TYGLAGGSVL SLAAGTDLLC LGPLPTEDDV
RRIITEIVAA VGDGRLPLAR LEAAAERVAR LRAWFGTSQT GQAEKNVIGL AAARRAVSLT
GSASPLVDPL VVEVDTPPTI AVGDVPWGFA PLLPQAEVVR VKPETADVPG ILERATGRSL
VVVVKDAHRY EASKSVVSAL LAARPDATVV EMGLPIWRPE GVTYLATYGA ARANAQAAAE
LLGV