Gene Sros_4989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4989 
Symbol 
ID8668283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5507210 
End bp5508814 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content70% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003340532 
Protein GI271966336 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.171054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCCGA TCCCGGCGCC TTCCCGCTGC ACACTCGGAG AAGGGCGGTT CACCTTCACC 
GCCGCGACGC CGTTGACCGC CGACCCGGTG CTGGCCGGCG CCGCGACGTG GCTCCGCCAG
GCCCTCACCC CGGCTACGGG TCTACCGCTG CTCGAAGGCC CCGGCGGGGT CGAGATCCGC
CACGCCGTGG GCCTCGGGCC CGAGGAATAT CGGCTCACCG TCACGACGGA GTCCGTGCTG
ATCGAGGCCT CGGCCCAGGC CGGTGCCTTC TACGGGGCGC AGACTCTTCG CCAGCTGCTC
GACCCGGCCG CCTTCCGCAC CGGCCGTACC GGAGACAGAA CCTGGAGCAT CCCCGCGATC
GAGATCGCCG ACGCCCCCAA GTACGGCTGG CGCGGCTGCC TCATCGACGT CGCGCGGCAC
TTCCTGCCCA AGAACGACCT GCTGAGATAC ATCGACCTGC TGGCCGCGCA CAAGCTCAAC
GTCCTCCATC TGCACCTGAC GGACGATCAG GGCTGGCGGT TGGAGATCAG GAAGTACCCG
AAGCTCACGG AGATCGGCGC CTGGCGGCGC GAGTCCCCGC TGGGAGCCAA GCAGCATCGC
CTGTTCGACG GGCGCCCGCA CGGCGGCTTC TACAGCCAGG ACGACATCAG GGAGATCGTC
TCCTACGCCG CCGACCGCTC CGTCACGATC GTCCCGGAGA TCGACCTTCC CGGCCACACT
CAGGCTGCCA TCGCCGCCTA CCCCGAGCTC GGCAACCTCG ACGTCCCCCT GGAGGTGCGC
ACCGAGTGGG GCGTCGGCGA GAACGTGCTC AACGTCTCCG ACGACACGAT CGCCTTCTTC
ACCGACGTCC TCGACGAGGT CCTCGAACTC TTCCCCGGCG AGTACGTCTG CGTCGGCGGC
GACGAGACCC CCAAGACACA GTGGAACGAG AGCGTTCCCG CCAAGGAGCG CATCCGTGAC
CTCGGCCTGC GCGACGCCGA TGAGCTGCAG AGCTGGCTCA TGCGGCACTT CACCGACTAC
CTGCTGGCGC GCGGGCGCAA GCCGCTCGGC TGGGACGAAC TCCTGGAGGG TGGCCTGCCG
CTGGGCGTCA CCGTCGCCGC CTGGCGCGGC GACAGGTGCG CGGCGATGGC CGCGCGAGCC
GGCCACGACG TTGTCGTCTC CCCGTTCGCC GAGACGTACC TGGATTTCCG CCAGGCGGAG
GGCGATCAGG AGCCGGTGCC GATCGGCAGC GTGACCTCCC TGCGCGCTGT CCACGCCTTC
GATCCGGTTT CCCCTGGCCT CACCGGGGAG GAGCGGAGCA GGATCCTCGG CGCGCAAGCG
GCGCTGTGGA CCGAGCACAT CGACTCGCCC CGGCTCCTCG ACTACATGGC CTTCCCGCGA
CTGGCCGCCT TCGCCGAGGC GATGTGGAGC GACGAGCGCG ACTTCGAGGA CTTTCTCGTA
CGGCTCGCCG TACACGAAAA GCGGCTCGAC GCCCTGGGTG TGGAATATCG TCCGGCCGCC
GGTCCGCACC CCTGGCAGCA ACGCCCTGAT GCTCCCGGCC ATCCCCGGAC CAGGGCCGAG
ATCGACCGCG TGCTCGCCGG CTGGACCTCC AGCCTGCGGC CCTGA
 
Protein sequence
MIPIPAPSRC TLGEGRFTFT AATPLTADPV LAGAATWLRQ ALTPATGLPL LEGPGGVEIR 
HAVGLGPEEY RLTVTTESVL IEASAQAGAF YGAQTLRQLL DPAAFRTGRT GDRTWSIPAI
EIADAPKYGW RGCLIDVARH FLPKNDLLRY IDLLAAHKLN VLHLHLTDDQ GWRLEIRKYP
KLTEIGAWRR ESPLGAKQHR LFDGRPHGGF YSQDDIREIV SYAADRSVTI VPEIDLPGHT
QAAIAAYPEL GNLDVPLEVR TEWGVGENVL NVSDDTIAFF TDVLDEVLEL FPGEYVCVGG
DETPKTQWNE SVPAKERIRD LGLRDADELQ SWLMRHFTDY LLARGRKPLG WDELLEGGLP
LGVTVAAWRG DRCAAMAARA GHDVVVSPFA ETYLDFRQAE GDQEPVPIGS VTSLRAVHAF
DPVSPGLTGE ERSRILGAQA ALWTEHIDSP RLLDYMAFPR LAAFAEAMWS DERDFEDFLV
RLAVHEKRLD ALGVEYRPAA GPHPWQQRPD APGHPRTRAE IDRVLAGWTS SLRP