Gene Sros_7722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7722 
Symbol 
ID8671044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8518732 
End bp8520618 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content77% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003343135 
Protein GI271968939 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCCCCC AGGACCGGCC CTCGCACGGA TCCCCCTCCC CGCGACCGGC CCCCCGGCAT 
CCGGCCCCGC CCGGCCAGGG CTCCGGAACT CCCGGGCAGG GCTCCGGACC GGCGGCCGCG
TACGGCCTGA TCCCGCTGCC CTCCTCGGTC CGGCCGGGCT CCGGCGAGTT CACGCTGACC
GCCGGGACCG CTCTCGACGC CCCGCCCGAG CTGGCCGCCG TCGCCGCCTG GCTCCGCTCG
GCACTCGGCC CGGCCACCGG CTGCGCCCTC CTGCCGTCGG GGGCCGGGAT CCCGGGCGTC
GCCGGCGGGG ACGGTGCCGG GGCGGACGCG GACGCGGAGC GTGCGGCCGG GGGAGAGGGG
CGCATCGTGC TGGCGCTCGG CCCCGGTCTG GCGGCCGAGG AGTTCCGGCT CCGGATCACC
CCCTCCGGCG TGCGGATCAC CGGCGGTGAC GCCGCGGGAG TGTTCTACGG GGCGCAGACC
CTCCGCCTGC TGCTCCCGCC CGCCGCGCTG CGCCGGGCGC CCGTCGCGGG CGGCCCGCTG
ACGGTGCCCG CCACCGAGAT CGAGGACCGG CCCCGGTACG GCTGGCGCGG ATGCATGCTG
GACGTCGCCC GGCACTTCAT GCCCAAGGCG GACCTGCTGC GCTTCATCGA CCTGCTCGCC
CTGCACCGGC TGAACGTGCT CCACCTCCAC CTCACCGACG ACCAGGGCTG GCGGGTGGAG
ATCGCGAAGT ATCCCCGGCT GACCTCGGTG GGAGGCTGGC GGTCGTCGAG CATGCTCGGC
TCCCGCCAGC ACGAGACCTT CCTGCCGCGC CCGCACGGCG GCTTCTACAC CGCCGACGAC
GTGCGGGAGA TCGTCGCCTA CGCGGCCGAG CGGTACGTCA CCGTGGTGCC CGAGATCGAC
CTGCCCGGTC ACACCCAGGC CGCGGTCGCC GCCTACCCCT GGCTCGCCCC GGAGGCGGCC
GGCGTCCCGG AGGCGGCCGG CGTCCCGGAG GCGGCCGGCG TCCCGGAGGC GGTTGACCGC
CCGGACACGG CCGAGGCTCC GGGAGTGGCC GACGGCCGGG ACACAGCCGA CGGCCCGGAC
GCGGCCGACG GTGCGGGGCC GGTCGGGGTC CGCGCCGCGT GGGGGCTCTC CCCGCACTGC
CTCGACGTCA CCTCCGCCGC GGCGCTGGAC TTCTGCCGTG ACGTCCTGGA CGAGGTCATG
GAGCTGTTCC CCGGCTCGCA CGTGGGCATC GGCGGCGACG AGTGCCCGGG TGACGCCCGC
TCGCGGGCCG CCTTCGTCGC CGCCCTCGCC GAACACGTCA CCGGACGGGG GCGCGTGCCG
TACGCCTGGG ACGAGATCCT GGAGTGCGGG GCGGTGCCCG GTGTGACGGT GGCGGCCTGG
CGCGGGCCCG ACGCGGCGGC CGTGGCGGCG CGGGCCGGCC ACCAGGTGGT CTCCTGCCCG
GACATGTTTG TCTACTTCGA CTATCGCCAG TCGGAGCTGC CGGACGAGCC CATCCCCGTC
GGCCCGCCCC TCTCCCTCCA GGACGTCCAC GCCTTCGAAC CCGAGCCGGC GGGGCTGACC
GCCGACGAGC GCTCCCGCGT GATCGGAGCC CAGTGCAACG TCTGGACCGA GCACATGGAC
ACCCCGAGGG CCGTCGACTA CATGGTCTTC CCGCGGCTGT GCGCGTTCGC CGAGGTCGTG
TGGTCCGCGG ACCGCGCGCC GTACCCGGAC TTCGCCCGGC GGCTGGAGGC GCACACGGCC
CGGCTCGACG CGCTGGGAGT GGAGTACCGG CCCGCGGCGG GACCGCGTCC CTGGCAGAGC
CGTCCCGACG CGCCCGGCTG GCCGATCAGC AAGGCCCGGC GCGAGGCCAT GGTGGCCGAG
TTCGGCGTCG GCAAGCACCA GGGGTGA
 
Protein sequence
MTPQDRPSHG SPSPRPAPRH PAPPGQGSGT PGQGSGPAAA YGLIPLPSSV RPGSGEFTLT 
AGTALDAPPE LAAVAAWLRS ALGPATGCAL LPSGAGIPGV AGGDGAGADA DAERAAGGEG
RIVLALGPGL AAEEFRLRIT PSGVRITGGD AAGVFYGAQT LRLLLPPAAL RRAPVAGGPL
TVPATEIEDR PRYGWRGCML DVARHFMPKA DLLRFIDLLA LHRLNVLHLH LTDDQGWRVE
IAKYPRLTSV GGWRSSSMLG SRQHETFLPR PHGGFYTADD VREIVAYAAE RYVTVVPEID
LPGHTQAAVA AYPWLAPEAA GVPEAAGVPE AAGVPEAVDR PDTAEAPGVA DGRDTADGPD
AADGAGPVGV RAAWGLSPHC LDVTSAAALD FCRDVLDEVM ELFPGSHVGI GGDECPGDAR
SRAAFVAALA EHVTGRGRVP YAWDEILECG AVPGVTVAAW RGPDAAAVAA RAGHQVVSCP
DMFVYFDYRQ SELPDEPIPV GPPLSLQDVH AFEPEPAGLT ADERSRVIGA QCNVWTEHMD
TPRAVDYMVF PRLCAFAEVV WSADRAPYPD FARRLEAHTA RLDALGVEYR PAAGPRPWQS
RPDAPGWPIS KARREAMVAE FGVGKHQG