Gene Sros_3060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3060 
Symbol 
ID8666347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3336966 
End bp3339428 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content71% 
IMG OID 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003338753 
Protein GI271964557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.481908 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGATG CCCCCCACCG CTGGACATCA CTGTCGAAAC ACCGCTCGAT CGCGGTTGCC 
CTGTCCGCCG CACTCGCCAT GACGTTGCTG GCCCCACCGG GAGCCGGCAG CGCACAGGTG
ATCACGCCGG ATGCCCCGGC GGCGGCGGGC GCCGCCGCGC CGGCCCCGAC CAACCTGGCC
CTCGCGGGCA CGGCCACCGC CTCCAGCGTC GAACTCGACC GCGACACCTT CGCCGCGGCC
CACGTCAAGG ACGGCGACAC CGGCACCCGG TGGTCGTCGA AATACCAGGA CGACAACTGG
GTGCAGATCG AGCTGGCCGG CCCGTCCAAG GTCGACCACG TCACGCTCAC CTGGCCCAAC
GCCTGCGCCC GCGACTTCGT CCTGCAGACC TCGGCTGACG GCGCCACCTG GACCGACGTC
GCCGCCCTCC AGCGGGACAC CTGCCCCAGG ACCGACGTCG TCGAGGTGAA GTCCGAGGAG
CCGGTCTCCT TCGTCCGGAT GCAGGGCCGC AAGCGCTGGG CGGCCTACGG ATACTCCATC
TCCGAGTTCG AGATCTGGGA TCGCCCGCCC GCGCCGCCGG AGCCGACGCT CGGCCTGGTG
CCGGAGCCGG TGTCGGTCGA GGAACACGAC GCCACGCCGT TCACCCTCCA CCGGAACACG
CGGATCGTCG CGGTCGGCGA CGGCGCCCAG GCGCCGGCGC GATACCTGGC CGGCGTCCTC
CGCCCGTCGA CCGGGCACCG CCTGCCCGTC GTCGGCCGGG GATGGGGACC GGCGCCGATC
GTCATCGAGG TCGGGCCGGG GAAGGGCCCC GACGGTCATG AGGCCGAGGG CTATACCCTC
ACCGTCGCCG CCAGTCGTAT CAGCATCGGC GCCGACACGC CCAACGGCGC GCTCAACGGT
GTGCAGACGC TGCGCCAGCT GTTCCCGCAG TGGGTGGAGT CCGGCACCGC CACCGACGCC
GAGTGGACGG TGCCGGCGGT GACGATCACC GACTACCCGC GCTTCGCGCA CCGGGGGATG
ATGCTCGACG TCGCCCGCAG CTTCTATCCG GTGAACGAGG TCAAGGCCTA CATCGACGCA
GCGGCGCAGT TCAAGGTCAA CCGGCTCCAC CTGCACCTGA CCGACGACCA GGGGTGGCGC
ATCGCGATCG ACGAGCCGCG GGACAACCCG TCCGGCATCG ACTACGGCCT GCTCACCGAG
GTGAGCGGCG CCACCGCCAT GACCTATAAC GGCAACGGTC AGCTGATGGG CACCGAGCTC
GGCGTCACCG GCCACTACAC GAAGGCCGAC TACGCCGAGA TAGTCCGCCA CGCCGGCGAG
AACGGCATGA CGGTGATCCC CGAGATCGAC ATGCCCGGCC ACACCAACGC CGCGCTGCAC
GCGATCCCCC AGCTCAACAC CCCGGGCGCC CAGCCGCGGC CGAAGCCGGG AGAGACGACG
GTGCCGCACA ACGGCACCGG CTCGGTCGGC TACTCGTCGT TCGACTCGGG CAGCGACGTG
ACGTACGAGT TCGTCGAGCA CGTCCTCACC GAGATCGCCG AGATGACCCC CGGGCCGTAC
CTGCACATCG GCGGTGACGA GGCGCACGTC ACGAGCCACG CCAGCTACAC CACGATGGTC
GACGCGTTCA CCAGGACCGT CACCTCCCTG GGCAAGACCG TCGTCGGCTG GAACGAGTAC
TCCGGGACCG CGCTCCCGCA GGACAAGGCC GTCGTCCAGT TCTGGAACGG GAACCGGGCC
GCCGTGGCCG GCGCGGTGCG CGACCGCGGC GCCAAGGTCA TCCTCTCCCC CGCGGCCCAC
ACCTACGTGC CGCAGAAGCA GGACCCCCGC CAGCCCCAGG GCGGCACCTG GGCGTGCGGC
GGGCCCTGCG GACTGGACCG CCACTACAAC TGGGACCCGG GGACGTTCAT CCCGAACATC
GCCGAGTCCA GCGTGCTGGG CGTCGAGTCG GCGCTGTGGG GAGAGTTCAT CCGGCGCCTC
GGCCAGGCGC AGTACTACAG CTTCCCCCGC ATCATCGCCA CCGCCGAGGT CGGGTGGACC
CCGCAGGCGC AGCGCGACTA CCGCGACTTC AAAACGCGCC TGGCCAAGGT GGGGGGCAGG
CTGACGGTTC AGGGGACGAA CTTCTTCCCC ACCGCCGACG TCGCCTGGCT GACGGACGTG
CTCGGCACCC CGGCCGCCGT CGACAGCGGC GAACCGGCCG GGGCCACCTG GACCGTCACC
GCTCCGGGAG CCGCGCCCGG TGATCTCACG GCGACCATCG CCTGGAGTGA CGGCATGCGG
GAGGACGTCA CGTTGACGAC CCCCCGTCAG GCAAGCATCC CCGACATGCG GATCAACGAC
GCGTTCACCG CCACGTCCGG CCGTACCTTC GACCGGCCGG GAACATACAC GGGAACGCTC
TCGGTCAACG CGCCCGGCCG GTCCCCGGTC GAGGGGCGCC TGACGGTCAC CGTACGCGGC
TGA
 
Protein sequence
MSDAPHRWTS LSKHRSIAVA LSAALAMTLL APPGAGSAQV ITPDAPAAAG AAAPAPTNLA 
LAGTATASSV ELDRDTFAAA HVKDGDTGTR WSSKYQDDNW VQIELAGPSK VDHVTLTWPN
ACARDFVLQT SADGATWTDV AALQRDTCPR TDVVEVKSEE PVSFVRMQGR KRWAAYGYSI
SEFEIWDRPP APPEPTLGLV PEPVSVEEHD ATPFTLHRNT RIVAVGDGAQ APARYLAGVL
RPSTGHRLPV VGRGWGPAPI VIEVGPGKGP DGHEAEGYTL TVAASRISIG ADTPNGALNG
VQTLRQLFPQ WVESGTATDA EWTVPAVTIT DYPRFAHRGM MLDVARSFYP VNEVKAYIDA
AAQFKVNRLH LHLTDDQGWR IAIDEPRDNP SGIDYGLLTE VSGATAMTYN GNGQLMGTEL
GVTGHYTKAD YAEIVRHAGE NGMTVIPEID MPGHTNAALH AIPQLNTPGA QPRPKPGETT
VPHNGTGSVG YSSFDSGSDV TYEFVEHVLT EIAEMTPGPY LHIGGDEAHV TSHASYTTMV
DAFTRTVTSL GKTVVGWNEY SGTALPQDKA VVQFWNGNRA AVAGAVRDRG AKVILSPAAH
TYVPQKQDPR QPQGGTWACG GPCGLDRHYN WDPGTFIPNI AESSVLGVES ALWGEFIRRL
GQAQYYSFPR IIATAEVGWT PQAQRDYRDF KTRLAKVGGR LTVQGTNFFP TADVAWLTDV
LGTPAAVDSG EPAGATWTVT APGAAPGDLT ATIAWSDGMR EDVTLTTPRQ ASIPDMRIND
AFTATSGRTF DRPGTYTGTL SVNAPGRSPV EGRLTVTVRG