Gene Sros_1431 details


Gene Information

Locus tag: Sros_1431
Symbol:
ID: 8664706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1493938
End bp: 1495221
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: beta-hexosamidase A precursor
Protein accession: YP_003337168
Protein GI: 271962972
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.982661
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.138036
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGCCG GATGTGCCGG CACGGGAGCG GGGGCGACCC CGCCCGCCCG GCAGGCCGCG 
GGGCAGGTCA ACGGCACGGC CACGACCGCC CCCTCGCCGT CCGCGACGTC CGCGGCGTCC
GCGACCCCGG GGGCCTCCAA GGTGGAGGCC GTGCTCGCGC GGATGAGCGT GGAGGACAAG
GTCGGGCAGC TCTTCATGCC GGTGCTGTAC GGCTCGGCGG CGGACACGGT GTCGGGGGAG
AACCAGGCGC GGTTCGGGGT CGGCACCCCG GCCAAGGCGG TCGCCAGATA CCGGCCGGGC
GGGGTGATCC TGTTCCCCTG GGCGGGCAAC GTCAAGAACG TCCGGCAGGT CGTGGCGCTG
ACCAACGGGC TGCAGAAGGC GTCGCCGGAG ATCCCGCTGC TGGTCGGCGC CGACCAGGAG
AACGGCAGGG TCTCCCGGAT GGCCCCGCTG GTCACCGAGA TGCCCGGCGC CTCGGTCATC
GGCTCGACCG GCGATCCCTC GCTGGCCCGC AAGGCGGCCA AGGTCACGGG CACCGAGCTG
CGCGCCCTCG GCATCAACCT CGACTTCGCC CCGGTCGCCG ACGTGAACAT CAACCCGCGC
AACCCGGTGA TCGGCCCCCG GGCCTACGGT TCGGACCCGA AGAAGGTGGC GCCGATGGTC
GCCGCGGCGG TCCAGGGCTT CCACGACGCC GGCATCGCCA GTACGGCCAA GCACTTCCCC
GGCCACGGCG ACACCAACGT GGACAGCCAC TCCGGGCTGC CGGTGATCCA GCACTCCCTG
TCCCAGTGGA ACAAGCTGGA CGCGCCTCCC TTCGCCGCGG CCATCGGCAA GAACATCGAC
GCGATCATGA GTGCCCACGT GGTCATGCCC AAGCTCGACC CGTCCGGTGA CCCCGCCACG
CTCTCCAAGC CCATCCTGAC CGGGCTGCTC CGCGAGAAGC TCGGCTTCGA CGGGGTCGTC
TCGACGGACG CGCTGGACAT GGCGGGGGTG CGCAAGAAGT ACGGGGACGG GCAGGTGGCC
GTGCGGGCCA TCCAGGCCGG GGTGGACCTG CTGCTGATGC CGCCGGACTT CCCCAAGGCC
TACGGGGCGG TGCTGGCCGC GGTGAAGTCC GGGAAGATCT CCACCGCGCG GCTCGACCAG
TCCGTCCGGC GGCTGCTGAA GCTGAAGGCC GCGCGGGGCC TGCTGGACCG GGCGCCGGTC
GCCGACCCGG CCGAGGCCGA GCGGGTGCTG CGCTCGGCCG AGCACCGCAA GGTCGCCCAG
CTCATCAACG CGCGGGCCCG CTGA
 
Protein sequence
MVAGCAGTGA GATPPARQAA GQVNGTATTA PSPSATSAAS ATPGASKVEA VLARMSVEDK 
VGQLFMPVLY GSAADTVSGE NQARFGVGTP AKAVARYRPG GVILFPWAGN VKNVRQVVAL
TNGLQKASPE IPLLVGADQE NGRVSRMAPL VTEMPGASVI GSTGDPSLAR KAAKVTGTEL
RALGINLDFA PVADVNINPR NPVIGPRAYG SDPKKVAPMV AAAVQGFHDA GIASTAKHFP
GHGDTNVDSH SGLPVIQHSL SQWNKLDAPP FAAAIGKNID AIMSAHVVMP KLDPSGDPAT
LSKPILTGLL REKLGFDGVV STDALDMAGV RKKYGDGQVA VRAIQAGVDL LLMPPDFPKA
YGAVLAAVKS GKISTARLDQ SVRRLLKLKA ARGLLDRAPV ADPAEAERVL RSAEHRKVAQ
LINARAR
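
The length figures in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The helper names are hypothetical, chosen for illustration.

```python
# Hypothetical helpers for sanity-checking a gene record; not from the source page.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp, has_stop=True):
    """Residue count for a CDS; the stop codon encodes no residue."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq):
    """True if the last three bases form a stop codon."""
    return clean_sequence(seq)[-3:] in STOP_CODONS


if __name__ == "__main__":
    length = gene_length(1493938, 1495221)
    print(length, protein_length(length))
    # Pass the full gene sequence here to compare against the listed GC content.
    print(gc_fraction("ATGGTGGCCG GATGTGCCGG"))
```

Under these assumptions the coordinates give 1284 bp and 427 residues, which agrees with the Gene Length and Protein Length fields. Passing the full gene sequence to `gc_fraction` would allow a similar comparison with the listed GC content.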