Gene Sros_3784 details


Gene Information

Locus tag: Sros_3784
Symbol:
ID: 8667074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4207809
End bp: 4209341
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Glycoprotein endo-alpha-1,2-mannosidase
Protein accession: YP_003339448
Protein GI: 271965252
COG category:
COG ID:
TIGRFAM ID:
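
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4207809
end_bp = 4209341
gene_length_bp = 1533
protein_length_aa = 510

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp)              # 1533
print(remainder)            # 0
print(expected_protein_aa)  # 510
```

Under these assumptions the reported gene length matches the coordinate span, and the reported protein length matches the codon count minus the stop codon.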


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00391126
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.836971
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAACT GGCGAAGCGC GGCGGCCGTC GCCGCCCTGC TCCTGACTGC CGGCCTGACA 
GCCGGGGGAG CGCCGCCGGC CCTGGCGGCC GGAACGGCCG TCGCGGCGGA GCGCAGCGCG
ACCGACGTCC ACCTGTTCTA CTACCCCTGG TACGGCAGCC CGGCGGTCTC CGGCGGCTAC
CGGCACTGGC AGCAGGGCGG CCACACCCCG CCCGGCGACG TGGGCGCGGA CTTCTACCCC
AAGCTCGGCG CCTACGACTC CGGCGACTTC GACGGCGCGG TGGCACAGCA CATGAGCTGG
ATCCGGCGCT CGGGCGCCGG GGTCATCGTC TTCAGCTGGT GGGGGGAGGA CTCCTACGAG
GACCGGCTCG CCGCCGGGGT GCTGGAGGCG GCGGCGCGAT CCGGGGTCAA GGTCGCCTGG
CACCTGGAGC CCTACGCCGG CCGGACGGCG GCCTCGACGG TCGCCGACAT CGCCTACATC
AACTCCCGCT ACGGGAGCAG CCCGGCCTTC TACCGCGACG CCGGGCACGG CGGCCGCGGC
GCGTTCTACG TCTTCGAGAG CCTGCGGATC GCCGACTGGT CGGCCCTGGA CCAGGTGCGC
GCGCACAGCA TCGTGCTGGC GCAGACCACC GACACCACCA AGGTCGCGCA CTTCGGCGGG
ATGTACACCT ACGACGCGAT CGCCGGCGCG ACGGCCCCCG GCTGGAAGCA GGCGAGCGAC
TTCTGCAGGG CGAACGGTCT CGTCTGGGCG CCCTCGGTGG GCCCCGGCTA CGTCGACGAC
CGGGCGGTCC CCGGCAACAC CACCCCGACA CTGGGCCGCG ACAACGGCGC CACCTACGAC
CTGGAGTGGC GCAACGCGCT GGCCCCGGCC ACGGGCGGGT CGCCGACCTG GGTCTCCGTC
ACCTCCTTCA ACGAGTGGCA CGAGGGCTCG ATCCTCGAAC CGGCGAGCTC CACCCCGCCC
GCGGGATCCG GCTACCAGAC CTTCGCCGGC GCCTACGGCA AGACGGGCAC CGACGCCGAG
ACCGCCTACC TCGACCGGAC GAGGCACTGG GTCACCCAGT TCACCGGGGA GGTCGCGCCG
CCCGATCCCG ACCTGGCGGC CGGCAAGGCG ATCACGGCGA GCAGCCACAC CGGCGGCTAC
GGCGCGGCGG CGGCCAACGA CGGGAACACC GGCACCTACT GGGAGAGCCT CAACCACACG
TTCCCGCAGT CCATCACCGT CGATCTGGGC GCGGCGAGCA GCGTCGGCAG GATCGTCCTC
AAACTGCCGC CGTCGCCCGC CTGGGGTGCC AGGACCCAGA CCCTCTCCGT CCTCGGAAGC
CTGAACGGCT CGGCGTACTC GACGATCTCC CCGTCCGCGG GCCGTACCTT CGACCCGGCC
ACCGGCAACA CCGTCACCAT CACCTTCCCC GCCACCACCC AGCGCCACAT CCGGCTGGCC
ATCACCGCGA ACACCGGCTG GCCGGCCGGG CAGCTCGCGG AGCTGCAGGT CTTCCGGAGC
CGGCACCCCA CCGGCATCGG AGATCTCGGC TGA
 
Protein sequence
MRNWRSAAAV AALLLTAGLT AGGAPPALAA GTAVAAERSA TDVHLFYYPW YGSPAVSGGY 
RHWQQGGHTP PGDVGADFYP KLGAYDSGDF DGAVAQHMSW IRRSGAGVIV FSWWGEDSYE
DRLAAGVLEA AARSGVKVAW HLEPYAGRTA ASTVADIAYI NSRYGSSPAF YRDAGHGGRG
AFYVFESLRI ADWSALDQVR AHSIVLAQTT DTTKVAHFGG MYTYDAIAGA TAPGWKQASD
FCRANGLVWA PSVGPGYVDD RAVPGNTTPT LGRDNGATYD LEWRNALAPA TGGSPTWVSV
TSFNEWHEGS ILEPASSTPP AGSGYQTFAG AYGKTGTDAE TAYLDRTRHW VTQFTGEVAP
PDPDLAAGKA ITASSHTGGY GAAAANDGNT GTYWESLNHT FPQSITVDLG AASSVGRIVL
KLPPSPAWGA RTQTLSVLGS LNGSAYSTIS PSAGRTFDPA TGNTVTITFP ATTQRHIRLA
ITANTGWPAG QLAELQVFRS RHPTGIGDLG
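
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal example, not the method used to produce the record: it builds the codon table in the usual NCBI base order (T, C, A, G), relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments for translation table 11, listed in NCBI
# base order T, C, A, G. These assignments are the same as the standard code;
# table 11 differs from it in the set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCGCAACT GGCGAAGCGC GGCGGCCGTC GCCGCCCTGC TCCTGACTGC CGGCCTGACA"
last_line = "CGGCACCCCA CCGGCATCGG AGATCTCGGC TGA"

print(translate(first_line))  # MRNWRSAAAVAALLLTAGLT
print(translate(last_line))   # RHPTGIGDLG
```

The two translated fragments agree with the start and end of the protein sequence listed in the record. The last line can be translated on its own because the 25 preceding lines hold 1500 bases, a multiple of three, so the reading frame is preserved.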