Gene Sros_3988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3988 
Symbol 
ID8667282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4444242 
End bp4445321 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content72% 
IMG OID 
Productglycosyl hydrolase BNR repeat-containing glycosyl hydrolase 
Protein accessionYP_003339639 
Protein GI271965443 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.757519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACA CACTCCTCGC CGTGGGCACC CGCAAAGGCC TGTTCCTCGC CCGCTCCACC 
GGGGACGGCC CGTTCGAGGT GGAACCGGTC CGCTTCTCCA CCATCGGCGT GCACTCGGTC
GCCATCGACA CCCGAGGCCC GGTGCCCCGG ATCCTCGCCG GGATCGAGTA CGGCCATTTC
GGCCCCTCCG TGATGTGGTC GGATGATCTC GGCGAGACCT GGCAGGAGGC CGAACAGGCC
CCGATCGCCT TCCCGCCGGA AACCGGGGCG GCGCTCACCC GCGTCTGGCA GCTCCAGCCC
TCCCCGTCGG AGCCGGGGGT GGTCTGGGCC GGGGTCGAGC CGGGGGCGCT GTTCCGCTCC
GAAGACGGCG GCGTCACCTT CTCCCTGGTG GAAGGCCTGT GGAACCATCC GCACCGGGCC
CAGTGGCAGC CCGGTTTCGG CGGGCTCTGC CTGCACAGCG TGCTACAGCA CCCGAGCGAT
CCGAAGACCA TGGCGATCGC GGTCTCCTCC GGCGGCTTCT ACCGCAGCAG TGACGGCGGA
GCCTCCTGGG AGGCGGCCAA CAAGGGGATC CGCGCCCCGT TCCTCCCCGA GGGATCGCAG
TACCCCGAGT TCGGGCAGTG CGTGCACAAG GTCGGCATGC ATCCGTCCCG GCCGGAGCGG
CTCTTCCTCC AGCACCACTT CGGCGTCTAC CGCAGCGACG ACTTCGGCGG CTCCTGGACC
TCCGTCGGCG ACCCGCTCCC CTCCGACTTC GGCTTCCCGC TGGCCGTGCA CCCCGACCGG
CCCGACACGC TGTACGTCTT CCCGCTCCAG GCCGACATCG ACCGCACGCC CGTCGACCAC
CGCTGCCGGG TCTTCCGCAG CGACGACGCG GGCGCGTCGT GGCAGCCCCT GAGCAAGGGA
CTGCCCGAGG GCCCGGTCCA CGCCGCCGTG CTGCGCGACG CCCTGTGCGC GTCGAGGGCG
GGGATCTTCT TCGGCACCCG CGACGGCGAG GTGTACGGCT CCCGCGACGA CGGCGAGAGC
TGGTTCTCCG TCGCCGAGCA CCTGCCGGAC GTCCTGACCG TGCGTGCGGC GGTGCTCTGA
 
Protein sequence
MPDTLLAVGT RKGLFLARST GDGPFEVEPV RFSTIGVHSV AIDTRGPVPR ILAGIEYGHF 
GPSVMWSDDL GETWQEAEQA PIAFPPETGA ALTRVWQLQP SPSEPGVVWA GVEPGALFRS
EDGGVTFSLV EGLWNHPHRA QWQPGFGGLC LHSVLQHPSD PKTMAIAVSS GGFYRSSDGG
ASWEAANKGI RAPFLPEGSQ YPEFGQCVHK VGMHPSRPER LFLQHHFGVY RSDDFGGSWT
SVGDPLPSDF GFPLAVHPDR PDTLYVFPLQ ADIDRTPVDH RCRVFRSDDA GASWQPLSKG
LPEGPVHAAV LRDALCASRA GIFFGTRDGE VYGSRDDGES WFSVAEHLPD VLTVRAAVL