Gene Sros_0353 details


Gene Information

Locus tag: Sros_0353
Symbol:
ID: 8663621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 340472
End bp: 341623
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: Xylose isomerase
Protein accession: YP_003336128
Protein GI: 271961932
COG category:
COG ID:
TIGRFAM ID:
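
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 340472
end_bp = 341623
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute an amino acid.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # coordinates agree with the gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with the protein length
```

Under these assumptions all three checks agree with the values listed in the record.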


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.539165
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACT ACACGCCCAA GCCTGAGGAC CGCTTCACCT TCGGGCTGTG GACCGTCGGC 
TGGCAGGCCC GTGACCAGTT CGGAGACGCG AGCCGTGCCC CGCTCGACCC GGTGGAGAGC
GTCCACCGCC TCGCCGAGCT CGGCGCGTAC GGCGTCACCT TCCACGACGA CGACCTGCTG
GCCGTCGAGC CGGACCGGGA CAAGGCCGTC GAGCGTTTCA AGAAGGCCCT GGCCGAGACC
GGCCTCAAGG TCCCCATGGC CACCACGAAC CTGTTCACCC ACCCCGTCTT CAAGGACGGC
GGGTTCACCA GCAACGACCG CGACGTGCGC CGCTACGCCC TGCGCAAGGT GATGCGCAAC
GTCGACCTGG CAGCCGAGCT CGGCGCGACC ACCTACGTCT GCTGGGGCGG CCGCGAGGGC
GCCGAGTCAG GGGCCGCCAA GGACATCAGG GCCGCGCTCA GCCGTTACAA GGAGGGCATG
GACCTGCTGA CCTCCTACGT GATCGACCGG GGCTACGACA TCAGGTTCGC CATCGAGCCC
AAGCCGAACG AGCCGCGCGG CGACATCCTG CTCCCGACCG TCGGCCACGC GCTCGCCTTC
ATCAACGAGC TGGAGCACTC CGAGCGGGTC GGCCTCAACC CGGAGGTCGG CCACGAGGAG
ATGGCCGGGC TCAACTTCGC GCACGGCATC GCGCAGGCGC TCTGGCACGG CAAGCTCTTC
CACATCGACC TCAACGGCCA GCACGGCCCC CGGTTCGACC AGGACCTCGT CTTCGGCCAC
GGCGACGTGA AGAACTCCTT CTTCCTGGTG GACCTGCTGG AGAACGGCGG CTACGACGGC
CCCCGGCACT TCGACTACAA GCCGCTGCGC ACCGAGGACG CCGAGGACGT CTGGGTCTCG
GCCGCGGCCA ACATGCGCAC CTACCTGATC CTCAAGGAGA AGGTGAAGGC CTTCCACGCC
GACCCCGAGG TCGTCGAGGC GCGCGCCGCC AGCAGGGTCG CCGAGCTGTC CGAGCCCACG
CTGGCCCCCG GTGAGACGCT TGAGGACCTG CACCGCGACG ACTTCGACGT CGACCGGGCC
GCCGCGCGAG GCTTCCACTT CTCCCGGCTG AACCAGCTCG CCCTGGAGCA CCTCCTCGGA
GTCCGGGGAT GA
 
Protein sequence
MSDYTPKPED RFTFGLWTVG WQARDQFGDA SRAPLDPVES VHRLAELGAY GVTFHDDDLL 
AVEPDRDKAV ERFKKALAET GLKVPMATTN LFTHPVFKDG GFTSNDRDVR RYALRKVMRN
VDLAAELGAT TYVCWGGREG AESGAAKDIR AALSRYKEGM DLLTSYVIDR GYDIRFAIEP
KPNEPRGDIL LPTVGHALAF INELEHSERV GLNPEVGHEE MAGLNFAHGI AQALWHGKLF
HIDLNGQHGP RFDQDLVFGH GDVKNSFFLV DLLENGGYDG PRHFDYKPLR TEDAEDVWVS
AAANMRTYLI LKEKVKAFHA DPEVVEARAA SRVAELSEPT LAPGETLEDL HRDDFDVDRA
AARGFHFSRL NQLALEHLLG VRG
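
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First and last lines of the gene sequence listed above.
first_line = "ATGAGCGACT ACACGCCCAA GCCTGAGGAC CGCTTCACCT TCGGGCTGTG GACCGTCGGC"
last_line = "GTCCGGGGAT GA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))      # 60 bases on a full line
print(head[:3])       # ATG, the start codon
print(tail[-3:])      # TGA, the stop codon
print(gc_fraction(head))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value that can be compared with the GC content field in the record.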