Gene Sros_8847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8847 
Symbol 
ID8672185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9767635 
End bp9768771 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content72% 
IMG OID 
ProductFumarylacetoacetase 
Protein accessionYP_003344223 
Protein GI271970027 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.768181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTGGG GCCTGGAGCA CCTGCCCTAC GGAGTCTTCT CCCGGCAGGA GGGCGAGACC 
CCCCGCGTCG GCGTGCGGTA CGGCGAGCAC GTGGTCGACC TCGCCGGCGC CCTGCACGAC
GAGGTCTTCG CCACCGGTTC GCTCAACGCC TTCATGGCCA GGGGCTCGGC CGCGTGGCAG
TCCACCCGGG CGGTGATCCA GAACAAGCTG AGCACCGACC GGGCCGCGAT CGAGGCGCAC
CTGATCCCGC TGGACCTGGT CACGCTGCAC CTGCCGGTCG AGGTGGCCGA CTACGTCGAC
TTCTACTGCT CGCTGGAGCA CGCCTCCAAC CTCGGCCGGA TGTTCCGCCC CGACGAGGAG
CCGCTGAAGC CGAACTGGCG GCACCTGCCC GTCGGCTACC ACGGCCGGGC CGGGACCGTC
GTGGCGTCCG GCACGCCGAT CGTGCGGCCG TCCGGGCAGC GCGGCGCCGG GGTCTTCGGC
CCCTCGGCCA AACTGGACAT CGAGGCGGAG CTCGGCTTCG TGGTCGGCTC GCCGACGGCC
CTCGGCGAGC GGGCCGGCCG GTTCGAGGAC CACGTGTTCG GCGTGACGCT GGTCAACGAC
TGGAGTGCCC GGGACATCCA GGCGTGGGAG TACGTCCCGC TGGGGCCGTT CCTCGGCAAG
TCCTTCGCCA CCTCGGTGTC GCCGTGGGTC ACCCCGCTGG CGGCCCTGTC GCAGGCCCGC
CTCCCGGGCC GGCCACAGGA CCCCGAGCCG CTGGACTACC TGCGCCGCCG GGAGCCGTGG
GGCCTCGACC TGACGCTGGA GGTCTCGCTG AACGGCGAGG TCGTCTCCCG CCCGCCGTAC
CGGGACATGT ACTGGACGCC CGACCAGATG CTCGCCCACA TGACCGTCAA CGGCGCCCGC
CTGCGCACCG GGGACCTCTA CGCCAGCGGC ACGGTCTCCG GCGGCGGGCA GGACGAGCGG
GGGTCGCTGA TCGAGCTGAC CTGGAACGGC ACCGAACCGC TCAAACTGCC CGACGGCTCG
GCCAGGACCT TCCTGGAGGA CGGCGACACC GTCACGATCA CCGCGTCCGC TCCCGGTCCC
GGCGGGACCG TGATCACCCT GGGCGAGGTG TCGGGAACCA TCCGGCCCGC ACGATAG
 
Protein sequence
MSWGLEHLPY GVFSRQEGET PRVGVRYGEH VVDLAGALHD EVFATGSLNA FMARGSAAWQ 
STRAVIQNKL STDRAAIEAH LIPLDLVTLH LPVEVADYVD FYCSLEHASN LGRMFRPDEE
PLKPNWRHLP VGYHGRAGTV VASGTPIVRP SGQRGAGVFG PSAKLDIEAE LGFVVGSPTA
LGERAGRFED HVFGVTLVND WSARDIQAWE YVPLGPFLGK SFATSVSPWV TPLAALSQAR
LPGRPQDPEP LDYLRRREPW GLDLTLEVSL NGEVVSRPPY RDMYWTPDQM LAHMTVNGAR
LRTGDLYASG TVSGGGQDER GSLIELTWNG TEPLKLPDGS ARTFLEDGDT VTITASAPGP
GGTVITLGEV SGTIRPAR