Gene Sros_4233 details

Gene Information

Locus tag: Sros_4233
Symbol:
ID: 8667527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4717489
End bp: 4718781
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003339878
Protein GI: 271965682
COG category:
COG ID:
TIGRFAM ID:
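
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4717489
END_BP = 4718781


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1293 430
```

Both results match the Gene Length (1293 bp) and Protein Length (430 aa) values listed in the record.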


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.121217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0238527
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTACTT ACCGCGAGCT CCTGCGCACG CCGGAGTTCA CTCCGCTCTT CCTGACCGTC 
TCCGCGCAGG TGGCGGCCGC GACCGTGAGC GGGCTGGCGC TGGGCGTGCT GGTGTACGCG
GCCACCGGGT CACCGCTGCT GGCCGCCCTC AGCATGTTCG GTCCCTCGCT GGCCCAGGCG
ATCGGCGCGG CCGCGCTGCT GTCGGCCGCC GACCGGCTTC CGCCGCGCGC CGCGATGACG
GGAGTGGCCC TGGCCTTCTG CCTCGGCACC GCCGCCCTGG CCGTGCCCGG GCTGCCGCTG
CCGGGCATAT TCGCGATCAT CCTGGGGCTC GGCCTGGTCG GCTCGGTGGG CGGCGGGGTG
CGCTACGGGC TGCTGAACGA GATCGTGCCC GCGGACGGCT ACCTGCTCGG GCGGTCGCTG
GTCAACATGT CCGTCGGGAT CATGCAGGTC TGCGGGTTCG CGGCGGGCGG CGTGCTGGTG
TCGGTGTTGT CGCCGCGCGG CACGCTGCTG GCCGGGGCCG CCCTGTATCT CGTCGCCGCG
GGCACCGCCA GGTTCGGCCT CAGCGCGCGG GCGCCACGGG CCGTGGGGCG GCCGTCGGTG
GCCGCGACGT GGCGCTCCAA CGTGCGGCTG TGGTCCTCGG CGCCCCGTCG CCGCGTCTAC
CTCGCGCTCT GGGTGCCGAA CGGGCTGATC GTCGGTTGCG AGTCGCTGTT CGTACCGTTC
GCGCCCGAGC AGGCCGGGAC GCTCTTCGCC TTCGCCGCGT CCGGCATGCT GGCCGGGGAC
GTTCTGGTCG GCAGGTTCGT GCCGGCGCGG TGGCAGGCGC GGCTCGGCGC CGCGCTGCTG
CTCCTGCTGG CCGCGCCGTA CCTGGTGTTC GCCGTGGACC CGCCGGTGCC GCTCGCCGTC
GCGGCCGTGA CGGTGGCCTC GATCGGATAC GCGGCGAGCC TGGTGCTGCA GCAGCGGCTG
ATGGACCTGA CCCCGGCCGA GATGAGCGGG CACGCGCTGG GGCTGCACTC CTCCGGCATG
ATCACCATGC AGGGTGTCGC CGCCGCCCTC GCCGGCACGC TCGCTCAGTA CACCTCGCCG
GGGACGGCGA TCGCCGTCAT GGCGGCGGCG TCCGTGACGG TCACGCTGGC GCTGGCGCGC
GGCCTCTCCG GGCCGGCACG CACAGGCCCG TCTCAGAGAG CGCGCGCCGA CATCGAGCGG
CACGGCGACG CCGGGAACCC GGCGGCACCG CCCCAGGCCG GTCACCTCGC CGGGAACGCT
CCGCCGGCCA CCGGAAACCG GCGGACGACG TGA
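
The GC content field reported for this gene can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases among all A, C, G and T bases. The helper name `gc_fraction` is hypothetical, and the database may round or define the value differently.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters of seq.

    Spaces and line breaks (as used in the 10-base blocks of the
    sequence listing) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Small illustrative inputs, not the gene itself.
print(gc_fraction("GGCC"))    # 1.0
print(gc_fraction("AT GC"))   # 0.5
```

Passing the full gene sequence, with its block spacing left in place, would give a value to compare against the GC content listed in the record.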
 
Protein sequence
MRTYRELLRT PEFTPLFLTV SAQVAAATVS GLALGVLVYA ATGSPLLAAL SMFGPSLAQA 
IGAAALLSAA DRLPPRAAMT GVALAFCLGT AALAVPGLPL PGIFAIILGL GLVGSVGGGV
RYGLLNEIVP ADGYLLGRSL VNMSVGIMQV CGFAAGGVLV SVLSPRGTLL AGAALYLVAA
GTARFGLSAR APRAVGRPSV AATWRSNVRL WSSAPRRRVY LALWVPNGLI VGCESLFVPF
APEQAGTLFA FAASGMLAGD VLVGRFVPAR WQARLGAALL LLLAAPYLVF AVDPPVPLAV
AAVTVASIGY AASLVLQQRL MDLTPAEMSG HALGLHSSGM ITMQGVAAAL AGTLAQYTSP
GTAIAVMAAA SVTVTLALAR GLSGPARTGP SQRARADIER HGDAGNPAAP PQAGHLAGNA
PPATGNRRTT
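
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last blocks of the sequence. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code, and it does not model table 11's alternative start codons (not needed here, since this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGCGTACTT ACCGCGAGCT CCTGCGCACG"))  # MRTYRELLRT
# Last 33 bases of the gene sequence, including the TGA stop codon.
print(translate("CCGCCGGCCA CCGGAAACCG GCGGACGACG TGA"))  # PPATGNRRTT
```

The two outputs match the first and last ten residues of the protein sequence listed above.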