Gene Sros_5342 details

Gene Information

Locus tag: Sros_5342
Symbol:
ID: 8668636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5855810
End bp: 5857081
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003340849
Protein GI: 271966653
COG category:
COG ID:
TIGRFAM ID:
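
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 5855810
end_bp = 5857081

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1272 bp) and protein length (423 aa).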


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.764052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.588325
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAACCG ACCCGGCAGC GACCGCCGTC GCGGTTGTCG AGATGGCATG TGAGGGACGC 
TTCGCCGACA TCGAACGGCT CTTCACGCCC TCGCTGCGAG CGTTGGTCGG CGCCGAGGTG
TTGCAGGCCG CCTGGACAGC CGAGATGGGC AGGCGCGGAC CGGTTGGCAC TGTCGGGGAG
CCGGTGAGCG AGCCGGCCCA GGCGGGGCTG GTCCGTGTGA GTGTCCCGGT GGCCTGCGAG
CACGCGGGAC TCACCGTGGT GGTGGCGGTT GACCACGACG GCATGCTGCA CGATCTGCGG
CTTACCGCTA TCACCGCGCC CTGGACGCCT CCGCCGTACG CCGACCCGGC GAGGTTCGAC
GAGCACGACG TCACGGTCGG CGACGGCCCT CTCGCCGTGG CCGGCACGGT GAGCCTGCCG
CATGGGCCCG GCCCGCGAGC GGGCGTCGTG TTGCTCGGCG GGGGCGGGCC CTTCGACCGC
GACGCCACCA GCGGAGCCAA CAAACCACTC AAGGACCTGG CCTGGGGGTT GGCCGGTCGC
GGTGTCGCGG TGTTGCGGTT CGACAAGGTG ACCCACACCC ACAGCGAACA GGTGGCGAAC
GCAGCCGGCT TCACGATGAC CGACGAGTAC GTGCCGCACG CGGTCGCCGC CGTCCGGCTT
CTCCAGCGGC AGCCGGGCGT GGACCCCGCC CGCGTCTTCG TTCTCGGCCA CAGCATGGGC
GGTAAGGTCG CGCCGCGTGT CGCGGCTGCC GAGGCGTCCG TCGCCGGTTT GGTGATCATG
GCCGGCGATA CGCAGCCGAT GCACCAGGCC GCCATCCGCG TCATCCGTTA CCTCGCCTCG
CTGGATCCCG GACCGGCGAC GGAGGCGGCC GTCGAGGCGT TCACGCGGCA GGCCGCGATG
GTCGCCGGTC CCGACCTGTC ACCGTCGACG CCGACCGAGG CGCTGCTGTT CGGCTGGCCG
GCGGCGTACT GGCTGGATCT GCGCGGCTAC GACCCGGTCG CCACCGCGGC GGCGCTGGAC
AAGCCGATGT TCATCCTCCA GGGCGGCCGC GACTATCAAG TGACGGTGGC CGACGATCTG
TCAGGGTGGA AGGCCGGCCT CGCTCACCGG CCGGATGTCA CGATCCGCGT CTACGACGCC
GACAACCACC TGTTCTTTCC CGGCGCGGGT CCGTCCACGC CCGCGGAGTA CGAACCCCCG
CAACACGTGG ACCCGGCCGT CGTCGCCGAC ATCGCGGAGT GGCTGGCGCC GGAGCACGGG
AAGATCGCTT GA
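
The gene sequence above is laid out in space-separated blocks of ten bases. A minimal sketch of how such text might be joined into one string and its GC percentage computed is shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input, so the printed value is not the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = (
    "ATGGGAACCG ACCCGGCAGC GACCGCCGTC GCGGTTGTCG AGATGGCATG TGAGGGACGC"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1272 bp sequence through the same two helpers would give a figure to compare against the GC content listed in the gene information.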
 
Protein sequence
MGTDPAATAV AVVEMACEGR FADIERLFTP SLRALVGAEV LQAAWTAEMG RRGPVGTVGE 
PVSEPAQAGL VRVSVPVACE HAGLTVVVAV DHDGMLHDLR LTAITAPWTP PPYADPARFD
EHDVTVGDGP LAVAGTVSLP HGPGPRAGVV LLGGGGPFDR DATSGANKPL KDLAWGLAGR
GVAVLRFDKV THTHSEQVAN AAGFTMTDEY VPHAVAAVRL LQRQPGVDPA RVFVLGHSMG
GKVAPRVAAA EASVAGLVIM AGDTQPMHQA AIRVIRYLAS LDPGPATEAA VEAFTRQAAM
VAGPDLSPST PTEALLFGWP AAYWLDLRGY DPVATAAALD KPMFILQGGR DYQVTVADDL
SGWKAGLAHR PDVTIRVYDA DNHLFFPGAG PSTPAEYEPP QHVDPAVVAD IAEWLAPEHG
KIA