Gene Sros_5367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5367 
Symbol 
ID8668661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5879468 
End bp5880682 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003340873 
Protein GI271966677 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.071544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.251798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCTTG ACGTCATCCG GCGCGGCATC GACGCGCTCC GCGAGTACGA CACGGACCTC 
TGCTGCTTCG GCGCCCGCGA GCACCGCTAC CGGTTCAAGC CGCCGCTCGA CGAGGCCGCG
TTGGCGGCCG TCGAGGCGCG GATCGGAGTG CGCTTCCCGG CCGATTACCG CACGTTCTTG
ACACGGCTCG GCAACGGCGG CGCCGGCCCG TACTACGGGG TTCACGGGGT GCGGCCGGAC
GGCGATTGGG CGCGGTTCCG CCCGTTCCCG TTCGCGCAGG AGTGGGAGCC GCCCGACCAG
GACGACGAGG ACTACGACGA CGTCATGGAA GCCGCGTTCG AGGGGCTGCT GCCGGTCGCC
GAGCATGGCT GCGGCTACCG CTCTCACCTC GTCGTCAAGG GCCCCGCCGC GGGCCAGGTC
TGGGGCGACT GGACGTGCGT CGGTGAGGTG CTGGCCCCCG AGGCCGAGTC GTTCGGCACG
TGGTACCACG ACTGGCTGGA GAGCTCGCTG CGGGAGGTGC TGGGTGACCG GATCACGGCA
ACCGTGCACG ACGAGACCGG CTGGAGCGTT GACCGGCGGC TGCTCGGCCT CCTCCCGCCG
CCGCCCGCCG GTGGCGACGC ACAGCCGGAG GTAAGGGTCC TACGGCTGCT CAGGCGGATC
TACCCCGCCC TCTACGAACG CCGCCACGAC GACGCCCGCG ACCTCCTCGC CCAAGCCCGC
GCGGTCGGCG CTCCGGCGGC CTACGAGGTG GGCATCGCGC TTGCGGACGC CGTGCTCCTG
CGCGAGGAAG GCAGGATCGC CGACGCCCTG ACCACCGTGG AACATACGAT CCCGCGGTGC
GGCTGGCCAT TCGAGAAGGC GCGGCTGCAC CGCCTGCGAG TCGAGCTGCT CCTGATGCAG
AGCCGGCTGG ACGACGCGCG TGCGGCGACC GAGGAGCACA TCGCGCACTG CCCCGATGAC
GACTTCGGCT ACGTACGCCG TGCCCTCCTG CTGCTGATGA CCGGCGATCT CCCAGCCGCC
GAGAACGTCC TGCGCGCCGA CGCCCCGCTC GGGAGAGGAT TCGGATCGGT AAGCCACCCG
TATCCCGCCG ACCGAGCCGC GACCGCGCTA CGGCTCCGTG CCCGGCGCCT GGCCTGGGAG
TGCCGCCGCT GGGGACATCC CACCAACGCG CTCCGCTTCG ACGCGATCGC CACCAGCCAA
TCGGCCTGTC GTTGA
 
Protein sequence
MDLDVIRRGI DALREYDTDL CCFGAREHRY RFKPPLDEAA LAAVEARIGV RFPADYRTFL 
TRLGNGGAGP YYGVHGVRPD GDWARFRPFP FAQEWEPPDQ DDEDYDDVME AAFEGLLPVA
EHGCGYRSHL VVKGPAAGQV WGDWTCVGEV LAPEAESFGT WYHDWLESSL REVLGDRITA
TVHDETGWSV DRRLLGLLPP PPAGGDAQPE VRVLRLLRRI YPALYERRHD DARDLLAQAR
AVGAPAAYEV GIALADAVLL REEGRIADAL TTVEHTIPRC GWPFEKARLH RLRVELLLMQ
SRLDDARAAT EEHIAHCPDD DFGYVRRALL LLMTGDLPAA ENVLRADAPL GRGFGSVSHP
YPADRAATAL RLRARRLAWE CRRWGHPTNA LRFDAIATSQ SACR