Gene Sros_2594 details


Gene Information

Locus tag: Sros_2594
Symbol:
ID: 8665880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2821948
End bp: 2823999
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003338307
Protein GI: 271964111
COG category:
COG ID:
TIGRFAM ID:
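
The coordinate and length fields above agree with one another, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2821948
end_bp = 2823999

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2052 683
```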


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.397172
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCCCC TGCGCAGGGT CGCGACCGCC GTCGCGCTGA CCCTCGGCCT GCTGCTGCCT 
GCCTCCGCCG CCCTCGCCCA CGAAGAACGC CCCGTGACCT TCCCCGACGG GTCGGGCAGC
GTCCCGGTGC GCCGTACCGG GGAACCCGAC CTGCTGGTCT GCAAGAGCGA CCGTGCCGAC
TTCGGCAGGC GGATCGCAGG CTTCCCGCCC GATCTCAGGC GGCGCAACCT GGAGCTGTTC
GAGCGCTGCC AGGAGAAGGG GTACCGCCAC CTGCAGGAGG CGGTGGACCG GGTCGACCGC
CCCGGGATGA CCGTCGCGAT CCTGCCCGGC GTCTACCTGG AGGAGCCCAG CCAGCCGGAA
CCCACCGGCG CCTGCGCCCA CCTGAACGCC CGCTGGTCGT CGTGGGGCTA CCAGATCCTC
TCCTACGAGC AGCAGAAGCA GTGCCCGCAC AACCAGAACC TGGTCGCCAT CCTCGGGATC
AGGAACCTGC AGATCGAGGG CACCGGCGCG GAACCGACGG ACGTGATCAT CGACGCGCAG
TACAAGAAGC TGAACGCGAT CCGGGCCGAC ATGACCAACG GGATCTATTT CCGCAACTTC
ACCGCTCAGC GGACCACGTT CAACTCGATC TACATCCTGG CCACCGACGG CTTCGTCATC
GACACGATGC TCACCCGCTG GAACGACGAG TACGGCTTCC TGACCTTCGC CAGCGACCAC
GGCCTCTACA CCGACTGCGA GTCGCACGGC AACGGCGACT CGGGCATCTA TCCGGGAAGC
GCCTCCGACA TCAACGACGG CCGTGGCCAC GACGTGCCGC GCTACTCCAT CGAGATCCGG
CGCTGCAAGA GCCACCAGAA CATGGTGGGC TACTCCGGTA CGGCGGGCGA CTCGGTCTAC
GTCCACGACA ACGAGTTCTA CGACAACATG GGCGGCGCCT CGATGGACAG CGCGTTCGCC
GGGCATCCGG GCCTGCCGCA GAACCACGCG CGGTTCGAGC GCAACCGGAT CCACGACAAC
AACCAGGACT ACTACCGGTA CGTCGCCGAC GGCACCTGCG CCAAACCCGC GGCCGAACGC
GGCTACGAGC GAGGTGTCGT CTGCCCGCAG ATCGGCATGC CCCGGGGTAC CGGGATCGTC
ACGGCGGGCG GCAACTGGAA CGTCTTCCGC GACAACTGGA TCTGGGGTCA CGAGTCCGCC
GCCTTCGGCC TGCTCGCGGT GCCCGCCTTC ATCCGGGGCG AGGAGTCCTG GGACAAGCAG
TTCGACACCT CCCACCACAA CCTCTACGAG GGCAACCACC TCGGAGTGAC GCCCGAGGGG
ACCCGCCGCC CCAACGGGGC CGCCGTCTCC TGGGACGGCC AGGGGAGCGG CAACTGCTGG
CAGCGCGGCA TCGGCCCCTC CACCCCCTGG GCCCTGCCCG CCTGCGGCTC GGCCGGGCCG
GCCCGGATCC TCAGCGAACC CGTCAAGATC GCTAAGAACT ACCTGTGCAA CGACTACAGC
GTCAGCGAGA GGCGGCTACC GGCCGGATGC GACTGGTACG GCTCGACCGG CCTTGCCAGG
GTCGAGGTCC AGCTGGCCCT CGCCACCTCC CTCGTCCTCG GCCTGATCGC GATCCTGCTC
CAGCTCCGCA GGCTCCGCTC GCGCGCGGGA CTCGCCGTCA CGTCCGCCGG CCTCGCGGGA
CTGACCCTGG ACGTCTTCGG CGCGGCCCAC ACCTACACCC CGCTGCCCGC CGCCGCCCTG
GCACTGATGG GCGTGTGGTG GATCGGGGCC GGATGGCTGT GGCTCGGGAG GGGAGGAGCG
GCCGGGGCGC GGACCGGGGC CGGGGGACCG TCGCGCGGCC GGGCGTTCGG CTGGTTCACG
GCCGTGCTCG GCGCCCTCGC CCTGATCGAC GCGTTCGACA AGGCGGTCAT GATGATCCCG
CTGCTGCCGC TCGGGCCCGG CTGGATCCGG GGCCTGCTCG CCGGCGTCTG GACCCTCTGG
GCGATCGTCC TGCTCGCCCC GAAGCCGCGC CGTACGGCCG CCGCCGCGCG CATCGAGGAA
GTGCCCGCGT GA
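
The gene sequence above is printed in space-separated blocks of ten bases, so any analysis needs to strip the whitespace first. The sketch below shows one way to compute a GC fraction from text in that layout. The function name `gc_fraction` is hypothetical, and the example runs only on the first two blocks of the listing, not on the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two blocks of the gene sequence listed above (14 of 20 bases are G or C).
print(gc_fraction("ATGCGTCCCC TGCGCAGGGT"))  # 0.7
```

Pasting the whole listing into the call should give a value that can be compared with the GC content field in the Gene Information section.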
 
Protein sequence
MRPLRRVATA VALTLGLLLP ASAALAHEER PVTFPDGSGS VPVRRTGEPD LLVCKSDRAD 
FGRRIAGFPP DLRRRNLELF ERCQEKGYRH LQEAVDRVDR PGMTVAILPG VYLEEPSQPE
PTGACAHLNA RWSSWGYQIL SYEQQKQCPH NQNLVAILGI RNLQIEGTGA EPTDVIIDAQ
YKKLNAIRAD MTNGIYFRNF TAQRTTFNSI YILATDGFVI DTMLTRWNDE YGFLTFASDH
GLYTDCESHG NGDSGIYPGS ASDINDGRGH DVPRYSIEIR RCKSHQNMVG YSGTAGDSVY
VHDNEFYDNM GGASMDSAFA GHPGLPQNHA RFERNRIHDN NQDYYRYVAD GTCAKPAAER
GYERGVVCPQ IGMPRGTGIV TAGGNWNVFR DNWIWGHESA AFGLLAVPAF IRGEESWDKQ
FDTSHHNLYE GNHLGVTPEG TRRPNGAAVS WDGQGSGNCW QRGIGPSTPW ALPACGSAGP
ARILSEPVKI AKNYLCNDYS VSERRLPAGC DWYGSTGLAR VEVQLALATS LVLGLIAILL
QLRRLRSRAG LAVTSAGLAG LTLDVFGAAH TYTPLPAAAL ALMGVWWIGA GWLWLGRGGA
AGARTGAGGP SRGRAFGWFT AVLGALALID AFDKAVMMIP LLPLGPGWIR GLLAGVWTLW
AIVLLAPKPR RTAAAARIEE VPA