Gene Sros_8893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8893 
Symbol 
ID8672231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9820607 
End bp9821815 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003344268 
Protein GI271970072 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGGA CCTCCCTCCG GTGGCACGCA CAAGGAGCGC CCGCCCGCGC CGCGTGGCGG 
GCTTGGGCGC GTGGGGACAT CGACGACGCC GAGCGCCTGG CCGCGCATGC CGGTCACCGG
CAGCACGCGC ACCTCCGGTT CCTGACCAGC TACGTGCGCG GAAACTACGA GCAGGCGCTG
ACCCACTACG AGGCCATTCG CCGCCTGTAT CCGGCCTACA CGGAACTGGA TGAACCCGCC
GCGCATGCTC TGCTGCATCT GGACCGGCCG GCTGAGGCCT ACGCGCACGT ACAGCGGCGC
CGGAGGAAAC GACCGCTGCC CCCCGACCTC GTGTCGCGGA TGGATCACCC GCTGGGCGTC
GAGATCGACC ACGCGACCGT CCTTCCCTTC GCCGATCACG CGCTGGCGCC CTACCTGCCC
GCCGTGGACG CCACGCTCGA CGGCCACCCT GTGCGCACCC ACATCGACAC CGGCGGCACG
TTCCTCGTCA TGGGAACCAG GCGCGCCGAC GCACTGGGCA TCCGGCTGAT CTCCAGCGGA
AAGAATCATC ACGGTACGAC CCGCACCGAC CTCTACACCG GGATGGCCAG GGAACTGACG
CTCGGCGACG TCGTCTTGAC CAACGTGCCG GTGGAGGCCA TGCCGACGCT ACGCGACGAC
CAGGACCTCG TCATCATCGG CACCAACGTC CTGCAGCGGT TTCTCACCAC CGTCGACTAC
CCCCGCCGGC GCCTGCTCCT GTCACGACGC CGCGACCCAC GGCAGGCGGC CGACCATCTT
GCGCTTCTCG ACGGCCGGCC GGAGGTCGCC CGGGTCCCCT TCTACCTGTG GGCAGACCAC
TACATGTTCG CCCGTGGAGG CTTCGGCACC CGGCAGGACC TCAACTTTTT CATCGACTCC
GGACTGGTCT ACGTCGGCCA GGAGGACGGC TCACCGCCCC GCCAGGCATG CCTGTACACG
ACCGCGCGGC GATACCGCTC CTGGGGTGTG CACCGGGCTC GCGCGGCCCG CCCGCACTTC
AGCGTCGACG AACCGATCCG CCTGGGACCG CTCCGTCAGG ACGACCAGTT CGTGGCCACG
ACGCCGGCCC GGCGTGTGCC CTGGGCGTCG TTCGGGGGCG TTCGCATCGA CGGCCTGCTT
TCCCACTCCT TCCTCGACAA GTACGCCTGG ACCCTCGACT TCGACCGGCA CGAATACACA
TTCCGATGA
 
Protein sequence
MSWTSLRWHA QGAPARAAWR AWARGDIDDA ERLAAHAGHR QHAHLRFLTS YVRGNYEQAL 
THYEAIRRLY PAYTELDEPA AHALLHLDRP AEAYAHVQRR RRKRPLPPDL VSRMDHPLGV
EIDHATVLPF ADHALAPYLP AVDATLDGHP VRTHIDTGGT FLVMGTRRAD ALGIRLISSG
KNHHGTTRTD LYTGMARELT LGDVVLTNVP VEAMPTLRDD QDLVIIGTNV LQRFLTTVDY
PRRRLLLSRR RDPRQAADHL ALLDGRPEVA RVPFYLWADH YMFARGGFGT RQDLNFFIDS
GLVYVGQEDG SPPRQACLYT TARRYRSWGV HRARAARPHF SVDEPIRLGP LRQDDQFVAT
TPARRVPWAS FGGVRIDGLL SHSFLDKYAW TLDFDRHEYT FR