Gene Sros_5889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5889 
Symbol 
ID8669183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6454041 
End bp6455555 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003341367 
Protein GI271967171 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0952006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.418244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTAC GGCGGGTGAT GGGCATCGAG ACGGAGTACG GCATCTCCGT GCCCGGACAG 
CCGGGGGCCA ACGCGATGGT GACCTCCTCT CAGGTCGTCA ACGCCTACCT GGCCGCCTCG
GCCGCCAGGG CGCGGCGGGC ACGATGGGAC TTCGAGGAGG AGAACCCGCT CCGCGACGCC
CGGGGCTTCG ACCTGTCCAG GGAGGTGGCC GATCAGAGCC AGCTCACGGA CGAGGATCTG
GGCCTGGCCA ACGTGATCCT CACCAACGGG GCTCGGCTCT ACGTCGACCA CGCCCATCCC
GAATACTCCA CTCCCGAGTG CACCAACCCG CGGGCCGCGG TGATCTGGGA CAAGGCGGGG
GAGCGGGTCA TGCACGACGC CGCGGTCCGC GCCTCGGCCA TCCCCACCAA CGCCCCGATC
CAGCTCTACA AGAACAACAC CGACAACAAG GGCGCCTCCT ACGGCTGCCA TGAGAACTAC
CTGATGCGGC GGGCCACGCC GTTCGCCGAC ATCGTCAGGC ACCTGACGCC CTTCTTCGTC
TCCCGGCAGG TCGTCACCGG CGCCGGCAAG GTCGGCATCG GGCAGGACTC GCGGGGCGAG
GGCTTCCAGA TCAGCCAGCG GGCCGACTTC TTCGAGGTCG AGGTGGGCCT GGAGACCACG
CTCAAGAGGC CGATCATCAA CACCCGGGAC GAGCCGCACG CCGACCCGGA GAAGTACCGC
CGCCTGCACG TCATCATCGG CGACGCCAAC ATGTCGGAGA TCTCGACCTA CCTGAAGCTG
GGCTCCACCG CGCTGGTCCT GGCCATGATC GAGGACGGCT ACTTCACCCG CGACCTCGCC
GTGGAGAACC CCGTCCAGGC GCTGCGCGCG GTCTCCCACG ACCCGACGCT CAAATACGAG
ATCGCGATGC GCGACGGCCG CAAGCTCACG GCGGTCCAGC TCCAGATGGA GTATCTGGAG
CTGGCGCGCA AGTACGCCGA GGAGCGCAGC GGCAACGGCG TGGACGAGCT CACCAAGGAC
GTCCTGGACC GCTGGGAGTC GGTCCTGACC CGCCTGGCCG AGGACCCGAT GCAGCTGTCC
AGGGAGCTGG ACTGGGTGGC CAAGCTGGAG CTGCTGGAGG GCTACCGCAC CCGCGACGGC
CTTCCCTGGT CCCATCCGCG CCTGCAGCTC GTCGACCTTC AGTACTCCGA CATCCGTCCC
GACCGAGGCC TGTACAACCG GCTGGTCGCC CGGGGCCGGA TGCAGCGCCT GGTCACCGAG
GAAGAGGTCC AGCGGGCCAT CGAGGCCCCG CCCAGCGACA CGCGGGCCTA TTTCCGGGGA
CGTTGCCTGA GCCAGTTCAG CGAGTCGGTC GCGGCCGCCT CCTGGGACTC GGTCATCTTC
GACATCCCCG GCCGCGAGTC CCTGCAGCGC GTCCCGACCA TGGAGCCGCT GCGCGGTACC
AGAGCGCACG TGGGCGAGCT GTTCGACCGC TGCCGTACCG CCGCCGATCT GGTGAACGCG
CTCACCGGCG AGTAG
 
Protein sequence
MTVRRVMGIE TEYGISVPGQ PGANAMVTSS QVVNAYLAAS AARARRARWD FEEENPLRDA 
RGFDLSREVA DQSQLTDEDL GLANVILTNG ARLYVDHAHP EYSTPECTNP RAAVIWDKAG
ERVMHDAAVR ASAIPTNAPI QLYKNNTDNK GASYGCHENY LMRRATPFAD IVRHLTPFFV
SRQVVTGAGK VGIGQDSRGE GFQISQRADF FEVEVGLETT LKRPIINTRD EPHADPEKYR
RLHVIIGDAN MSEISTYLKL GSTALVLAMI EDGYFTRDLA VENPVQALRA VSHDPTLKYE
IAMRDGRKLT AVQLQMEYLE LARKYAEERS GNGVDELTKD VLDRWESVLT RLAEDPMQLS
RELDWVAKLE LLEGYRTRDG LPWSHPRLQL VDLQYSDIRP DRGLYNRLVA RGRMQRLVTE
EEVQRAIEAP PSDTRAYFRG RCLSQFSESV AAASWDSVIF DIPGRESLQR VPTMEPLRGT
RAHVGELFDR CRTAADLVNA LTGE