Gene Sros_5052 details

Gene Information

Locus tag: Sros_5052
Symbol:
ID: 8668346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5575853
End bp: 5577388
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003340585
Protein GI: 271966389
COG category:
COG ID:
TIGRFAM ID:
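
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 5575853
end_bp = 5577388

# Inclusive span on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Number of complete codons in the coding sequence.
codon_count = gene_length_bp // 3

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the computed values agree with the 1536 bp and 511 aa listed in the record.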


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.164457
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGTT TCCTGACCTT GACCATCGGG CTGCTGGCCG CCGTGGCCCT GCCCGCCATC 
CCGGCCCACG CCGGTACCCC TGACGGCACG CTCTCGCTCC GGACGGCGAG CGTGAAGGCG
GGTGACCCCA TAGAGCTGTC CTACTCCACG CCGCGCCCCG ACTCCAAGAA CTGGATCGGC
CTCTACGCCG ACCCGGGCAA CGGGCCCGTT GACCAGAAGT ACGTCGGTCC CTCGCTGAAG
TGGGTGTACA TCCCCTCCGG CTCGGGTACG GCCACCCTGC CCACCGACGG GCTCGAACCG
GGCGACTACG TCACCTACGC CCTGGCCAAG GACGGCTACG AGTGGCTGGC GGCACCCGCG
AAGGTGAAGA TCGCGAGCAG CGAGCCGCTC CACTTCGTGA CCCGCGCATT CACCCTGCGC
AACGCCCGCG CCAAGTCCCC GTACACGGCC ACCGTCAAGG GCACGGTCCG CGGCGAGGCG
AGCTTCCGGA AGGCGGACGG CCCGGCCTGG GTGCGGGTCG GCCAGGACGG CACGGTGACC
GGCACGCCGC CGGCGTCGAC GTCGGCGAAG ACGGCCACCT TCACGGTGGA GGCGCGCAAC
CAGGCGGGCG AGAGCGCCAC CGCCACGGTC GGCGTCCGCG TACGGCCGCC GGGCGGGCCG
CTGGTGCCCG AGCTCAGGAC CATGTCCTGG AACCTGTGGC ACGGCGGCAG CCAGGTCAAG
GGCGGCAGGG AGAAGCAGCT GAAGTTCCTG CTCGACCGCG ACGTCGACGT GGTCGGCATG
CAGGAGACGT CGTCCACGTC CGCCAAGGAA CTGGCCGAGG CGCTCGGCTG GGACCACTAC
CAGGCGGGCG CGGACCTGGG CATCGTCAGC CGGTACCCGA TCGTCTCGCG CGGGCCGCTG
CCCTCCGAGT CGGGCCTGGC GGGGATCAAC GCGAAGATCA GGCTGGACGA CCGGCACGAG
GTGGCCGTCT GGAACGTGCA CCTCGGCTAC ACCCCTTACG GCCCCTACGA TGCGTGCTTC
GGAAAGTGGG GCGTCGAGCG GCTGATGGCC AGGGAGGCCG AGTCGAAGCG CACGCGCCAG
ATCCAGGAGA TCATGTCGGC CATGTCCGGC GACCTCGCGG ACGCGAGCCG TACACCCGTC
CTGCTGACCG GTGACTTCAA CGCCCCCTCG CACCTCGACT GGACTGCGGA GACCAGGAAG
TGCGGTTACG ACTCCGTGTC CTGGCCCACC TCGGTCGCTC CCGAGCAGTC CGGGATGAAG
GACTCCTACC GCGTGGCCCA CCCCGATCCG GTCGCCGACC CCGGCATCAC CTGGTCGCCG
ATCTACACCA CGTTCACCGG CGGGTACGAT CACGACGGCC ACAAGGGGGA GCCCGAGCCG
CAGGATCGCA TCGACTTCGT CCACTACAAG GGCGATCTCA AGGTGAAGTC GTCCGACGCG
GTCGTCGAGG GCACCCCGGC ACCGATTCCG AACCACAAAG ACAACGCCTG GACCTCCGAC
CACGCCGCCG TGCTGACCAC GTTCGCCGTC CGTTGA
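
The gene sequence is printed in blocks of ten bases separated by spaces. A minimal sketch of how such text might be cleaned up and its GC fraction computed is given here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input, so the printed value is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCCCGTT TCCTGACCTT GACCATCGGG CTGCTGGCCG CCGTGGCCCT GCCCGCCATC"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full 1536 bp sequence through the same two helpers would give the figure to compare against the GC content field.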
 
Protein sequence
MSRFLTLTIG LLAAVALPAI PAHAGTPDGT LSLRTASVKA GDPIELSYST PRPDSKNWIG 
LYADPGNGPV DQKYVGPSLK WVYIPSGSGT ATLPTDGLEP GDYVTYALAK DGYEWLAAPA
KVKIASSEPL HFVTRAFTLR NARAKSPYTA TVKGTVRGEA SFRKADGPAW VRVGQDGTVT
GTPPASTSAK TATFTVEARN QAGESATATV GVRVRPPGGP LVPELRTMSW NLWHGGSQVK
GGREKQLKFL LDRDVDVVGM QETSSTSAKE LAEALGWDHY QAGADLGIVS RYPIVSRGPL
PSESGLAGIN AKIRLDDRHE VAVWNVHLGY TPYGPYDACF GKWGVERLMA REAESKRTRQ
IQEIMSAMSG DLADASRTPV LLTGDFNAPS HLDWTAETRK CGYDSVSWPT SVAPEQSGMK
DSYRVAHPDP VADPGITWSP IYTTFTGGYD HDGHKGEPEP QDRIDFVHYK GDLKVKSSDA
VVEGTPAPIP NHKDNAWTSD HAAVLTTFAV R
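
The protein sequence should follow from the gene sequence under translation table 11. The sketch below builds a codon table from the amino-acid string NCBI publishes for that table (its codon-to-amino-acid assignments are the same as the standard code) and translates the first and last lines of the gene sequence as a spot check. The `translate` helper is hypothetical, handles only unambiguous A/C/G/T codons, and always translates the first codon by table lookup rather than treating it as a start codon.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCCCGTT TCCTGACCTT GACCATCGGG CTGCTGGCCG CCGTGGCCCT GCCCGCCATC"
last_line = "CACGCCGCCG TGCTGACCAC GTTCGCCGTC CGTTGA"

print(translate(first_line))
print(translate(last_line))
```

The two translated fragments can be compared with the first 20 and last 11 residues of the protein sequence above.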