Gene Sros_2982 details


Gene Information

Locus tag: Sros_2982
Symbol:
ID: 8666269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3250218
End bp: 3251828
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003338679
Protein GI: 271964483
COG category:
COG ID:
TIGRFAM ID:
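
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 3250218
end_bp = 3251828
gene_length_bp = 1611
protein_length_aa = 536


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Amino acids encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 3251828 - 3250218 + 1 gives 1611 bp, and 1611 / 3 - 1 gives 536 aa, matching the listed values.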


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.688388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.388989
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGGC CAAGAGAGGC CATGCAGTCG GCGATGGCCG CTGCGCAGGC GATAGGGCAG 
GGAGGCGACC CGCGGCAGGT GGTGGGCCAG CTGGCCAACC AGGCGATGGG GCAGCTCGCG
GGCCAGGCGG GCAGTGAGAA GGGATTCGCG CTCAACCCCT CCGACTTCCT GGCCGCCCGC
GACCAGGGCG CGTCCCTGGG CACGGTCATC GAGGCCCGTG GCGCGTCGCT GAACGAGGCC
GGCGAGATCG TCAACCGCAG CTTCACCGGC AACGGCCCGG ACGGCCAGGC GGTCCACGTG
ATCTCGCCGG TGGTGATCCC CAAGAGCACC ACCGCCCGGG TGGTCGTCCC CCTGATCGTG
CTGGCGGTGC TCGGCCTGGT CGGCCTGGTC GCCACCGGTC TCCCCGAGGG CGCGCGGGTG
CTGTTCGGCC CGCACTACTG GGCCGTGCTG GTGCTGGCCG CCGCGTTCCT GTGGTGGCGG
CGGAGCGTGG TGATGGTGCC CGAGGGCTGC AAGGCTCTGA TCACCAAGTT CGGCAAGCTC
GTGCAGATCG CCGAGCCCGG CAGGGTGACG CTGCTCAACC CGTGGAAGCG GGTCAGCTAC
ATCGTCAACA CCACCCGCGA ATACCCCTTC AACGCCCCCA TCCGCGAGGC GCCGACCCAG
CAGGGCGTCA AGGCCAGCGT GGACCTGTTC CTGCAGTTCC GGATCGAGGA TCCGGCCGAG
TTCATCTTCG TCCTCGGCTC GGTCAGCGGC TTCCAGGCCA AGCTGCAGAA CGCCATCAGC
GAGGTCACCC GCTCCCTCAT CTACGCCCAG CGCGCCGAGG ACATCTACGA TCTGGTCGGG
GAGAGCACCC TGGGCATGCT GGACAACCTC AACCAGCAGT TCCTGCCCGC CGTACGGCTC
ACCGACGTGA ACATCACCCA CGCCGAGCCG TCCAGCCAGG AATACCGGAT GGACCTGGCC
GCCCCGGAGA TGATCAGGGT CGCCAAGGAG GCGTACACCT ACGAGTACGA GCTCCAGCTG
CGCAAGGAGC AGAACGAGGG CGACCTGATC AAGGAGCTCG CCGGGCTGCA GGAGCAGCTC
TCGGCCATCC ACGCCGAGAT CGCCGGCTAC CAGGCCCGGA TGGACACCGC GCTGGAGCGC
GCGTCCCACC AGGCGAAGGC GCAGGCGGGC CAGCGGCTGG TGGAGGCGGA GTCCACCGCC
AAGGCCAACG CCGCGCTGCT GGAGGCGCAG GCGCTGGACA TCCGCGCGCT GAGCGCGGCC
GAGGCGCCGG AGATCCTGGA GTACCGGTTC CAGCAGGACC TGCTCGACAA GCTGGAGTCC
GTGGCCTCCC ACCTGCCCCG CGTGGTGCAG GTCGGCGAGA CGACCGACAT CGACCTGCTC
TCCCTGGCCA GGCAACTGGT CGGCAGCCGG GAGGCGCAGC TGTTCTCCGC GGCGGACATG
ACGGCGATCA GGGAACGGAT CACCGAGATC GGCAGGCGGG TCGAGGGCCG CGAGGCGGAG
ATCACCACCC TGCTCAACCC GGTCGCGGAG ACCGCGGCGC CCGCCGTACG GGAAACTGCG
GCGCCCGCCG TACGGGAGAC CGCGGCGCCC GCCGCACAGG AAGAGGCCTG A
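
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be normalized before computing statistics such as GC content. The following is a minimal sketch of that step; the helper names are hypothetical, and the exact method the source database used to derive its GC content figure is not stated here.

```python
def clean_sequence(text):
    """Strip all whitespace (block spacing, line breaks) and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as displayed, with its block spacing intact.
first_line = "ATGTCCCGGC CAAGAGAGGC CATGCAGTCG GCGATGGCCG CTGCGCAGGC GATAGGGCAG"
print(len(clean_sequence(first_line)))  # number of bases on that line
print(round(gc_fraction(first_line), 2))
```

Passing the full sequence text (all lines joined) to `gc_fraction` would give a value to compare against the GC content listed in the Gene Information table.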
 
Protein sequence
MSRPREAMQS AMAAAQAIGQ GGDPRQVVGQ LANQAMGQLA GQAGSEKGFA LNPSDFLAAR 
DQGASLGTVI EARGASLNEA GEIVNRSFTG NGPDGQAVHV ISPVVIPKST TARVVVPLIV
LAVLGLVGLV ATGLPEGARV LFGPHYWAVL VLAAAFLWWR RSVVMVPEGC KALITKFGKL
VQIAEPGRVT LLNPWKRVSY IVNTTREYPF NAPIREAPTQ QGVKASVDLF LQFRIEDPAE
FIFVLGSVSG FQAKLQNAIS EVTRSLIYAQ RAEDIYDLVG ESTLGMLDNL NQQFLPAVRL
TDVNITHAEP SSQEYRMDLA APEMIRVAKE AYTYEYELQL RKEQNEGDLI KELAGLQEQL
SAIHAEIAGY QARMDTALER ASHQAKAQAG QRLVEAESTA KANAALLEAQ ALDIRALSAA
EAPEILEYRF QQDLLDKLES VASHLPRVVQ VGETTDIDLL SLARQLVGSR EAQLFSAADM
TAIRERITEI GRRVEGREAE ITTLLNPVAE TAAPAVRETA APAVRETAAP AAQEEA