Gene Sros_1988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1988 
Symbol 
ID8665270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2139517 
End bp2141316 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003337719 
Protein GI271963523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCTG AGATGCAGAA GTTCATCAAG CTGCTGGAGA GCGAACTCGG ATACTCCGAG 
AAGAGCGGCG GTTACACCAA GTTCGGCCAC TGGTACGGCG ACAACGTCGA GTTCGACGCC
GACTACACGG CGGCTCCCTG GTGTGACATG TACCTGTCCT GGGCGGCCAA GAAGCTCGGC
TACGAGGACT GGGTCGGCCA GTTCGCCTAC ACCGTCTACC ACGCCGAGTG GTTCAAGGAG
CAGGACGCCT GGGGCACCAC TCCCAAGCCC GGCGCCATCG TCTTCTTCGA CTGGAGCGGC
TCGAAGAAGA TCGACAACAT CGACCACGTG GGGATCGTCA CCAAGGTGAC GGGCAGGACG
ATCCACACGA TCGAGGGGAA CATCGACGGG GGCGTCGCCA AGCGCAAGGA ACGCGACACC
GGCAAGGTCG TCGGCTACGG GTATCCGGAG AAGATCAAGG CCCGGCTGGA CAGGGAGGCC
TCCCAGAAGC AGACGGTCGT CACCGCCGAC GCCCGGGCGG ACAACTCGGT GGCGCTCACC
CCCGGGCCCA ACCTGCTCGC GATGGTCCCC CCGCCCGACC TCGGCCGGGA GGCCGAGACC
CCGGCGGCCA CGGCGGAGCC CGGCCCCAAG CACGCCAAGA CCGCCGCCAA GTCGCCCGCC
AAATCCTCCG CCCGGGAGAC GAAGGCCGCC GAGACGGCCC CGCGGACCAC CCCGAAGACC
AACCCGAAGA CCGGCGAGGC CACCCCGAGG ACCGGCGCGA CCGCCGTCGA TCCCCAGATC
TCCGCAGGCC GTACCACCAC GACCGGCAAG CACGCCAAAC CCGCCACCGC GGACACCAAC
GCGCTCGCGA CCACCCCGAC CGCGCTCAGC AGCGGCCTCA TGCCGCAGAC TCCCCAGCTC
GGCACGCCCG CCGTGCTCGC GCCGGTCCTG CTCGCCGCGG TCGCGATCAT CGCCCACGCC
AAGGCCCGGC AGTCGAGGAC ACGGCTCGCG TTCGCCGCGG GCGACAGCGC GCCCGTCCGC
CCGCCCCGCC GGACCCCGGG CCGCCGCCGC GCCCCCGGCC GCCGCCGGAT CACCAAGGGC
ACGCCCGTCC TGGCCGAGGA GCTCCGGATC ACCCCGGAGG CCGTCCCCGA GACCACGCTC
CCAGCCGCCG CGACGGCCGA CTTCCCCTCG GTCACGACCC CGCTCGGCCA CGTCAGGGAG
AGCGGGCCAC TCATCCCCGC CGAGCAGACC GGCCCTCTCA TCCCCGCCGA GCAGACCGGG
CCTCTCATCC CCGCCGAGCA GACCGGGCCT CTCACACAGG TCGAGCAGAC CGGGCCCCTC
ACCCGCGTCG GGGAGGCCGG ACTGGCCGAC CTCATCGGGG AGGCCGGACT GGCCAACCTC
ACCCTCGCCG AGCAGGCGAG CCTGGCCCGC GCCGAGGAGG CCATCCGGGC CAGTCTCGCC
CGCGACGCGC AGAGCGGCCT GGACGATCTC TTCCGGCCCG GGGAGAGCCT CCGGCCAGGC
CTCACCCACG CCGGCCGGCC CGGCCCGCCC GGCCCGGCCC GCGCCCAGGG AACCGGCCCG
GCCAACCGGC GCCACGACGG CCCTCAGGCC CCCTACCAGG GCAGGCGCCG CCTCCGCGAG
CGCCCGGTCG TCGAGTCCTC GACCTTCGTC CAGGACGCCC CGCTACGCGG CCGGCGCCAT
CGCCGGTCCG AGCCCGTGGT CAGCACGACC ATCCCGACCG GGCCACTCGC CGCCGAGCCG
TCCGACGTCC TGGTCCACAG CGGCTACCGG GGCCGCCGCC GGGCGAGGGT GCCGGCCTGA
 
Protein sequence
MTPEMQKFIK LLESELGYSE KSGGYTKFGH WYGDNVEFDA DYTAAPWCDM YLSWAAKKLG 
YEDWVGQFAY TVYHAEWFKE QDAWGTTPKP GAIVFFDWSG SKKIDNIDHV GIVTKVTGRT
IHTIEGNIDG GVAKRKERDT GKVVGYGYPE KIKARLDREA SQKQTVVTAD ARADNSVALT
PGPNLLAMVP PPDLGREAET PAATAEPGPK HAKTAAKSPA KSSARETKAA ETAPRTTPKT
NPKTGEATPR TGATAVDPQI SAGRTTTTGK HAKPATADTN ALATTPTALS SGLMPQTPQL
GTPAVLAPVL LAAVAIIAHA KARQSRTRLA FAAGDSAPVR PPRRTPGRRR APGRRRITKG
TPVLAEELRI TPEAVPETTL PAAATADFPS VTTPLGHVRE SGPLIPAEQT GPLIPAEQTG
PLIPAEQTGP LTQVEQTGPL TRVGEAGLAD LIGEAGLANL TLAEQASLAR AEEAIRASLA
RDAQSGLDDL FRPGESLRPG LTHAGRPGPP GPARAQGTGP ANRRHDGPQA PYQGRRRLRE
RPVVESSTFV QDAPLRGRRH RRSEPVVSTT IPTGPLAAEP SDVLVHSGYR GRRRARVPA