Gene Sros_3710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3710 
Symbol 
ID8666998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4108459 
End bp4110192 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content76% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003339376 
Protein GI271965180 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.582604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.151364 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCA CGCGTACGGC ACGTCCCGCC GGAGCCTCTC CCCAGGACCT TCTCGCTGAG 
ATCGCCGTCG ATCCCATGGG TCAGGTCACC GCCTCGGTCT ACGAGACGGC CCGGCTGGTG
GCGTCGGCAC CGTGGCTGAG CGGCCACCGG CGGCGGATCG AGTTCCTGAT CCGGGGCCAG
AACCCGGACG GCACCTGGGG AGGACCGGGC CCCTACGTGC TGGTCCCCAC CCTCAGCGCG
GTCGAGGCCC TGTTCACCGT GGTGAGCGCC GCCCGGCCCG GCGAGCGGGC CGCATCCGGC
CACAGCGATG TCGCGGGTGC CGTCCACCGC GGCCTGCACG CGCTGTTCGC GCGGCTGGAC
CCCCAGTCGA GAGCCGTCCT TCCGGACACC ATCGGGGCGG AGTTCCTCAT CCCCTGGCTC
GTCGGCGAGA TCAACCGGCA TCTGGACGCG CTGGAGAGGC ACCCCGTCCC CGGACTGGCC
GCCTGGCACG GCTCGGGGCG CCTCCACACA CCCGACGGGT GGGATCCGGC CCCCCTCGAC
CGGCTCCGCC ATGCCGCCGG GCAGGGGCAC GCGCTGCCCC CGAAGGTCTG GCACTCCCTG
GAGGCGCTGG GGGGCACCGC CGTACAGGCT CCCCAGGTGC GTCCCGTCGG CGGGGCCGTG
GGCTGCTCGC CGGCCGCCAC CGCCGCCTGG CTGGGCCCCG GGGACGGCGA CGGGGCGGGC
ACGGAGCGGC GGCGCGCGAT CGAATACCTG CACACCGTGC AGGCCAGGCA CGACGGCCCG
GTCCCGGGCG TGACGCCGAT CGGCGTCTTC GAGCGGGCCT GGGTGCTCTC GGCGTTGGCC
GAGGCGTTTC CGGGCACCGC GCCGCCCCGG GAGCTGGGCC ACAGCCTGCA CGCCGCCTTC
GGCGACCTCG GCGTGCCCGC CGGCGCGGGC CTGCCGCCCG ATTCGGACGA CACCGCCGGG
GCGCTCCACG CCCTGGCGCT CATCGGCAGG CCCCGGTCCC CGGAATGCCT CTGGGCCTAC
GAGGCCGACA CCCATTTCCG CTGCTTCGCC GCCGAGCGCA CTCCTTCCAC CAGCACCAAC
GCCCACATCC TCATGGCCTT CGGGGACCAT GAGGCCGGCG CGGCGGACCG GGCCCGCTAC
CGGCGGGCGA TCGGCAAGAT CTCCCACTGG CTGCACGGGC AGCAGCGGCC CGACGGGAGC
TGGATCGACA AATGGCACGC CTCCCCCTAC TACGCCACGG TCTGCAGCGT CTCGGCCCTG
GCCCGCCACG GGGACGCGTC GTCCGCCGCG GCGCTGCGCG CGGCCGCCGA GTGGGTGCTC
GGCACCCAGC GCCACGACGG GTCATGGGGC CTGTGGTCCG GCACCTCCGA GGAGACCGCG
TACGCGATGC AGACCCTGCT CCGGGCCCCC GCACGCGACG ACGGCGCGGT GCGCCGCGCC
GTGCTGCGCG GTGACGCCTT CCTCGCCGGG CAGCATCGCG CGGACCACCG TGACCACGAC
GGACGCGACG ACCACGACGG ATCCGGCGGG CGCCGCGGCC ATGACCGCCA CGGCGAGCGC
GGCGGACCCG ACGGACCCGA CGGACCCGAC GGTCACGGTC GCCGGGGCGA CGCCCGGCGG
TACCCGCCCC TGTGGCACGA CAAGGACCTC TACGCGCCCG TCGCGGTCAT CCGGGCCGAG
ACGCTGGCCG CCCGCACCCT GGCCCGGACC CGGATCCTGG AGAGGCAGCG ATGA
 
Protein sequence
MSTTRTARPA GASPQDLLAE IAVDPMGQVT ASVYETARLV ASAPWLSGHR RRIEFLIRGQ 
NPDGTWGGPG PYVLVPTLSA VEALFTVVSA ARPGERAASG HSDVAGAVHR GLHALFARLD
PQSRAVLPDT IGAEFLIPWL VGEINRHLDA LERHPVPGLA AWHGSGRLHT PDGWDPAPLD
RLRHAAGQGH ALPPKVWHSL EALGGTAVQA PQVRPVGGAV GCSPAATAAW LGPGDGDGAG
TERRRAIEYL HTVQARHDGP VPGVTPIGVF ERAWVLSALA EAFPGTAPPR ELGHSLHAAF
GDLGVPAGAG LPPDSDDTAG ALHALALIGR PRSPECLWAY EADTHFRCFA AERTPSTSTN
AHILMAFGDH EAGAADRARY RRAIGKISHW LHGQQRPDGS WIDKWHASPY YATVCSVSAL
ARHGDASSAA ALRAAAEWVL GTQRHDGSWG LWSGTSEETA YAMQTLLRAP ARDDGAVRRA
VLRGDAFLAG QHRADHRDHD GRDDHDGSGG RRGHDRHGER GGPDGPDGPD GHGRRGDARR
YPPLWHDKDL YAPVAVIRAE TLAARTLART RILERQR