Gene Sros_9095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_9095 
Symbol 
ID8672441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp10033310 
End bp10034713 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003344461 
Protein GI271970265 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGCAA AGGTGTTACT CGGCGGGCTG GTGTCCGCCA TGCTGCTCAC GCTGATCGGT 
GTGGCCGCCG GCCCGGCCAG CGCCCAGCAG TCGCTGCGGC TGCAGTACAG GACGAGCGCC
GCCGGAGGGA CCGCCGACCA GGTCGAGCCC TGGTTCGACC TGGTCAACGA CGGCGCCTCG
GCCGTGCCGC TGAGCGGCGT GAGAATCCGT TACTACTTCA AGGCCGACCC CGGCGCGTCG
CAGTACCGCT TCGCCTGCTC CTGGGCGGTG GTCTCCTGCT CCACGGTCAC CGGGACCTTC
GGCACGATCG CGCCCGGTAC GGCCACCGCC GACCGCTACC TGGAGGTGGG CTTCACCTCC
GGGAACCTGG CGGCCGGGGC GCGGACCGGT GACCTGCAAC TGCGCTTCCA CCGGTCCGAC
TGGCAGCGGA TCACGCAGAG CGACGACCAC TCCTTCGGTC CGGCCCGCAC GACCTACGGC
GACTGGACCA GGGTCACCGT CCACCGGAAC GGCGCCCTCG TCTGGGGTAC CGCGCCCACG
GGAGGCGACC CGTCCCCGTC GCCGACCCCC ACGGTCACGC CCACCAACCC CGGGGGCGGC
GGGGTGGTGT TCGACGACTT CACCTACGGC ACCTCGGACG ACCCGGCCCT CACCGCCCAC
CACTGGACCG TGCGGACCAA CAGCGGCGGC CCCGGCGTGC CCGGCGCCAC CTGGCCCAAG
GAGAACGTCA CCTTCCCGAC GGTCTCCGGC GCGAACAAGG CACTGCAGCT CAGGGCCGAC
ACCGACGGCA CCGGGGCGGG CACCCGGCAG TCGGAGGTGC TGCACCAGCG CAAGTTCCTC
GAAGGCACCT ACGCGGCCCG GGTGAAGTTC TCCGACGCCC CGGTGAGCGG TCCGGACGGG
GACCACATCG TGCAGACCTT CTTCACGATC ACCCCCCTGG CCTTCGACAT GGACCCGGAC
TACAGCGAGC AGGACTTCGA GTATCTGCCC AACGGCGGCT GGGGCGAGCC CGGCAACATC
ATGTACGCCA CGTCGTGGGA GACCTACCGC AACGAGCCCT GGGAGGCGGT GAACGTCCAC
AACGAGGTCC GGCAGAGCTA CGCGGGCTGG CACGACCTGG TCCTGCAGGT CTCCGGCGGC
CGGATCAGGT ACTACATCGA CGGCGCCCTC TTCGCCGAGC ACGGCGGCGT CTACTATCCG
GAGACAGCGC AGTCGGTCAA CTTCAACCTC TGGTTCATCA GCGGCGGCCT GGTCGGCAGC
TCGGCCCAGC GCGGCTACGT CCAGCAGGTC GACTGGTTCT ACCACTCCAA GAACGAGGTC
GTCGCCCCCG CCGAGGTGCT GAACCGGGTC ACCGGCTACC GGTTTGCCTC GACCGCCTGG
GTGGACACCG TTCCCAGTCC CTGA
 
Protein sequence
MRAKVLLGGL VSAMLLTLIG VAAGPASAQQ SLRLQYRTSA AGGTADQVEP WFDLVNDGAS 
AVPLSGVRIR YYFKADPGAS QYRFACSWAV VSCSTVTGTF GTIAPGTATA DRYLEVGFTS
GNLAAGARTG DLQLRFHRSD WQRITQSDDH SFGPARTTYG DWTRVTVHRN GALVWGTAPT
GGDPSPSPTP TVTPTNPGGG GVVFDDFTYG TSDDPALTAH HWTVRTNSGG PGVPGATWPK
ENVTFPTVSG ANKALQLRAD TDGTGAGTRQ SEVLHQRKFL EGTYAARVKF SDAPVSGPDG
DHIVQTFFTI TPLAFDMDPD YSEQDFEYLP NGGWGEPGNI MYATSWETYR NEPWEAVNVH
NEVRQSYAGW HDLVLQVSGG RIRYYIDGAL FAEHGGVYYP ETAQSVNFNL WFISGGLVGS
SAQRGYVQQV DWFYHSKNEV VAPAEVLNRV TGYRFASTAW VDTVPSP