Gene Sros_3272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3272 
Symbol 
ID8666560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3559577 
End bp3561982 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003338954 
Protein GI271964758 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCAGT GGTCACGGGT TCTGCGGATC GTGGTGGCCG CGGCGGGGAC GGTCACGGTC 
GGGGTGGCGG TCAACCAGAT CCTGAATGGC GGAACGTGGA ACTGGTGGGC GTTGGGCGCC
TCGGCGGGGC TCGCGATAAC GGCGGAAGGG ACGAACCGGT GGCTGGCCCG CCGGGAGGTG
GCGGAGAACG GTTCCGCTCC GTCACCGACG CCGCAGCCCG GTTCTCGGGG GGATCGGCCG
ATCGAGATAG CGGGGAATGT GTCAGGGATC ATCGTGTCTG GCGACAACGC CACGATCACG
CAGACGCATC AGACGGTGCT ACCGCCCCCC ACCCCGATCG CCGCCATCGA TGCCCCCGCC
GGGTTGGGGA ACCTGCCGTT GGCCGCAGGC GAGTTCGTCG GCCGTGAGGA AATCCTCGCC
CAGCTGGATT CGGCGATGGG CGGGGATGAG CCGGTGGTGG TGGCCGTGGT GCAGGGACTG
GGCGGGATCG GCAAGTCCAC CCTGGCCGCA CGATATGCCG CCCTCCACCA TGACAGGTTC
CATCCGGTGT GGTGGATCAC TGCGGACAGC CCCGCCGCGC TGGAGGCCGG ACTCGCCGCC
CTGATCACCG CGCTGGATCC TCGGGATGCT GAAGGCGTGG ATCTGAAAGC CCGGACAGAA
CGCGCGACCG TGTGGCTGGC CACCCACTCG GGATGGCTGC TGGTCCTGGA TGACGTCACC
CGTCCGCAGG ACATCGCTCC GCTGCTGGGC CGGGTGCGGT CCGGACGGGT GGTGGTGACC
AGCCGGCTCC GTCAAGGCTG GCAACGGATC GGCGCCCGGG TCCTGCACCT GGATGTCCTG
TCCGAAGACG AGGCCGTCGA TCTGCTGACC AGGCTCGCCC AACCCGAGGA TCCCGGCCAC
GATGTCCGGC CGGGCGCCCT CGACCTCGTG CAGGAGCTGG GTTTCCTACC ACTGGCGATC
GACCAGGTCG GCGCCTACCT CCACCAGACC GCCCTGACCC CAGCCGCCTA CCTCACCTTG
CTGCGCGCCC AGCCGGAAGT CTTCTTCGAC CAAGCCGCCG AAGGCGCCGA CAGCGACCGG
ACTGTCGCCC GGATCTGGCG GATCACCCTC GACCAGCTCA CTGGCTCTTC CCCGCTCCCG
GGCGATCTCC TGCGGATCTT GGCATGGATG GCACCAGACG CCATTCCCCG CACTCTCCTC
GCCCCCCTCA CCAGGCCACG TCCCGGCGCC CCTCGCCGCC GGCGGTGGCG GCGCACACGG
CACGGTTGGG GGGCACGGCC TCTCACCGAT CAGGCCGGGC TCACGGAGGC CCTCGGCCTG
CTGGCCGCCT ACAGCATGAT CACCCTCAGT AGCAAAACGA TCAGCGTGCA CCGGCTCGTC
CAGGCCGTCG CCCGCGCCCC CGGCACCTCC ACCCCCGGCG GCACCGCCGA TCTCCACCGC
CGGCCCGACC AGATCGCCAC CGCACGCGAC ACCGCCACCG AGCTCCTCTG GCTCATCCTC
CCCTTCGATC TCGAACACCC TGATGCCTGG CCTCTCGCAC GAGCACTCCT CCCTCACATC
ACCGCCCTCA CCGACCACGC ACCCCCCATC ACCGACGCCA TCACGATCAA CCCGCTTCTC
TACCACGTCG GCAATTTCCT GCGCGGGCAG GGCGCACTCC AGCAGGCCAT CACCTACCTG
CAACGCTCTG TGGCGCTCTC AGAGCGTCTC CACGGCCCCG ACCACCCCGA CACCCTGGCC
TCACGCAATG ATCTCGCCCA CACCTACCAG TGGTCGGGGG ATCCGGGCCG CGCCATCCCC
TTGCTGAAGG CGACGCTCGC CGACTGCGAG CGGGTGCTGG GCCCGGACCA CCGAAACACC
TTGCAGACCC GCAACAACCT CGCCAACGCC TACCGGGAGG CGGGTGACTT CGACCGGGCC
ATCCCTCTGC TGGAGGAGAC GCTCGCCCAC CGGGAGCGGG TGCTGGGCAC CGATCACCCC
GACACCCTGA ACTCCCGTAA CAACCTCGCC ATCGGCTATG ACGCTGCGGG GGATCGGGAG
CGCGCCATCT CCCTGCTGGA GGCGACACTC GCCGACAGCG AGCGGGTGCT GGGCGCAGAC
AACCTCCACA CCGCGTCCTT CCGCAACAAC CTGGCCCTCT ACCACCTGGA GGCGGGTGAT
CTCGGCCGGG CCATCCCTCT GCTGGAGGAG ACGCTCGCCC ACCGGGAGCG GGCGCTGGGC
ACCGATCACC CCGACACCCT GAACTCCCGC AACAACCTCG CCGCCGGCTA CCAGGAGGCG
GGTGATCTCG ACCGCGCCAT CCCCCTGTAC GAAGCCACGC TCACCGCCTC CGAGCGGATA
CTGGGCACAG ATCACCGCTT CACCACTGCG GTGCGTGCCA AGCTGACAAC CGTTCGGCCG
GACTGA
 
Protein sequence
MRQWSRVLRI VVAAAGTVTV GVAVNQILNG GTWNWWALGA SAGLAITAEG TNRWLARREV 
AENGSAPSPT PQPGSRGDRP IEIAGNVSGI IVSGDNATIT QTHQTVLPPP TPIAAIDAPA
GLGNLPLAAG EFVGREEILA QLDSAMGGDE PVVVAVVQGL GGIGKSTLAA RYAALHHDRF
HPVWWITADS PAALEAGLAA LITALDPRDA EGVDLKARTE RATVWLATHS GWLLVLDDVT
RPQDIAPLLG RVRSGRVVVT SRLRQGWQRI GARVLHLDVL SEDEAVDLLT RLAQPEDPGH
DVRPGALDLV QELGFLPLAI DQVGAYLHQT ALTPAAYLTL LRAQPEVFFD QAAEGADSDR
TVARIWRITL DQLTGSSPLP GDLLRILAWM APDAIPRTLL APLTRPRPGA PRRRRWRRTR
HGWGARPLTD QAGLTEALGL LAAYSMITLS SKTISVHRLV QAVARAPGTS TPGGTADLHR
RPDQIATARD TATELLWLIL PFDLEHPDAW PLARALLPHI TALTDHAPPI TDAITINPLL
YHVGNFLRGQ GALQQAITYL QRSVALSERL HGPDHPDTLA SRNDLAHTYQ WSGDPGRAIP
LLKATLADCE RVLGPDHRNT LQTRNNLANA YREAGDFDRA IPLLEETLAH RERVLGTDHP
DTLNSRNNLA IGYDAAGDRE RAISLLEATL ADSERVLGAD NLHTASFRNN LALYHLEAGD
LGRAIPLLEE TLAHRERALG TDHPDTLNSR NNLAAGYQEA GDLDRAIPLY EATLTASERI
LGTDHRFTTA VRAKLTTVRP D