Gene Sros_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2003 
Symbol 
ID8665285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2156892 
End bp2159444 
Gene Length2553 bp 
Protein Length850 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003337734 
Protein GI271963538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.614926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.156337 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGGTC TATCCCGTCC TGCCGCTTTA TGGACCGCTT CAGCGGTCGT GGCGGCTCTG 
ATCGCGGTAC CGTCCGGTCC CGCCCTCGCG GACACGTTGT TGTCGCAGGG CAAGACCGCG
ACGGCCTCGT CGTCCGAGAA CGCCGGCACC GGCCCTGCCC TGGCGGTCGA CGGCGACACC
GGCACCCGCT GGTCCAGCGC GAGCACCGAC TCCCAGTGGC TCCAGGTGGA CCTGGGGGCC
TCGGCCGGCA TCAGCAAGGT GGTGCTGAAC TGGGAGGCCG CCTACGGCAG CGGCTACAAG
GTCCAGGCGT CCGAGGACGG AAGCACGTGG ACGGACCTGA AGACCGTCAC CGGCGGCGAC
GGCGGTACCG ACACGTGGGA CGTCACCGGC AGCGGCCGGT ACGTCCGGAT GCTCGGCGTC
ACGCGGGCCA CCGGTTACGG CTACTCCCTG TGGGAGTTCC AGGTCTTCGG CACCGGCTCG
GGCACCGGAG AGCAGTGCGG GACCGCCAAC GCCGCGCTGA ACAGGCCCTC CGCCGCGTCG
TCGGCCGAGA ACGGCGGCAC CCCCGCGAAG AACGCCTTCG ACGGCGACAA CGGGACCCGC
TGGTCCAGCG CGGCCAGTGA CCCGCAGTGG GTCCGGGTGG ACCTCGGCTC GGTCCAGGAC
GTCTGCGGGA TCGACCTGAG GTGGGAGGCC GCGTACGGGA CCGCCTTCAA GCTCCAGGCG
TCGGACGACG GCAACACCTG GAACGACCTG AGGTCCGTCA CGGGCGCCAC CGGCGGCACG
CAGTCGTACG ACGTGAGCGG CTCCGGCCGG TACGTACGGA TGCTCGGCAC CGCGCGGGCC
ACCGGCTACG GCTACTCCCT GTGGGAGTTC GGCGTGCGCG TCGCCTCGGA CGGGCCGCAG
CTGCCCGGCG GCGGGGACCT GGGCCCGAAC GTCCACGTCT TCGACCCGTC CATGTCCAGC
GCCGGCATCC AGAGCCGGCT GGACACGGTC TTCGACCAGC AGGAGTCGGC CCAGTTCGGC
ACCGGCCGAC ACGCCTTCCT GTTCAAGCCC GGCTCCTACA ACGTGAACGC CGACATCGGC
TTCTACACCT CGATCGCCGG CCTGGGGCAG TCGCCGGACG ACGTCACCAT CAACGGCGGC
GTGACCGTCG ACGCGGGCTG GTTCAACGGC AACGCGACCC AGAACTTCTG GCGCTCGGCG
GAGAACCTGT CGATCCAGCC GACCGGCGGC ACCAACCGGT GGGCGGTCTC CCAGGCCGCC
CCGTTCCGCC GGATGCACAT CAAGGGCGCG CTGAACCTGG CGCCCACCGG CTACGGCTGG
GCCAGCGGCG GCTACATCGC CGACAGCAAG GTCGACGGCG CCGTCGGCCC CTACTCGCAG
CAGCAGTGGT ACACCCGTGA CAGTTCGGTC GGCGGCTGGG TCAACGGCGT GTGGAACATG
GTGTTCTCCG GAGTGGAGGG CGCCCCCGCC ACCTCCTTCG CCGACAAGTC CTACACCACG
CTGAACAGCA CGCCGATCAG CCGCGAGAAG CCGTACCTCT ACGTGGACGG CGCCGGCGCC
TACCGGGTGT TCGTCCCGTC CAAGCGGACC GACGCGCGCG GCGCGAGCTG GGCGAACGGC
CCCACCCCCG GCAGTTCGAT CCCGCTCGGC CAGTTCTACG TCGCCAAGCC GGGCGACTCG
GCCGCGACCA TCAACGCCGC GCTCGCCCAG GGCCTGAACC TGCTCCTCAC CCCGGGCATC
TACACCGTCA ACCAGGCGAT CAACGTGACC CGGCCGAACA CCGTGGTCCT GGGGCTCGGA
CTGGCCACAC TGATCCCGGC CAACGGGGTC AGCGCGATCA AGGTGGCCGA CGTGGACGGC
GTGAAGCTCG CCGGGTTCCT GATCGACGCG GGCACCCAGA AGTCCGACGT GCTCGTGGAG
GTCGGCCCGC AGGGATCGAG CGCCGACCAC GCCGCCAACC CGACCTCGCT GCAGGACGTG
TTCGTCCGGA TCGGCGGCGC GTTCGCCGGC AACGCCACCA CCAGCGTCGT GGTCAACAGC
GACGACGTGA TCATCGATCA CACCTGGCTG TGGCGCGGCG ACCACGGTGC GGGCATCGGC
TGGAACGTCA ACACCGCCGA GACCGGCCTG GTGGTCAACG GTGACGACGT GCTGGCCACG
GGCCTGTTCG TCGAGCACTA CCGCAAGTAC GACGTGGAAT GGTACGGCGA GCGCGGCCGG
ACGATCTTCT TCCAGAACGA GAAGGCGTAC GACCCGCCGA ACCAGGCGGC CTACATGAAC
GGCACCACCA AGGGGTGGGC GGCCTACAAG GTAGGCAACG CGGTCAACGC ACATGAGGGC
TGGGGCCTCG GCAGCTACGT CTACTTCAAC GTCGACCCGA CGATCGAGGT GGAGAACGGG
TTCGAGGCTC CGGTGAAGCC GGGCGTCAGG TTCCACAGCC TGCTCACCGT GTCCCTCGGC
GGCAACGGCC GGGTCAACCA CGTGATCAAC GGCACGGGTG GCCCCGCCTC CGGCACGGCG
ACGATCCCGT CGAAGCTCGT CTCCTACCCG TGA
 
Protein sequence
MRGLSRPAAL WTASAVVAAL IAVPSGPALA DTLLSQGKTA TASSSENAGT GPALAVDGDT 
GTRWSSASTD SQWLQVDLGA SAGISKVVLN WEAAYGSGYK VQASEDGSTW TDLKTVTGGD
GGTDTWDVTG SGRYVRMLGV TRATGYGYSL WEFQVFGTGS GTGEQCGTAN AALNRPSAAS
SAENGGTPAK NAFDGDNGTR WSSAASDPQW VRVDLGSVQD VCGIDLRWEA AYGTAFKLQA
SDDGNTWNDL RSVTGATGGT QSYDVSGSGR YVRMLGTARA TGYGYSLWEF GVRVASDGPQ
LPGGGDLGPN VHVFDPSMSS AGIQSRLDTV FDQQESAQFG TGRHAFLFKP GSYNVNADIG
FYTSIAGLGQ SPDDVTINGG VTVDAGWFNG NATQNFWRSA ENLSIQPTGG TNRWAVSQAA
PFRRMHIKGA LNLAPTGYGW ASGGYIADSK VDGAVGPYSQ QQWYTRDSSV GGWVNGVWNM
VFSGVEGAPA TSFADKSYTT LNSTPISREK PYLYVDGAGA YRVFVPSKRT DARGASWANG
PTPGSSIPLG QFYVAKPGDS AATINAALAQ GLNLLLTPGI YTVNQAINVT RPNTVVLGLG
LATLIPANGV SAIKVADVDG VKLAGFLIDA GTQKSDVLVE VGPQGSSADH AANPTSLQDV
FVRIGGAFAG NATTSVVVNS DDVIIDHTWL WRGDHGAGIG WNVNTAETGL VVNGDDVLAT
GLFVEHYRKY DVEWYGERGR TIFFQNEKAY DPPNQAAYMN GTTKGWAAYK VGNAVNAHEG
WGLGSYVYFN VDPTIEVENG FEAPVKPGVR FHSLLTVSLG GNGRVNHVIN GTGGPASGTA
TIPSKLVSYP