Gene Sros_5368 details


Gene Information

Locus tag: Sros_5368
Symbol:
ID: 8668662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5880868
End bp: 5882868
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003340874
Protein GI: 271966678
COG category:
COG ID:
TIGRFAM ID:
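
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are inclusive and that the final codon is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5880868
end_bp = 5882868

# Assumption: both endpoints belong to the gene (inclusive coordinates),
# so the length is the difference plus one.
gene_length_bp = end_bp - start_bp + 1

# Three bases per codon; the last codon is assumed to be a stop codon,
# which is not counted in the protein length.
total_codons, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = total_codons - 1

print(gene_length_bp, leftover_bases, protein_length_aa)  # prints: 2001 0 666
```

The result matches the listed gene length of 2001 bp and protein length of 666 aa.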


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.100137
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0878491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGCGGA CCTGTCTGGC GCTGCTCATG GTCCTGACCG CCTTATCGAT ACCGGCCTTG 
TCGATACCGG CCTTGTCGAT ACCGGCCGCC GCGCAGGCGA CGGCCGCGCC GAACTGCGAC
ACCCCGGCCG TCACCACGTA CGGCCCGGCC TCCGTCACCG GTGCGATCGT CGGCGCCACC
GTCCACGAAG GCCACGCCTA CGTGGTCTCC CGCGGACCGA AGCCGCCGGT CGTGGCCGAG
ATCGACCTGG CCACCCGCAA GGTGACACGC ACCGCGACCC TGCCGGACGG CCCCGCGGCA
GGCGAGCCGG AAGGCGGCTG GGCCACCGCC GTCGCCGGCG GCAAGCTGTA CATCGGCACC
TACCCCGTAC CCGACCTCTA CAGCTACGAC CTGGCCACGG GCCAGGTAGC GCACCTGCAC
TCCTTCGGCG CGAACGGCGG CTTCGTCTGG AGCATGGCCG CCGCACCGGA CGGCACCCTC
TACCTCGGCA CCTCCTCCGA CGGCAGGCTC TGGGAGTACG TCCCCTCGAC CGGCGCGATC
CGCAGGTACG GTGTGCTGGT CCAGGGCGAG CACTACGTCA GAGCGGTCGC CGCCGACGAG
ACCACGGCCT ACGTCGGCCT GCTGGAGAAG GCCAAGCTGA TGGCCGTCGA CCGGGTCAGC
GGCGCGGTCA GGGAGCTGGC CCAGGGGCCC GCGGCCGTGG GCACGGTCTC GCTCCACGGC
GATCGGGTGC TGGCGGCGAG CGGGAGCACG CTGATCGACG TGCGCAAGGA CGGCACGGAC
CGGCGGCAGT TCGAGGCCGG AGTGGGCATC ATCGACGCGT TCGAGGTGGC CCCCGACGGC
CTGGTCTACC TCACCTCGCG TCCCGACGGC GCCGTCTACA GCTACCGCAC CGGGCAGGAC
GCGGTGACCA AGGTCGCCGA CCCGCCCTCG CGGGGCGAGG AGACCCGGGC ACTGAAGCTG
GCCGGGCAGC AGCTCATCGG CTTCGCCGGC AGCGGCGGTA TGTGGTGGCT GGACCTGCCG
AGCCGGACCT CCGAGTTCGT CGACCTCATC GAAGCCGGCT TCCCGGCCGG CGCCGAGCGG
ACCCAGAGCA TGCTGCTGGT GCCGAACCGG GCCCTCTACA TCGGCGGCCA CTTCGCCGTG
GAAGTGCGCG ACCTGCGGAC CGGAAAGCAG CGCAGAGTCC GCGTGGTGGG CGAGCCGAAG
GCCATGGTGC GACGGGGTGA CAAGATCTAC GCTGCGCTGT ATCCGAGCGG TCAGATCATC
TCCATCGACG TGCACACCGA CGAAGTGCGC GGCCTCGGCT ACATCGGGCA CGACCAGTCA
CGGCCGTGGG ACATCGAGCA CGACCCCGTC ACCGACACCA TCCTGGTGGC CTCCGCGCCG
ACCGGCGCGA AGCTGAAGGG CGCTCTGACG GTGCTCGACC CCGACACGGG GAAGATGGAC
GTCTACCCGG ACGTCATCCC CGACCAGAGC CTGATGAGCC TGAGCGTCGA CTCCAAGCGC
GGCGTCGTCT ACCTCGGTGG CGACGTGCTC GGCGGAGGCG GCACACCGCC GACCAAGACG
ACGGCCTCGA TCGCCGCCTT CGACCTCAAG CAGCGCAAGG TGGTGTGGCA GATCGACCCG
CTGCCGAACC ACCGCACCTT CCAGGACCTC AAGGTGCACA ACGGGCTGCT GTACGCGGTC
TACAAGCGCG TGCTGGGTGC CTGGATCGCC CTGGACCCCG CCACCGGCAC CATCAAGCAC
CAGGGCCTGC TGTCGGGACA CGGCGAACTG CAGGTCCACA AGGGCCGGGT GTTCACCTCC
ACCTACTTCG GCGGTGGCAA CGCCTACGAG CTCGGATCGC AGGCCAAGCT GCTGGCGACC
GGGCTGGGCG ACGAGTGGCA CACCAATCCG CAGCTGCAGT TCGAACCCGG CTCGTTCGAC
GCGTGGACGA TCGTCGGCCG GGATCTGGCG AAGGTACGGC TCGACCCGCG CTGCCCGCCC
GTGCAGATCC CGTCCCCGTA G
 
Protein sequence
MPRTCLALLM VLTALSIPAL SIPALSIPAA AQATAAPNCD TPAVTTYGPA SVTGAIVGAT 
VHEGHAYVVS RGPKPPVVAE IDLATRKVTR TATLPDGPAA GEPEGGWATA VAGGKLYIGT
YPVPDLYSYD LATGQVAHLH SFGANGGFVW SMAAAPDGTL YLGTSSDGRL WEYVPSTGAI
RRYGVLVQGE HYVRAVAADE TTAYVGLLEK AKLMAVDRVS GAVRELAQGP AAVGTVSLHG
DRVLAASGST LIDVRKDGTD RRQFEAGVGI IDAFEVAPDG LVYLTSRPDG AVYSYRTGQD
AVTKVADPPS RGEETRALKL AGQQLIGFAG SGGMWWLDLP SRTSEFVDLI EAGFPAGAER
TQSMLLVPNR ALYIGGHFAV EVRDLRTGKQ RRVRVVGEPK AMVRRGDKIY AALYPSGQII
SIDVHTDEVR GLGYIGHDQS RPWDIEHDPV TDTILVASAP TGAKLKGALT VLDPDTGKMD
VYPDVIPDQS LMSLSVDSKR GVVYLGGDVL GGGGTPPTKT TASIAAFDLK QRKVVWQIDP
LPNHRTFQDL KVHNGLLYAV YKRVLGAWIA LDPATGTIKH QGLLSGHGEL QVHKGRVFTS
TYFGGGNAYE LGSQAKLLAT GLGDEWHTNP QLQFEPGSFD AWTIVGRDLA KVRLDPRCPP
VQIPSP
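
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA with translation table 11. The sketch below is a hedged illustration rather than the method the database itself uses. It assumes the amino-acid assignments of table 11 match the standard code, and that the alternative start codons of table 11 (here GTG) are read as methionine when they open the reading frame. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq, initiator=True):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and initiator and codon in START_CODONS:
            residue = "M"  # an initiating start codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("GTGCCGCGGA CCTGTCTGGC GCTGCTCATG"))             # prints: MPRTCLALLM
print(translate("GTGCAGATCC CGTCCCCGTA G", initiator=False))     # prints: VQIPSP
print(gc_fraction("GTGCCGCGGA"))                                 # prints: 0.8
```

The two translated fragments agree with the first ten and last six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.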