Gene Sros_5939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5939 
Symbol 
ID8669233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6511415 
End bp6512914 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content70% 
IMG OID 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_003341417 
Protein GI271967221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.161214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000467819 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGCCAGA TCGATCCCGA TTTCCTCGCT CTGCCCCTGA GGCGGCTGGC GGACGCGGCC 
CTGCAACGCG CCCGTGACCT CGGCGCCGAG CACGCCGACT TCCGGCTTGA GCGCGTCCGC
TCGGAGACCC TGCGCCTGTC CGACGCCTCA CTCGAAGGCG CCATCGACGC CGACGACCTC
GGCTACGCCG TACGGGTCGT CAAGAACGGC ACCTGGGGCT TCGCCTCCGG CATCGACCTT
ACCCCTGAGG CGGCCGTGAG GGCCGCCGAG CAGGCGGTGG AAGTGGCCGT CATCTCCGCG
GCCGTCAACC GCGAACCCAT CACGCTGGCC CCCGAACCGG TCCACTCCGA CGTCACCTGG
GTCTCGGCCT ACGACGTCGA CCCCTTCGCG GTGCCGCTGC GCGACAAGGT CGCCCTGCTC
GCCGACTGGT CGGACGCCCT GCTGAGGGAG CCGCGAGTGG ACCACGTCCA GGCCTCGCTG
CAGCAGGTCA AGGAGCAGAA GTTCTACGCC GACACCGCCG GCACCTCCAC CACCCAGCAG
CGGGTGCGCC TGCACCCCGA GCTGGAGGTG ATGAAGGTCG AGGACGGGCG CTTCGAGTCG
ATGCGCACGC TGGCCCCGCC GGTCGGCCGG GGGTACGAAT ACCTCACCGG CACCGGCTGG
GACTTCCCCG GCGAGCTGGC CCGCCTGCCC GAGTTCCTCG AGGAGAAGCT GAAGGCGCCC
TCCGTCGAGG CGGGACGCTA CGACCTGGTC ATCGACCCGT CCAACCTGTG GCTGACGATC
CACGAGTCCA TCGGGCACGC CACCGAGCTG GACCGGGCCC TCGGCTACGA GGCGGCCTAC
GCCGGGACCA GCTTCGCCAC CTTCGACCAG CTCGGCAAGC TGGTGTACGG CTCGCAGGTG
ATGAACGTGG TCGGCGACCG CACGACGGAG CACGGCCTGT CCACGGTCGG CTACGACGAC
GAGGGCGTGG CGACCAAGCG GTTCGACATC GTCTCCGGCG GCGTCCTGGC CGGATACCAG
CTCGACCGGC GGATGGCGCG GTTGAAGGGC CTCGGCGCCT CCAACGGCTG CGCCTTCGCC
GACTCCCCCG GCCACATGCC GATCCAGCGC ATGGCCAACG TCTCGCTGCT GCCTGCGCCC
GATGGACCCT CCACCGAGGG GCTGATCTCC GGGGTGGAGC GCGGCATCTA CGTCGTGGGC
GACAAGAGCT GGTCCATCGA CATGCAGCGT TACAATTTCC AATTCACCGG CCAGCGGTTC
TACCGGATCG AGAACGGCAG GATCGCCGGC CAGGTCCGCG ACGTCGCCTA CCAGGCCACG
ACCACCGACT TCTGGCGGTC GATGGCGGCC GTCGGCGGGC CGCAGACCTA CGTGCTGGGC
GGCGCGTTCA ACTGCGGCAA GGGCCAGCCC GGCCAGGTCG CCCCGGTCAG CCACGGCTGC
CCGTCGGCGC TCTTCCGCGA TGTGCGCATT CTCAACACCC TGCAGGAGAG TGGCAATTGA
 
Protein sequence
MRQIDPDFLA LPLRRLADAA LQRARDLGAE HADFRLERVR SETLRLSDAS LEGAIDADDL 
GYAVRVVKNG TWGFASGIDL TPEAAVRAAE QAVEVAVISA AVNREPITLA PEPVHSDVTW
VSAYDVDPFA VPLRDKVALL ADWSDALLRE PRVDHVQASL QQVKEQKFYA DTAGTSTTQQ
RVRLHPELEV MKVEDGRFES MRTLAPPVGR GYEYLTGTGW DFPGELARLP EFLEEKLKAP
SVEAGRYDLV IDPSNLWLTI HESIGHATEL DRALGYEAAY AGTSFATFDQ LGKLVYGSQV
MNVVGDRTTE HGLSTVGYDD EGVATKRFDI VSGGVLAGYQ LDRRMARLKG LGASNGCAFA
DSPGHMPIQR MANVSLLPAP DGPSTEGLIS GVERGIYVVG DKSWSIDMQR YNFQFTGQRF
YRIENGRIAG QVRDVAYQAT TTDFWRSMAA VGGPQTYVLG GAFNCGKGQP GQVAPVSHGC
PSALFRDVRI LNTLQESGN