Gene Sros_3706 details


Gene Information

Locus tag: Sros_3706
Symbol:
ID: 8666994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4102551
End bp: 4104002
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003339372
Protein GI: 271965176
COG category:
COG ID:
TIGRFAM ID:
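
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 4102551
END_BP = 4104002


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1452 483
```

Both results agree with the Gene Length and Protein Length fields of the record.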


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.292623
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.130124
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATAT TTCGGTCGGG ATCGGGCGTG CTACGCCGTA AACCCATCGA GCACATCGAA 
GAACCGGAGG GCGGCAAGAG CGAGCAGCTC ACCCGCGTGC TGGGACTCTG GCAGCTCACC
GCGATCGGGG TGGGCGGCAT CATCGGCGCA GGCATCTTCA CGCTCGCGGG CACGGTCGCC
AACGGCACGG CGGGTCCCGC GGTGGTGGTG TCGTTCCTGA TCGCCGGGAT CGCGAGCGCG
GCGGCGGCGC TCTCCTACGC GGAGTTCGCG GGCCTCATCC CGAAAGCGGG GTCGGCCTAC
ACCTACGGCT ACGCGGTCCT CGGCGAGCTG CCGGGCTGGT TCATCGGCTG GGACCTGCTG
CTGGAGTACA CCGCGATCGT CGCGGTCGTC GCCATCGGCA TCTCCGGCTA CTTCTCGTTC
CTGCTCGGCG ACCTGGGCCT GCAGTTGCCC GCGTGGATGC TGGGGGCCCC GGGAACGGGG
GAAGGGCACC AGGTGGACCT GTTCGCCGCC ATACTGTGCC TGTTCATCGC ATATCTGCTG
AACCTCGGCA TGAAGAACGC GGCCCGGTTC GAGACGGCCG TCGTCGGGCT GAAGGTCGCC
GTGGTGCTGA TGGTCATCGT CATCGGCTTC TTCTACATCA ACTCCGACAA CTACGTCCCG
TTCTTCCCCT TCGGCATCGG CGGCGCCATA ACCGGGGCGG CCACGGTCTT CTTCGCCGTC
TTCGGCTACG ACGCCATGAG CACGGCGGCG GAGGAGTCCA AGGACGCCCA GCGCCACATG
CCGAAGGCGA TCGTGTACTC GCTCGCCATC TCGATGGTCC TGTACGTGCT GGCCTGCCTG
GTGCTGACGG GCATGCAGAA ATACACGGAG ATCGACAAGG AGAGCGGCTT CTCCACGGCG
TTCAAGTCCG TGGGCCTGAG CCGCCTGGCC GACGTGATCG CGGTCGGGGC GATCGTCGGC
ATCCTCACCG TGATGTTCAC CTTCATGCTC GGGGTGAGCC GTGTCTGGTT CTCGATGAGC
CGCGACGGGC TGCTGCCCAA GTGGTTCGCC AAGACGCACC CGACGCGGCA CGTGCCGACG
CGCGTGACGT GGATCGTCGG CGTCGCCTCG GCGTTCATCG CCGGGTTCCT CCCCATCAGG
GAGGCCGCGG AGCTGACCAA CATCGGCATC CTGCTCGCCT TCGCGGTCGT GTGCACGGCG
GTGATCGTGC TGCGCTACCG GCAACCCGAC CTGCCTCGGA CCTTCCGCTG CCCCGGAATG
CCGCTGGTGC CCGCGATCGG CGTCGTCTTC TCGCTCTGGC TGATCACCTT CCTGCAGTGG
CAGACGTGGG TGCGCTTCCT GGCGTGGTTC CTGATCGGCC TGGTCGTCTA CTTCGCATAC
TCCTACCGGC ACTCCGAGCT GGCCAGGGCG GAGGCGGCGC ACGGCGGCGG GGTCCCGCCG
GACGAGCGCT GA
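
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below is one way to strip the layout whitespace and compute a GC fraction. The helper names are hypothetical, and only the first displayed line is used as sample input, so its GC fraction need not match the 67% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all layout whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = (
    "ATGGCCATAT TTCGGTCGGG ATCGGGCGTG CTACGCCGTA AACCCATCGA GCACATCGAA"
)
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.2%}")
```

Pasting the full sequence block into `clean_sequence` in the same way should give a 1452-base string to compare against the GC content field.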
 
Protein sequence
MAIFRSGSGV LRRKPIEHIE EPEGGKSEQL TRVLGLWQLT AIGVGGIIGA GIFTLAGTVA 
NGTAGPAVVV SFLIAGIASA AAALSYAEFA GLIPKAGSAY TYGYAVLGEL PGWFIGWDLL
LEYTAIVAVV AIGISGYFSF LLGDLGLQLP AWMLGAPGTG EGHQVDLFAA ILCLFIAYLL
NLGMKNAARF ETAVVGLKVA VVLMVIVIGF FYINSDNYVP FFPFGIGGAI TGAATVFFAV
FGYDAMSTAA EESKDAQRHM PKAIVYSLAI SMVLYVLACL VLTGMQKYTE IDKESGFSTA
FKSVGLSRLA DVIAVGAIVG ILTVMFTFML GVSRVWFSMS RDGLLPKWFA KTHPTRHVPT
RVTWIVGVAS AFIAGFLPIR EAAELTNIGI LLAFAVVCTA VIVLRYRQPD LPRTFRCPGM
PLVPAIGVVF SLWLITFLQW QTWVRFLAWF LIGLVVYFAY SYRHSELARA EAAHGGGVPP
DER