Gene Sros_7610 details


Gene Information

Locus tag: Sros_7610
Symbol:
ID: 8670931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 8403331
End bp: 8404509
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: 4-hydroxybenzoate 3-monooxygenase
Protein accession: YP_003343027
Protein GI: 271968831
COG category:
COG ID:
TIGRFAM ID:
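
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 8403331
END_BP = 8404509
GENE_LENGTH_BP = 1179
PROTEIN_LENGTH_AA = 392


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)          # True
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```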


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAACTC AGGTCGGGAT CGTCGGGGCG GGCCCCGCCG GGCTGCTGCT GTCCCATCTG 
CTCCATCTGC GGGGCATCGA CTCGGTCGTG CTGGAGGCGC GCAGCCGGGA GTACGTCGAA
CAGCGCGTCC GGGCCGGCGT GCTGGAGCAG GGCACGGTCG ACGTGCTCAA CGAGGCGGGC
GTCGGCGAGC GGATGCGCGC CGAGGGGCTG CCGCACCACG GCATCGAGCT GCGGTACGGC
GGGGCCGGGC ACCGCATCCC GTTCGAGAGG CTCGTCCCCG GCCGGGCCAT CACCGTGTAC
GGCCAGCAGG AGGTCGTCAA GGACCTGATC GCCCGGCGGC TGGCCGACGG CGGGAAGATC
CTCTTCGACG TCCCGGACGT CGCCCCGCAC TCACTCCAGG CCGACCCCTA CCTCACCTTC
GGGGGCGAGC GCCTCGACTG CGACGTCATC GCGGGCTGCG ACGGCTTCCA CGGCGTCTGC
CGGCCGTCCA TCCCCGACGG GGTTCTGTCG ATCTTCCAGC GGGACTATCC GTTCGCCTGG
CTCGGCATCC TCGCCCAGGT CCCGCCGTCG GCCGAGGAGC TGATCTACTC GCGCAGCGAC
CGGGGCTTCG CGCTGCACAG CATGCGCTCC CCGGAGATCA GCCGCTTCTA CCTCCAGGTC
CCGCCGGACG CCTCGCTGGA CGACTGGCCG GACGAGCGGA TCTGGGCCGA GCTGCGGGCG
CGGCTGGAGA CGGTCCCCGG GTTCGCGCTG ACCGAGGGGC CGATCATCTC CAGGGACCTG
TCCGCGATGC GCTCGTTCGT CGCCGAGCCG ATGCGCTACG GCAGGCTCTA CCTCGCCGGG
GACGCCGCCC ACATCGTGCC GCCGACCGGG GCCAAGGGCC TCAACCTGGC CGTCGCCGAC
GTGCGGGTGC TGACCGAGGC CCTGGCGCAC CTCTACGCGA CGGGCTCCAC CGACCTGCTG
GACGCCTACT CCGCCACCTG CCTGAAAAGG GTCTGGCGGG CCCAGCACTT CTCCTGGTGG
ATGACCACGC TGCTGCACAC CTTCGACACC GACGACGCCT ACGGCAGGCG CCTGCAGACC
TCCCACCTCG ACTACGTCAC CTCCTCGGAG GCCGCCGCGA CCACGCTCGC GGAGAACTAC
GTCGGCCTGC CCCTCGACTC CGGAGCACCC CGTGACTGA
 
Protein sequence
MRTQVGIVGA GPAGLLLSHL LHLRGIDSVV LEARSREYVE QRVRAGVLEQ GTVDVLNEAG 
VGERMRAEGL PHHGIELRYG GAGHRIPFER LVPGRAITVY GQQEVVKDLI ARRLADGGKI
LFDVPDVAPH SLQADPYLTF GGERLDCDVI AGCDGFHGVC RPSIPDGVLS IFQRDYPFAW
LGILAQVPPS AEELIYSRSD RGFALHSMRS PEISRFYLQV PPDASLDDWP DERIWAELRA
RLETVPGFAL TEGPIISRDL SAMRSFVAEP MRYGRLYLAG DAAHIVPPTG AKGLNLAVAD
VRVLTEALAH LYATGSTDLL DAYSATCLKR VWRAQHFSWW MTTLLHTFDT DDAYGRRLQT
SHLDYVTSSE AAATTLAENY VGLPLDSGAP RD
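
The sequence blocks above are printed in groups of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch, not the database's own method, of how one might clean the gene sequence, compute its GC fraction, and translate it. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; the alternative start codons of table 11 are not handled, which is acceptable here because this gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the last full codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGCGAACTC AGGTCGGGAT")
print(translate(fragment))  # MRTQVG, the first six residues of the protein
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `translate` should allow the GC content and Protein sequence fields to be compared against the record.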