Gene Svir_07000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSvir_07000 
Symbol 
ID8386038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharomonospora viridis DSM 43017 
KingdomBacteria 
Replicon accessionNC_013159 
Strand
Start bp714238 
End bp715437 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content67% 
IMG OID644974797 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_003132598 
Protein GI257054766 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.126563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTACT ACCGTCGAGT GGGTTACGTC CCACCGAAGC GGCACACGCA GCACCGCGAC 
GAGAACGGCA ATCTCTACTA CGAGGAGCTC ATGGGCGAGG AGGGCTTCTC GTCCGACTCC
TCGCTGCTCT ATCACCGCCA CCTTCCCTCG GCGATCGTCG ACTCCCAGGT GTGGGAGTTG
CCGGACCAGA CGACCACGCC AAACCATCCG TTGCGGCCCC GTCATTTGAG GCTGCACGAC
CTGTTCCCCG GCGACAGCTG GAAGGACGTC GACGTGGTGA CCGGACGTCG GCTGATCCTC
GGCAACGCCG ACGTGCGCAT CTCGTACGTG GTGGCGGGCA AGGAGTCGCC GCTGTACCGC
AACGGGCTCG GCGACGAGAT CGTCTACGTC GAGTCGGGTG ACGCGGTGGT GGAGACCGTG
TTCGGTGCCT TGAAGGCGAC CGCCGGCGAC TACGTGATCC TCCCGATGTC CACGACGCAC
CGCTGGGTGC CGCAGGGCGA AGAGCCGTTG CGGGCGTACG CGATCGAGGC GAACAGCCAC
GTCGCGCCGC CGAAGCGCTA CCTGTCCCGG TACGGGCAGT TGCTGGAGCA CGCGCCTTAC
TGCGAGCGCG ACCTGCACGG GCCGACGGAG GTGCTGATCC GGGAGGGGAC GGACGTCGAG
GTGCTGCTCA AGCACCGTGG CCCCGGCGGG GTCGTGGGCA CCCGCCTGGT GTACCCGTAC
CACCCGTTCG ACGTCGTCGG CTGGGACGGC TGCCTGTATC CGTACACGTT CAGCATCCAC
GACTTCGAAC CCATCACCGG TCGCGTGCAC CAGCCGCCAC CCGTGCACCA GGTGTTCGAG
GGGCACAACT TCGTGGTGTG CAACTTCGTG CCGCGCAAGG TGGACTACCA CCCACAGGCC
ATCCCGGTGC CCTACTACCA CTCCAATGTG GACTCCGACG AGATCATGTT CTACTGCGGC
GGTGACTACG AGGCCCGGAA GGGCTCGGGC ATCGGCCAGG GCTCGGTCTC GATCCACCCG
GGTGGCCACG CGCACGGTCC GCAGCCCGGC GCGTACGAGC GCAGCATCGG GGTGGAGTTC
TTCGACGAGC TGGCCGTGAT GGTCGACACC TTCCGCCCGC TCGAACTCGG TGAGGGGGCG
TTGGCCTGCG AGGACCCGAA CTACGCGTGG ACCTGGGCCG GGCGGGGGCC GAAGCAATGA
 
Protein sequence
MAYYRRVGYV PPKRHTQHRD ENGNLYYEEL MGEEGFSSDS SLLYHRHLPS AIVDSQVWEL 
PDQTTTPNHP LRPRHLRLHD LFPGDSWKDV DVVTGRRLIL GNADVRISYV VAGKESPLYR
NGLGDEIVYV ESGDAVVETV FGALKATAGD YVILPMSTTH RWVPQGEEPL RAYAIEANSH
VAPPKRYLSR YGQLLEHAPY CERDLHGPTE VLIREGTDVE VLLKHRGPGG VVGTRLVYPY
HPFDVVGWDG CLYPYTFSIH DFEPITGRVH QPPPVHQVFE GHNFVVCNFV PRKVDYHPQA
IPVPYYHSNV DSDEIMFYCG GDYEARKGSG IGQGSVSIHP GGHAHGPQPG AYERSIGVEF
FDELAVMVDT FRPLELGEGA LACEDPNYAW TWAGRGPKQ