Gene Sare_0454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0454 
Symbol 
ID5705451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp522140 
End bp523321 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content66% 
IMG OID641269979 
Productmolybdopterin biosynthesis-like protein MoeZ 
Protein accessionYP_001535374 
Protein GI159036121 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.427252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000325534 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGTCGCTGC CCCCGCTCGT CGAGCCCGCC GCCGAGCTGA CCGTTGACGA GGTCCGTCGC 
TACTCACGCC ACCTGATCAT CCCGGATGTC GGGGTCGAGG GCCAGAAGCG GCTGAAGAAT
GCCCGGGTGC TCTGTGTCGG TGCCGGTGGT CTCGGCTCGC CGGCGCTGAT GTACCTGGCC
GCCGCCGGGG TGGGTACGCT CGGCATCGTC GATTTCGACA CTGTCGACGA ATCCAACCTC
CAGCGTCAGA TCATCCATGG CGTGTCCGAT GTCGGCCGGG CCAAGGCGGA ATCGGCCGCC
GCGACCATCC GAGAGATCAA CCCGCTGGTT GCGGTGGAGA TCCACGACGT GGCGTTGGAT
CGGGACAACG TCAAAGACAT CTTCGCCCGA TATGACCTGA TCGTGGACGG CACCGACAAC
TTCGCCACCC GGTACATGGT CAACGACGCG GCTGTGCTGC TCGGAAAGCC GTACGTGTGG
GGCTCGATCT ACCGCTTCGA CGGGCAGGCG TCGGTGTTCT GGGCCGAGCA CGGTCCCTGC
TATCGCTGCC TCTACCCGGA GCCTCCGCCG CCCGGTATGG TGCCGTCCTG CGCGGAGGGC
GGCGTGCTCG GCGTGCTGTG CGCGTCGATC GGCTCGATCC AGGTGAACGA GGCGATCAAG
CTACTCGCCG GCATCGGTGA GCCGCTGGTC GGTCGGCTCA TGGTCTACGA CGCACTGGAG
ATGAGCTACC GCAAGATCAA GGTGCGGAAG GACCCGAACT GCGCCCTCTG CGGCGAGAAC
CCCACGGTCA CCGACCTGCT GGCGGACTAC GAGGACTTCT GCGGCGCGGT GTCGGCTGAA
GCCCAGGAGG CGGTGATCGA CGCGACGATC ACCGCGGGTG AACTGAAGGC GTGGCAGGAT
GCCGGCAAGG ACTTTCTCCT GGTTGACGTG CGGGAGCCGG CCGAATTCGA GATCGTGCGG
ATTCCGGGCG CCACGCTGAT TCCCAAGGGC GAGATCATCT CTGGCGAGGC GCTGGTCAAG
CTTCCGCAGG ACCGCCAGAT CGTGCTGCAC TGCAAATCAG GTGTCCGCTC CGCGGAGGCG
CTCGCCGCGC TGAAAGCCGC CGGCTTCCGG GACGCGGTGC ACGTCCAGGG CGGCGTTCTC
TCCTGGATCA AGCAGGTCGA TCCGTCGCTG CCCGCGTACT GA
 
Protein sequence
MSLPPLVEPA AELTVDEVRR YSRHLIIPDV GVEGQKRLKN ARVLCVGAGG LGSPALMYLA 
AAGVGTLGIV DFDTVDESNL QRQIIHGVSD VGRAKAESAA ATIREINPLV AVEIHDVALD
RDNVKDIFAR YDLIVDGTDN FATRYMVNDA AVLLGKPYVW GSIYRFDGQA SVFWAEHGPC
YRCLYPEPPP PGMVPSCAEG GVLGVLCASI GSIQVNEAIK LLAGIGEPLV GRLMVYDALE
MSYRKIKVRK DPNCALCGEN PTVTDLLADY EDFCGAVSAE AQEAVIDATI TAGELKAWQD
AGKDFLLVDV REPAEFEIVR IPGATLIPKG EIISGEALVK LPQDRQIVLH CKSGVRSAEA
LAALKAAGFR DAVHVQGGVL SWIKQVDPSL PAY