Gene Sare_4230 details


Gene Information

Locus tag: Sare_4230
Symbol:
ID: 5704401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4802216
End bp: 4803811
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 73%
IMG OID: 641273649
Product: oxidoreductase molybdopterin binding
Protein accession: YP_001539002
Protein GI: 159039749
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID:
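
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4802216, 4803811)
    print(length, protein_length(length))
```

With the values listed in this record, the two functions reproduce the reported 1596 bp gene length and 531 aa protein length.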


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.14088
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCAGTA CGAACCGTCG CCACGCCGCC CTTGCTGGGG TGACCGCCGC TGCTGTGGCG 
ATCGGCTCGG CGGAACTCGT CGCGGTGTTG ACCGGCCCCC GTTCTGCCCC CTTGGTCGCC
GTCGGCGGGC TGGTGGTCGA CACCGTGCCC GAGCCGCTCA AGCAGGCTGG CATCGCGTTA
TTCGGCAGGT ACGACAAGGT AGCCCTGCTG GTGGGGATGG CCCTGCTGCT TGCCGGCTTC
GCGGCGCTGC TCGGGGTGCT GTCCCGTCGG CAGCTGGCGT ACGGGCTGAC CGGTGTCACG
GCGTTCACGG CGCTGGGGGC GTTCGCAGCC CTGACCCGGG CCGGCGCTGA TCTCGCGGAC
GCCCTGCCGG CGCTGGTTGG CGGTAGCCTC GGCGGGTTGG TGCTCTGGGC GTTCATCCTG
GGTCCGCTGG AGCTGGATCC GTGGCCCTGG TCGTCACCCC TGCCACCGGC CGGGCCGGGG
GTGCCGGTTG CCGTCTCGGC CGACCATGGG GAGGTCGCGG GGCCGGCCCC GGAGTCCCGG
CGACGGTTCC TCGCCGCGAG TGGGTTGTTG CTCGGGGCGG CGGGGGCGGC CGGTGTCGGC
GGCCGGTGGT TGGCTGGCCG GCGGGGGGTT TCGGTGGCCC GCGAGGCGGT CGTGTTGCCG
GCCCCGGCGT CGCCGGCGCC CGCCGTCCCG GCCGGCGCCG ACCTGAAGGT CACCCAGCTG
GCTCCCTACG TCACACCCAG ATCCGCCTTC TACCGGATCG ACACGGCCCT GGTGGTGCCG
CAGGTTGACC CCGCCACCTG GCAGTTGCGC ATCCACGGTC GGGTCCGCAA CCCGATCACC
CTCAGCTTTG CCGACCTGCT GGCACGGCCG CTGGTCGAGC GCTACGTCAC GCTGGCCTGT
GTGTCGAACG AGGTCGGCGG CGACCTGATC GGCAACGCCC GCTGGCTGGG GGTGCCGCTG
CGGGACCTGT TGGCGGAGGC GGAGCCGCAG GAGGGCGCGG ACCAGGTCGT TGGGCGGTCG
GTTGACGGCT GGACCTGTGG CACCCCCACG GCCGTGCTGC GGGACGGCCG GGACGCGCTG
CTGGCAATCG GTATGAACGG TGAGCCGCTG CCGGTTGAGC ATGGCTTCCC GGCCCGGATG
GTGGTGCCGG GTCTGTACGG CTACGTGTCG GCCTGCAAGT GGGTCACCGA ACTGGAGTTG
ACCAGCTTCG CGGACTTCGA CGCGTACTGG GTGCCGCGCG GTTGGTCGGC GCTGGGCCCG
GTGAAGACCC AGTCGCGAAT CGACACGCCG CGTCGGCGGA ACCGGCTGGT GGCTGGGGAG
GTGGTCGTCG CGGGGGTGGC CTGGGCCCAG CACCGCGGCA TCCGGCGGGT CGAGGTCCGG
GTGGACGAGG GCCCTTGGCA GGAGGCCGAC CTCGCACCGA CGGTCTCGGT GGATACCTGG
GTGCAGTGGT CGTGGCGGTG GGACGCGACG CCGGGGGAGC ACACGCTCCA GGTTCGGGCT
ACCGACGCGA CCGGTGAGAC GCAGACCGGC CGGCCTGCTC CGGTCGCGCC GGACGGCGCG
ACCGGCTGGC ACACGGTGCG CGTGACGGTC CGTTAG
 
Protein sequence
MTSTNRRHAA LAGVTAAAVA IGSAELVAVL TGPRSAPLVA VGGLVVDTVP EPLKQAGIAL 
FGRYDKVALL VGMALLLAGF AALLGVLSRR QLAYGLTGVT AFTALGAFAA LTRAGADLAD
ALPALVGGSL GGLVLWAFIL GPLELDPWPW SSPLPPAGPG VPVAVSADHG EVAGPAPESR
RRFLAASGLL LGAAGAAGVG GRWLAGRRGV SVAREAVVLP APASPAPAVP AGADLKVTQL
APYVTPRSAF YRIDTALVVP QVDPATWQLR IHGRVRNPIT LSFADLLARP LVERYVTLAC
VSNEVGGDLI GNARWLGVPL RDLLAEAEPQ EGADQVVGRS VDGWTCGTPT AVLRDGRDAL
LAIGMNGEPL PVEHGFPARM VVPGLYGYVS ACKWVTELEL TSFADFDAYW VPRGWSALGP
VKTQSRIDTP RRRNRLVAGE VVVAGVAWAQ HRGIRRVEVR VDEGPWQEAD LAPTVSVDTW
VQWSWRWDAT PGEHTLQVRA TDATGETQTG RPAPVAPDGA TGWHTVRVTV R
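
The sequences above are printed in blocks of 10 characters, so they need whitespace removed before any analysis. The following is a minimal sketch, under stated assumptions, of how one might clean the gene sequence, compute its GC fraction, and translate it with translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The amino-acid string is the NCBI codon ordering (bases in T, C, A, G order), and only a subset of the start codons permitted by table 11 is handled, each read as methionine when it is the first codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # an alternative start codon still initiates with Met
    return "".join(residues).split("*", 1)[0]


if __name__ == "__main__":
    # First line of the gene sequence listed above.
    fragment = clean("GTGACCAGTA CGAACCGTCG CCACGCCGCC CTTGCTGGGG TGACCGCCGC TGCTGTGGCG")
    print(len(fragment), round(gc_fraction(fragment), 3), translate(fragment))
```

Applied to the first line of the gene sequence, the translation matches the first 20 residues of the protein sequence, including the GTG start codon being read as M. Pasting in the full gene sequence would allow the same comparison against the reported GC content and complete protein.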