Gene Sare_1544 details


Gene Information

Locus tag: Sare_1544
Symbol:
ID: 5703525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1779406
End bp: 1780368
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 70%
IMG OID: 641271055
Product: luciferase family protein
Protein accession: YP_001536431
Protein GI: 159037178
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03557] F420-dependent oxidoreductase, G6PDH family
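
The start, end, gene length, and protein length fields above are consistent with one another, which can be checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 1779406
END_BP = 1780368


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


print(gene_length(START_BP, END_BP))  # 963
print(protein_length(963))            # 320
```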


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0708084
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0100122
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAACG TCGGCTACAC CCTGCTGAGT GAGCAGGCCG GGCCGAAGGA ACTGGTCGAC 
CACGCCGTAC GCGCTGAGGC CGCCGGCTTC GATCACCTGG TCGTGTCCGA CTGCTACTCC
CCCTGGCTCG ACTCCCAGGC ACACTCTCCG TACGCCTGGT CGGTGCTCGG CGCGGTTGCC
CAAGCCACCT CCCGGGCGGC GCTGATGTCC TACGTGACCT GCCCGATCCG CCGATACCAC
CCGGCTGTGG TGGCGCAGAA GGCGAGCACC GTCGGGGTGC TCTCCGACGG CCGGTTCACC
CTCGGCCTCG GCGCCGGCGA GCGCCTCAAC GAGTATGTTG CCGGCAGCTG GCCGCACGTG
CAGCAGCGGC ACGAGATGTT CGAGGAAGCG CTGAAGATCA TAAAGCCGCT GCTGAACGGC
GAGACGGTGA CCTTCTCCGG CAACCACTAC GCCGTACCCG ACGCCTACCT GTGGGACCGG
CCGGCGCAGC CGGTCCCGAT GGCCGTCGCC GCCTCCGGCC GCCAGTCGGC CACCCTCGCC
GCCGAATACG CCGACGCCAT CATCGCCGCC GAGCCGGATC CCCACCTGCT TCAGGTGTAC
GAACAGACCG GCGGCGCGCG GAAGCCGCGC TACGGGCAGG TGGTCATCTG CTGGGGCCCG
GACGAGGCCG AGTGCCGTGC GATCCTGCAC GACCAGTTCC GCTGGTTCGG GCTGGGCTGG
AAGGTCAAGG CTGAGCTGCC CGGCCCCGAC TCCTTCGCCG CCGCCACCCA ATTCGTCAGT
GAGGAGGACG CTGCCACCGG CATCCCCTGC GGCCCGGACG TCGACCGGCA CGTCGCGGCG
TTCCGGCGCT ACGTCGACGC CGGCTTCAGC CATCTCGCGC TCCTCCAGGT CGGTGGCGGG
AGCCAGCCGA TGTTCCTGGA GTGGGCACAC GAGCGACTCC TGCCCCGCCT ACGAGAGCTG
TGA
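
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before its length or GC content can be compared with the Gene Information fields. A minimal Python sketch follows; the helper names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean(seq_text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text):
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above; paste all rows to check the full gene.
first_row = "ATGGTGAACG TCGGCTACAC CCTGCTGAGT GAGCAGGCCG GGCCGAAGGA ACTGGTCGAC"
print(len(clean(first_row)))            # 60
print(round(gc_percent(first_row), 1))
```

Run on the complete sequence, `len(clean(...))` should match the Gene Length field and `gc_percent(...)` should come out close to the reported GC content.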
 
Protein sequence
MVNVGYTLLS EQAGPKELVD HAVRAEAAGF DHLVVSDCYS PWLDSQAHSP YAWSVLGAVA 
QATSRAALMS YVTCPIRRYH PAVVAQKAST VGVLSDGRFT LGLGAGERLN EYVAGSWPHV
QQRHEMFEEA LKIIKPLLNG ETVTFSGNHY AVPDAYLWDR PAQPVPMAVA ASGRQSATLA
AEYADAIIAA EPDPHLLQVY EQTGGARKPR YGQVVICWGP DEAECRAILH DQFRWFGLGW
KVKAELPGPD SFAAATQFVS EEDAATGIPC GPDVDRHVAA FRRYVDAGFS HLALLQVGGG
SQPMFLEWAH ERLLPRLREL
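
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Python sketch below builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the sketch does not treat alternative start codons specially (this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text):
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGTGAACG TCGGCTACAC CCTGCTGAGT"))  # MVNVGYTLLS
```

The output matches the first block of the protein sequence. Translating the whole gene this way should stop at the terminal TGA.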