Gene Sare_4271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4271 
Symbol 
ID5705776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4847338 
End bp4848288 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content70% 
IMG OID641273690 
Productluciferase family protein 
Protein accessionYP_001539043 
Protein GI159039790 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03560] probable F420-dependent oxidoreductase, Rv1855c family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0653657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0110143 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTCT CCGTGTTCAC CGAGCCACAC CGCGGAGCCA GCTACGACGA CCAGCTCCGG 
TTCGCACGCC ACACGGAGAG CTGCGGCTAC GACGGATTCC TCCGCGCCGA CCACTACCGG
GCGATGGGTG ACGAGCCAGC GCTACCCGGC CCGACGGATG CCTGGCTGAC ACTGGCCGCT
CTGGCTCGCG AGACCAGCCG TATCCGGCTC GGAACGCTGG TCACGTCGGC GACCTTCCGG
CTACCCGGCC CGCTGGCCGT CATGGTGGCC CAGGTCGACC AGATGAGTGG TGGCCGAATC
GAGCTGGGCA TCGGCGCCGG CTGGTACGAA CGGGAGCACA CCGCCTACGG GATCCCGTTC
CCCGGTGTCA GGGAACGGTT CGACCGGCTG GCCGAGCAAC TCGAGATCGT CACCGGGCTG
TGGCGTACCC CTTCAGAAGG CAGGTTCAAC TACAGCGGAG AGCACTATCA GCTCGTTGAC
GCGCCAGCGT TGCCGAAGCC GGTCCAGCGA CCCGGGCCAC CGGTGATCGT GGGCGGTCGC
GGCCCGAAGC GTACCCCCGA GCTGGCCGCC CGGTACGCCG ACGAGTTCAA CATGCCGTTC
ACCTCCGTCC CGCAGGCCGC AGCCGCGTAC GAGCGGGTCC GAGAGGCGTG CGACGCCGCC
GGTCGGACCG ACTCCGGCCG GCAGCCGCTG GTGCTCTCGG CAGGAATCGT GGTGGCCGTT
GGCCGCACGG ACGCCGAGGC ACGTCGCCGC GCCGCGCCAC TGCACCGGCC CAGCGCCCTG
CCCCCGGAGG ACCCGGTCGT CGGTTCGCCC GCCCAACTCG TCGAACGGAT CGGTGAGTTC
GCCGCGGTCG GAGCGACCCG CGTCCATCTG CGCCTCATCG ACCTCGCAGA CCTCGACCAC
CTGGAGCTCA TCGCCGCCGA CGTGCTCCCC CAACTGGATG GACCACAATG A
 
Protein sequence
MRVSVFTEPH RGASYDDQLR FARHTESCGY DGFLRADHYR AMGDEPALPG PTDAWLTLAA 
LARETSRIRL GTLVTSATFR LPGPLAVMVA QVDQMSGGRI ELGIGAGWYE REHTAYGIPF
PGVRERFDRL AEQLEIVTGL WRTPSEGRFN YSGEHYQLVD APALPKPVQR PGPPVIVGGR
GPKRTPELAA RYADEFNMPF TSVPQAAAAY ERVREACDAA GRTDSGRQPL VLSAGIVVAV
GRTDAEARRR AAPLHRPSAL PPEDPVVGSP AQLVERIGEF AAVGATRVHL RLIDLADLDH
LELIAADVLP QLDGPQ