Gene Sare_2114 details


Gene Information

Locus tag: Sare_2114
Symbol:
ID: 5704968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2436186
End bp: 2437154
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 71%
IMG OID: 641271599
Product: alcohol dehydrogenase
Protein accession: YP_001536970
Protein GI: 159037717
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
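
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2436186
END_BP = 2437154
GENE_LENGTH_BP = 969
PROTEIN_LENGTH_AA = 322


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming both ends are inclusive."""
    return end - start + 1


def coding_length_to_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends in exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)  # True
print(coding_length_to_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 969 bp gives 323 codons, of which the last is the stop codon, so the 322 aa protein length agrees with the gene length.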


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.76358
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00286846
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCGTCGGG CGGTGATCGA CCGGCACGGG CCGCCGGGCG TGCTGCGGGT CGAGGAAGTC 
GAGGACCCTC TACCCACTGC TGGTCACGTC CTGGTCCGGG TGGCGGCGGC CGGCGTCAAC
TTCGTCGACC TGCACCAACG TGGGGGCGCG TACCGGGTCG ACCTTCCGTT CCTGCCGGGG
TTCGAGGGCA GCGGAACAGT GCTCGCCGTC GGTGACGGAG TCACCGGTGT GCACGAGGGG
GACCGGATCG CCTGGTCCGG CTGCCCTGGT TCCTACGCCA CCCACTGTCT GGTGCCCGCC
CAGCGAGTGG TGCCCGTACC GGATCCGATC TCGCTGACCG ACGCGGCGGC CGTCCTGGTC
CAGGGCATGA CGGCACACTT CCTCGTGTCG GATGTGGCGC CCCTGGCCGA GGCTGACGTG
TGCCTGGTGC AGGCAGCCGC CGGCGGGGTG GGTGGCCTGC TCACCCAACT GGCCGTGCTG
CGGGGCGCCA CCGTGATCGG CACGGTGTCG AGCTCCGCGA AGGCGGCGGC GGCGCGGCAG
GCGGGTGCGA CACACGTGGT CGACTACTCC CGGGAGCCGT TCCACCCCAG GGTTCTGGAG
ATCACCGGTG GGCGCGGTGT GGACGTCGTG TACGACGCCG TGGGGCGCGA CACGTTCGAG
ACCGGCCTGG CCTGCCTACG CCCGCGTGGC ATGTTCGTGC TGTACGGGCA GTCCAGCGGC
CAGCCCGGAC CGATCGAGCC GCAGGTCCTG AACGCCCGTG GGTCGCTGTT TCTCACCAAG
GCATCGCTCG GGCACTACGA CACCACCCGG GAACAGTTCC TGCGTCGGGC GGCCGCCGTG
TTCGACCTGG TGGCGAGCGG CCGGCTACGT CCTCGGGTGC ACGCCACGTA CCAGTTGGAC
GACGCCCCGG CAGCGCACGA GGCAGTCGAG TCCCGGACCG CCGCCGGTAA GGTTCTGCTC
TGCCCCTGA
 
Protein sequence
MRRAVIDRHG PPGVLRVEEV EDPLPTAGHV LVRVAAAGVN FVDLHQRGGA YRVDLPFLPG 
FEGSGTVLAV GDGVTGVHEG DRIAWSGCPG SYATHCLVPA QRVVPVPDPI SLTDAAAVLV
QGMTAHFLVS DVAPLAEADV CLVQAAAGGV GGLLTQLAVL RGATVIGTVS SSAKAAAARQ
AGATHVVDYS REPFHPRVLE ITGGRGVDVV YDAVGRDTFE TGLACLRPRG MFVLYGQSSG
QPGPIEPQVL NARGSLFLTK ASLGHYDTTR EQFLRRAAAV FDLVASGRLR PRVHATYQLD
DAPAAHEAVE SRTAAGKVLL CP
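
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the blocked gene sequence could be cleaned, its GC fraction computed, and its codons translated. All function names are hypothetical. The codon assignments are the standard ones, which translation table 11 shares for elongation; the alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGCGTCGGG CGGTGATCGA CCGGCACGGG")
print(translate(first_blocks))  # MRRAVIDRHG
```

The translated fragment matches the first ten residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.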