Gene Sare_4846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4846 
Symbol 
ID5707625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5502127 
End bp5503167 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content70% 
IMG OID641274242 
Productalcohol dehydrogenase 
Protein accessionYP_001539587 
Protein GI159040334 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.475925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0130408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTCG TACGCTTCCA CGCTCCCGGT GACGTCCGGA TCGAGGACGC CCCCGAGCCC 
ACTCCCGGGC GGGGAGAGGT GAAGCTTCGC GTCCGTAACT GTTCCACCTG CGGGACCGAT
GTGAAGATCT CCCGGTTCGG CCACCATCAC ATCGTGCCGC CCCGCGTGAT GGGGCACGAG
ATCGCCGGCG AGGTGGTGGA GGTCGGCGCC GCAGTGGACG GCTGGTCGCC CGGCGACCGG
GTCCAGGTCA TCGCCGCCAT TCCCTGCGGG CAGTGCGGCG AGTGCCAGCA GGGTCGCCGG
ACGGTCTGCC CCAACCAGGA GTCGATGGGC TACCACTACG ACGGTGGGTT CGCCGAGTAT
CTCGTGGTGC CGAGCACAGT GCTCGCCGTC GACGGGCTCA ACCGTATCCC GGACGGGGTC
AGCTACGCCG AAGCGTCGGT CGCCGAACCG CTGGCGTGCG TCCTCAACGG GCAGAACCTC
GCTCAGGTCG GTGCCGGGGA CGACGTGGTG GTGATCGGTT CCGGCCCGAT CGGCTGTCTG
CACGTACGGC TGGCCCGGGC CCGGGGTGCC AGGAGTGTGG TCCTGGTCGA CCTCAACCAG
GACCGGCTCA GTCAGGCCGC GGCGCTGGTC GCGCCGGACG CGACGATCTG CGCGGCCGAC
ACCGACCCGG TCGACGCCGT GCTCAAGTTC ACCAACGGGC GCGGCGCCGA CGTCATCATC
ACCGCCGCGG CCTCCGGCGC CGCTCAGGAA CAGGCCGTTC AGATGGCCGC CCGGCAGGGC
CGGATCAGCC TCTTCGGTGG GCTGCCGAAG GACCGGCCAG TCATCGGCCT GGACGCCAAC
CTGGTGCACT ACCGGGAGCT GACACTGGTC GGCGCGAACG GGTCGAGCCC CGCCCACAAC
GCCGAGGCGC TGCGCCTCAT CGCCTCCGGT GAGGTGCCGG TCGCCGACCT CATCACCCAC
CGGTTGCCGC TGGACGGCGC CATCGACGCC TTCGAACTGG TCGCCACCGG TGAGGCGATC
AAGGTGACGA TCGAGCCCTG A
 
Protein sequence
MKVVRFHAPG DVRIEDAPEP TPGRGEVKLR VRNCSTCGTD VKISRFGHHH IVPPRVMGHE 
IAGEVVEVGA AVDGWSPGDR VQVIAAIPCG QCGECQQGRR TVCPNQESMG YHYDGGFAEY
LVVPSTVLAV DGLNRIPDGV SYAEASVAEP LACVLNGQNL AQVGAGDDVV VIGSGPIGCL
HVRLARARGA RSVVLVDLNQ DRLSQAAALV APDATICAAD TDPVDAVLKF TNGRGADVII
TAAASGAAQE QAVQMAARQG RISLFGGLPK DRPVIGLDAN LVHYRELTLV GANGSSPAHN
AEALRLIASG EVPVADLITH RLPLDGAIDA FELVATGEAI KVTIEP