Gene Sare_0462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0462 
Symbol 
ID5705459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp530969 
End bp532015 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content70% 
IMG OID641269987 
Productalcohol dehydrogenase 
Protein accessionYP_001535382 
Protein GI159036129 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00242186 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGCGCTG TGACGGTGTC TCCCCGGGTC CGAGATTCGC TCCGCCTGGT CGAGGATTGG 
CCCGAGCCGG CCGCCGAGGA AGGCGCCATC CTGGTCGAGG CGCTGGCGGT GGGGGTCTGC
GGCACCGACC TGGAGATCGT CGAAGGCCAG TACGGCGAGG CGCCCCCCAG CCAGGAGCGG
CTGGTCATCG GGCACGAGTC GCTGGGCCGG GTGCTGGAGG ACCCGACCGG CACCCTGCAT
CCCGGTGACC TGGTGGCCGC TGTCGTGCGG CACCCCGATC CGGTGCCCTG CCCGAACTGC
GCGGTGGACG AGTGGGACAT GTGTCGCAAC GGGCGGTACA CCGAGCACGG CATCAAGGGG
CTCCCCGGCT TCGCCCGGGA CCGGTGGCGG GTACAGCCGA GGTTCGCCGT GCCGCTGGAC
ACCACCCTCG CCCGGGTGGG GGTGCTGCTG GAACCAACAA GCGTGGTCGC CAAGGCGTGG
GAGCACATCG AGCGGGTCGG TCACCGCGCC CAATGGGATC CACGGATCGC GCTGATCACC
GGTGCCGGGC CGATCGGCCT GCTCGCCGCG TTGCTCGCCA CCCAACGTGG ACTGACCGTG
CACGTACTGG ACCGGAACAC GACGGGACCG AAGCCGGATC TGGTCCGGGC ACTCGGCGCC
ACCTACCACA CCGGGTCGGT CAACGATGTG GACGTCGAGC CGGACGTCCT GGTGGAGTGC
ACCGGCGCGC CGACGGTGGT GCTGGATGCG ATGTGCAAGG TCGGCCCGAC CGGCGTCGTG
TGTCTGACCG GCGTGTCCAC CGGCGGTCGG ATCATCGACT TCGACGCCGG GGCACTGAAC
CGGACTCTCG TATTGGAGAA CAACGCGGTT ATCGGGTCGG TCAACGCCAA CCGCCGGCAC
TGGGGCCAGG CCGCCGACGC GCTGGCCCGG GCCGACCGAT CGTGGCTGGA ATCCCTGATC
ACCCGGCGGG TGCCGATGTC CGACTTCGCG ACCGCGTACG CGATCAGCGA CGGGGACATC
AAGGTGGTCC TGGATCTCAC CGCCTGA
 
Protein sequence
MRAVTVSPRV RDSLRLVEDW PEPAAEEGAI LVEALAVGVC GTDLEIVEGQ YGEAPPSQER 
LVIGHESLGR VLEDPTGTLH PGDLVAAVVR HPDPVPCPNC AVDEWDMCRN GRYTEHGIKG
LPGFARDRWR VQPRFAVPLD TTLARVGVLL EPTSVVAKAW EHIERVGHRA QWDPRIALIT
GAGPIGLLAA LLATQRGLTV HVLDRNTTGP KPDLVRALGA TYHTGSVNDV DVEPDVLVEC
TGAPTVVLDA MCKVGPTGVV CLTGVSTGGR IIDFDAGALN RTLVLENNAV IGSVNANRRH
WGQAADALAR ADRSWLESLI TRRVPMSDFA TAYAISDGDI KVVLDLTA