Gene Sare_4000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4000 
Symbol 
ID5704886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4550402 
End bp4552189 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content67% 
IMG OID641273425 
Product3-hydroxybutyryl-CoA dehydrogenase 
Protein accessionYP_001538781 
Protein GI159039528 
COG category[I] Lipid transport and metabolism 
COG ID[COG1250] 3-hydroxyacyl-CoA dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.257206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0011144 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCGAGT TCGCCAGTGT GGGTGTGGTG GGTCTGGGCA CCATGGGTAC CGGCATCGTC 
GAAGTTTTCG CCCGTAACGG CATCGACGTC GTCGCGGTCG AGGTGGCCGA GCCGGCGCTG
GAGCGTGGTC GGGCCACCCT GACCGGCTCA ACCGACCGGG CGGTGGCCAA GGGCAAACTC
GCGGCGGCTG ACCGGGACGC GCTCCTGGCC CGCGTCGACT GGCAGGTCGG GCTGGATGCG
TTGCACTCGG TGGACCTGGT GATCGAGGCC GTTCCCGAGC ACCTCGACCT GAAACAGCGG
ATCTTCGCGG AGCTGGATCG AGTCTGCAAG CCCGAGGCGG TCCTCGCCAC CAACACCTCG
TCGCTGTCCG TCACCGAGAT TTCGGTGGCC ACGACCCGCC CCAACCAGGT CCTCGGCATC
CACTTCTTCA ACCCGGCGCC GGTTATGAAG CTGGTCGAGA TCGTCCGGAC GGTGGTCACC
TCGGCGGAGG TGGTCGCCGA CGTGGAAGGG CTCTGCAAGC GGCTGGGCAA GGTCGATGTG
ACGATCAATG ACCGCGCCGG CTTTATCGCC AACGCCCTGC TCTTCGGCTA CCTCAACCAT
GCGGTCAGCA TGGTCGAGGC GCGGTACGCC AGCCGGGAGG ACATCGACGC GGCGATGAAA
CTCGGCTGCG GTCTGCCGAT GGGCCCGTTG GCGCTGATGG ACCTGATCGG GCTGGACACC
GCGTACGAGA TTCTGGACAC GATGTACCGG CGGGGTGGTC GGGACCGTCG GCATGCTCCG
GTGCCGCTGA TCAAGCAGAT GGTTACCGCC GGGCTGCTCG GGCGGAAGTC GGGGCGCGGC
TTCTACACCT ACGAGCGGCC GGGTTCGCCG GTGGTCGTAC CGGACGAGCA GACGCCAGCG
AGCGCGGAGT CGACCCTGGG CGCCGGCGTT CGGGGCATCA CGAAGGTCGG CGTGGTCGGT
TCCGGAACGA TGGCGACCGG GATCATCGAA GTGTTCGCCA AGGCCGGCTA CGAGGTGGTC
TCGGTGACCC GGGGGGCGGA GAAGTCCGCC AAGCTCTTCG AGGCGGTGAA GACCTCACTC
AACAAGGGAG TGGTGCGGGG CAAGCTCGGC GAGGCCGATC GGGACGCCGC GCTGGGCCGG
ATCACGTGGT CCGCGACCTT GGAGCACCTC ACCGACGTGG ATCTGGTGGT CGAGGCGGTC
ATCGAGGAAC TGAGTGTCAA GAAGGCGTTG TTCGCCAGCC TCGACGAGAT CTGCAAGCCG
GGCGTCGTTC TCGCCACCAC CACCTCGTCG CTGCCGGTGA TCGACGTGGC GATGGCCACT
CAACGTCCGG CGGACGTGGT GGGCCTACAC TTCTTCAACC CGGCACCGGT CATGCCGCTG
GTGGAGATCG TGCAGACCAT CCGCACGTCG GCGGAGACCA CCGCCACCGC GAGGGCAGTC
TGCGCGGACC TGGGCAAGAC GGGTGTGGTC TGTGGAGACC GCTCCGGGTT CATCGTGAAC
GCGCTGCTCT TTCCGTACCT CAACGACGCG GTCAAGATGC TGGAGGCGAG CTACTCGACC
GCCGATGACA TCGACCACGC GATGAAGCTC GGTTGCGGGT ACCCGATGGG TCCGTTCGAT
CTGCTCGACG TGGTCGGCCT GGACGTGTCC CTGGCTATCC AGCGGGAGCT CTACCTGGAG
CTGCGCGAGC CGGGCTTCGC CCCCGCTCCG CTGCTGGAGC ACCTGGTCAC CGCCGGCTAT
CTGGGCCGGA AGTCTGGCCG TGGCTTCCGT GACCACACCC GCCGCTGA
 
Protein sequence
MREFASVGVV GLGTMGTGIV EVFARNGIDV VAVEVAEPAL ERGRATLTGS TDRAVAKGKL 
AAADRDALLA RVDWQVGLDA LHSVDLVIEA VPEHLDLKQR IFAELDRVCK PEAVLATNTS
SLSVTEISVA TTRPNQVLGI HFFNPAPVMK LVEIVRTVVT SAEVVADVEG LCKRLGKVDV
TINDRAGFIA NALLFGYLNH AVSMVEARYA SREDIDAAMK LGCGLPMGPL ALMDLIGLDT
AYEILDTMYR RGGRDRRHAP VPLIKQMVTA GLLGRKSGRG FYTYERPGSP VVVPDEQTPA
SAESTLGAGV RGITKVGVVG SGTMATGIIE VFAKAGYEVV SVTRGAEKSA KLFEAVKTSL
NKGVVRGKLG EADRDAALGR ITWSATLEHL TDVDLVVEAV IEELSVKKAL FASLDEICKP
GVVLATTTSS LPVIDVAMAT QRPADVVGLH FFNPAPVMPL VEIVQTIRTS AETTATARAV
CADLGKTGVV CGDRSGFIVN ALLFPYLNDA VKMLEASYST ADDIDHAMKL GCGYPMGPFD
LLDVVGLDVS LAIQRELYLE LREPGFAPAP LLEHLVTAGY LGRKSGRGFR DHTRR