Gene Sare_2153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2153 
Symbol 
ID5706971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2475335 
End bp2477212 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content71% 
IMG OID641271638 
ProductFkbH like protein 
Protein accessionYP_001537009 
Protein GI159037756 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3882] Predicted enzyme involved in methoxymalonyl-ACP biosynthesis 
TIGRFAM ID[TIGR01681] HAD-superfamily phosphatase, subfamily IIIC
[TIGR01686] FkbH-like domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.153198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCG CGGAGCAGGC CGTCGCGCTG GCGATGGACG AGTTGGACGT GTGGAACGGT 
GTCCGCGAAG CGGTCCACAC CAACCAGTAC CCGGATGCCC GGCTACGGAC GCGACTGCTG
CGGGAACGCG ACCCGGTGCT GCTGCGCCGG GTGGGCAGAC TGCTCGACCG ACCGGAGTTG
GTCGGCGACG ACCTGCGTCC GATTCGGGTC AGCGTGCTCG CCCCGTTCAC GATCGGGTCG
TTCACCGACC TGCTACGGGC GTACCTGGTC GGCGCCGGCC TCGCCCCGAC GCTACAGGTC
GCCCAGTACG GTTCCTTCGA CCTGGCGCTG GCCACCGGGG AGTTCCCGGC ACACCAGCCG
GATCTCGTGG TGTTGCTGCT CGACGAGTCG GTGTTCCTAC CGACGGTGTG GTCTCCCGCC
GACGTCGACG ACCTCACCGA GCAGGTCCAG CGACGGCTGG ACGACTTCTG CTCGGCCGTG
ACGACCAGCG CCGCCGACGC GACAGCGTCG GTGGTGCTGC ACACGGTGCC GCTGCCGGCC
GAGGTGCGTG ACTCGTTCAT CGCGGCACGG GACCGGGCGG TGGTGGCCGG CTGCTGGCAC
CGCCTCAACG CGACGCTGCT CGACCTGTCC CGCCGGCACC CGCGAGTGCT CGCCGTAGAC
CTCGTCGGGG CGCTGGCGGA CACACCCTTC GCCGTTCGCG ACGATCGCCT GCACCGCTAC
GGTGACCTGC CCTACAGCGA CGGCGCACTG TCCTGCCTGG CACGCGAGGT ACGCCGGGTG
GCGCAGGCCA GCGCCGGGTC GTCCCGCAAG GTGCTGGCCC TCGACCTGGA CAACACGCTG
TGGGGCGGGG TTGTCGGCGA AGTCGGCGCC GAGGGGGTCA CGCTCGGCGG CCTCTATCCC
GGCAACGCCT ACCGACAGGT GCAACGGGCC GCGCAGAGAC TGCGCGAGCA GGGCGTGGTC
CTGGTCCTGA CCAGTAAGAA CGACACCGCC GTGGCGACTG ACGCGATGAT GTCTCACCCG
GAGATGCTGC TGCGGCCCGA TGCGTTCTCC TACCGTGCGA TCAACTGGTC ATCGAAGGCG
GAGAACCTAC GGGCGGCCGC CGCACACCTG GGGCTGTCCA CCGCCGCGAC GGTCTTCCTG
GACGACTCGC CGTTCGAACG CGGCCAGGTG TCCGGCTCGC TACCCGAGGT GGCCGTCCTT
CCGGCCGACG GCGATCCGGC TCGACTGGTA CGGACCCTGT TGGAGCCGGG TTGGTTCGAC
ACCCTGGACC TGACCGAGAC CGACCGCCGA CGCCCGGAGC TGTACCGCGG CCGGGCCGAG
CGCAGCACGT TCTCCACCGG CTTCGGCTCC TCGCAGGACT ACCTGCGGGC CCTCGGCATC
CACCTCGCCG TCGAGCCGGC GAACCGGTAC ACCGCGGCGA GGGTGGCCCA ACTGGCTGCC
CGCACCAACC AGTTCAACCT CACCGGCGTG CGGTTCGACC AGCCCGCGAC GACGGCGATG
GCAGCCGACC CGGGGTACCT GGTCGCTGCC TGCGCCGTCA CCGACCGCTT CGGCGACGAG
GGCGTCGTCG GCGCGGTCTG GGTGTGCCGC GGAGCACCTA CCTGGGAGGT GCTGAACCTG
GTCCTCAGCT GCCGGGTACT CGGCCGGGGG GTGGAGCTGG CGATCGTCGG GTGGCTGGTT
CGACAGGCCC GGCTGGCCGG CGCCGCGGCG GTGGAGGGCC GGTACACGCC GACAGCGAAG
AACGGTGCCG CACGCGACTT CTGGACCAGG GCCGGATTCA CCGCGGTCAC CGCGGAGCTC
TACCGCCTGG ACCTGCGTGG CGCCGACGAC CCAACCCCCG ATTGGATCAG CACAGAGGAG
CCCCAGCAGC ATGGATGA
 
Protein sequence
MTAAEQAVAL AMDELDVWNG VREAVHTNQY PDARLRTRLL RERDPVLLRR VGRLLDRPEL 
VGDDLRPIRV SVLAPFTIGS FTDLLRAYLV GAGLAPTLQV AQYGSFDLAL ATGEFPAHQP
DLVVLLLDES VFLPTVWSPA DVDDLTEQVQ RRLDDFCSAV TTSAADATAS VVLHTVPLPA
EVRDSFIAAR DRAVVAGCWH RLNATLLDLS RRHPRVLAVD LVGALADTPF AVRDDRLHRY
GDLPYSDGAL SCLAREVRRV AQASAGSSRK VLALDLDNTL WGGVVGEVGA EGVTLGGLYP
GNAYRQVQRA AQRLREQGVV LVLTSKNDTA VATDAMMSHP EMLLRPDAFS YRAINWSSKA
ENLRAAAAHL GLSTAATVFL DDSPFERGQV SGSLPEVAVL PADGDPARLV RTLLEPGWFD
TLDLTETDRR RPELYRGRAE RSTFSTGFGS SQDYLRALGI HLAVEPANRY TAARVAQLAA
RTNQFNLTGV RFDQPATTAM AADPGYLVAA CAVTDRFGDE GVVGAVWVCR GAPTWEVLNL
VLSCRVLGRG VELAIVGWLV RQARLAGAAA VEGRYTPTAK NGAARDFWTR AGFTAVTAEL
YRLDLRGADD PTPDWISTEE PQQHG