Gene Sare_4940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4940 
Symbol 
ID5706490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5608318 
End bp5609529 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content66% 
IMG OID641274335 
Productepocide hydrolase domain-containing protein 
Protein accessionYP_001539677 
Protein GI159040424 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.23787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCCGT ACCGTGTTGA AATCCCAGCA GAGGCCATCG ACGACCTTCG TGCTCGATTG 
GGCCAGACCC GGTGGCCGGC CGAGACCCCG GACGTCGGCT GGAGCCGCGG GGTGCCGCAG
ACCTACCTTC GGGATCTGGT CGAATACTGG CGCACCGAGT ACGACTGGCG CGCCACAGAG
GCTCGGATCA ACCAGTATCC GCAGTTCATG ACCAACGTCG ACGGTGCGAA CATCCACTTC
CTGCACGTGC GGTCACCCGA GCCCGACGCG GTGCCGATGG TGATCACCAC GGGCTGGCCG
AGCTCGATCA TCGAGTATCT CGACGTGATC GGCCCACTGA CCGATCCCCG AGCCCACGGC
GGCGACCCGA AGGATGCGTT CCACCTGGTC ATTCCCTCGC TGCCCGGGTT CGGGTTCTCC
ACCCCGCTCA CCGAGCACGG CTGGACGGTC CCTCGGATGT CGGCCGTCTG GGCCAAGTTC
ATGGCCGCCG TGGGGTATGA CCGATACATC GCGCAGGGCG CCGACTGGGG CTCGTTCATC
TCGCTCATTC TCGCCGGGGT CGACCCCGAT CACGTGCTCG CCGCTCACGT GAACTTCCTC
GTGACGCCGC CGACCGACGC GTCCGACCTG GCCGGCCTCA GTTCGGAAGA GCTGGCCCTG
CTGGACCCGT ACATGCTGCC CGCGCCCGGC TACATGGTCG AGCACGCGAC CAAGCCGCAG
ACCCTCAGCT ACTCTCTCAC CGACTCGCCG GTCGGCCAAC TCGCGTGGTA CATCGAGAAG
TTTCACCAGT GGTCGGGCGC GGACAAGTCC CCCGAGGACG TCTTCGACCG CGATGCGCTG
CTCGCCAACG TCACGCTGTA CTGGTTGACC GGGACGGCCG GCTCGGCGGC ACACTTCTAC
TGCGACAACG CGCCGTTCAC GCGTACCTCG GCGACCCCGC ATCCGGAACT GGCCGTCGCC
CACGAGAAGT TCGAAGCCCA CCGCACCTTT GTGGCGCCGC TGCCGCCGGT CACCAGGCCT
GTCGGGGTTG CGCTGTACCC GGACGACATC ATGATGCCCA TTCGCAGTTA CGCAGAGCGC
GCATTTACTG ACATCGTGCA TTGGAACAAA CTCGAGCGCG GAGGCCACTT CCCCGCCCTG
GAGGCGCCTG ACCTGTTCGT CGAGGACCTG CGGGCATTCC GGCGTGCCCT GCGCACCCGA
CAGGAAAGCT GA
 
Protein sequence
MRPYRVEIPA EAIDDLRARL GQTRWPAETP DVGWSRGVPQ TYLRDLVEYW RTEYDWRATE 
ARINQYPQFM TNVDGANIHF LHVRSPEPDA VPMVITTGWP SSIIEYLDVI GPLTDPRAHG
GDPKDAFHLV IPSLPGFGFS TPLTEHGWTV PRMSAVWAKF MAAVGYDRYI AQGADWGSFI
SLILAGVDPD HVLAAHVNFL VTPPTDASDL AGLSSEELAL LDPYMLPAPG YMVEHATKPQ
TLSYSLTDSP VGQLAWYIEK FHQWSGADKS PEDVFDRDAL LANVTLYWLT GTAGSAAHFY
CDNAPFTRTS ATPHPELAVA HEKFEAHRTF VAPLPPVTRP VGVALYPDDI MMPIRSYAER
AFTDIVHWNK LERGGHFPAL EAPDLFVEDL RAFRRALRTR QES