Gene Sare_4378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4378 
Symbol 
ID5705069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4946974 
End bp4948116 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content67% 
IMG OID641273800 
Productepocide hydrolase domain-containing protein 
Protein accessionYP_001539150 
Protein GI159039897 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.925991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000447633 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACGAAA ACAACGCACT CACACCGTTC CGCATCGATA TCCCACAGGC CGACGTCGAC 
GACCTACGGA ACCGGCTGGC ACACACCCGC TGGCCGATCC CCGTTCCGGG CCGCGACGAG
CGCACCGACT TCAGCCGCGG CATCCCGCTG GTGTACCTGA AGGAACTCGC CGAGTACTGG
CACGACGAGT TCGACTGGCG TGCGCAGGAG AAGAAGCTCA ACGAGTACGA ACAGTTCACC
ACGGTCGTCA ACCGCCAGAC GTTCCACGTC GTCCACGTGC GGTCGACGAA CCCGGCGGCC
ACCCCCCTGA TGCTGAACCA CGGCTGGCCT GGCTCGTTCG TGGAGTACCA GCGACTCATC
CCGCTGCTGA CCGGTGAGTT CCACGTGGTC ATCCCGTCAC TGCCCGGCTT CGGGTTCTCC
ACCCCGCTGT CGGGGACCGG CTGGGAGTTG GCGCGGACGG CGGATGCCTA CGCCGAGATC
ATGACGCGCC TCGGCTACGA GAGGTTCGCG GCGCACGGTA CCGACATCGG TTCGGGTACC
ACCGGTCGCC TCGCGGCGGT CTACCCGGAG CGCGTCATCG GCACGCACCT CGGCGTCGAC
CCCCACTTGC TCGCGTTGGT CGGCGACAAG TTCCCCTACC CCGACGGTCT GTCCGACGAC
GAGATCACCC AGATCGAGGC CGTGCGCGCC GAGGACGCGG CCGATCGCGG GTATCTTCTG
ATGCACAACC ACCGCCCCGA CACGATCGGC GCGGCGCTCA CCGACTCGCC GGTCGGTCAG
CTCGCGTGGA TCGCCGAGAA GTTCAAGACC AGGGCCAACG GCGCCTGGCG GACGCCGGAC
GAGTCGGTCG ACCGCGACCA GCTCCTCACG AACATCAGCC TGTACTGGTT CACCCGCGGC
GGTGAGTCGA GCGCCCAGTT CTACTACGAG GCCGAGCACT CCGGACTCGA CTTGGTCATG
GCCTCAAGCG TGCCGTCCGG ATGGGCCGTG TTCAACTCCA ACCCGCTCGT GCGTCGGGCG
ATGGACCCGT GGAAGGCGAT CGGCCACTGG AGCGAGTTCA CCGAGGGCGG TCACTTCCCC
GCGATGGATG CGACGGAGTT GCTCGCGGAC GACATCCGCA CCTTCTTCCG CGGCATTGCC
TGA
 
Protein sequence
MNENNALTPF RIDIPQADVD DLRNRLAHTR WPIPVPGRDE RTDFSRGIPL VYLKELAEYW 
HDEFDWRAQE KKLNEYEQFT TVVNRQTFHV VHVRSTNPAA TPLMLNHGWP GSFVEYQRLI
PLLTGEFHVV IPSLPGFGFS TPLSGTGWEL ARTADAYAEI MTRLGYERFA AHGTDIGSGT
TGRLAAVYPE RVIGTHLGVD PHLLALVGDK FPYPDGLSDD EITQIEAVRA EDAADRGYLL
MHNHRPDTIG AALTDSPVGQ LAWIAEKFKT RANGAWRTPD ESVDRDQLLT NISLYWFTRG
GESSAQFYYE AEHSGLDLVM ASSVPSGWAV FNSNPLVRRA MDPWKAIGHW SEFTEGGHFP
AMDATELLAD DIRTFFRGIA