Gene Sare_3896 details


Gene Information

Locus tag: Sare_3896
Symbol:
ID: 5705834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4437687
End bp: 4438697
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 71%
IMG OID: 641273321
Product: amidohydrolase 2
Protein accession: YP_001538678
Protein GI: 159039425
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
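
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4437687
end_bp = 4438697

# Assumption: coordinates are inclusive, so both end positions count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1011
print(protein_length)  # 336
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.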


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.271019
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTACAC CGGTCGTCGA CGTGCACTCG CACGCCGTAC CGAAGGGCTG GCCCGACCTC 
GGTGCGGCCT GCGGTGGATC CGGCTGGCCC TGGCTGCGGG TCGACTCCGA GCGAGCCGCG
ATGATCATGC TCGGGGAGAC CGAGTTCCGG CCGGTCGGTG TGGAGTGCTG GGATCCGGCC
ACCCGACTGG CGGACATGTC CACCGACGGT GTCGACGTGC AGGTGGTCTC GCCGACACCG
GTCTTCTTCT GCTACGACCG CCCCGCCGTC CAGGCGGTCA AGGTGGCCCG CATCTTCAAC
GACCGTATGT TGGAGATCAC GGCAGCCGCA GACGGCCGTT TGGTTCCGTT CTGCCAGGTG
CCGTTGCAGG ACCCGGAGGC CGCCTGCGCC GAGCTGGACC GCTGCCTCGC CGCGGGGCAC
GCCGGGGTGG AGATCGGAAA CCATGTCGGC GACCTCGACC TGGACGACAC CGGCATCGTC
GAGTTCCTCA CCCACTGCGC CGAGGTGGGC GCGCCGGTCT TCGTCCACCC GTGGGACATG
CCAGGCGGGC CGCGGCTGGA CCGGTGGATG GCCCGATGGC TCGCCGGGAT GCCGGCCGAG
ACCCACCTGT CGGTGCTGGC GATGATCCTC GGTGGCGTCT TCGACCGGGT GCCGGAGACG
TTGCGGATCT GCTTCGCACA CGGCGGCGGC AGCTTCCCGT TCTGGCTGGG CCGCGCGGAC
AACGCCTGGC ATCGCCGGGG AGACCTCGTC CGCGGCGCCT CGGAAGGGCC CCCCGGCTCG
TACCTGGACC GGTTCTTCGT CGATTCGGTG GTGTTCGATC CGGCGGCGCT GCGGCTCCTG
GTCGACACGA TGGGCGCCGA CCAGGTGCTG GTCGGCAGTG ACTATCCGTA CCCACTCGGG
GAGCGGCCGG TTGGTGCGGT CGTGCACCGG TCCGACTTCC TCACCGCCGA CCAGCGCATC
AGCCTGCTCG GCGGCAACGC GTTGCGGTTC CTCGGCCGGG CGCCGGGATG A
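
A GC content figure like the one in the Gene Information fields can be recomputed from a sequence listed in this layout. The sketch below assumes that GC content means the percentage of G and C bases over all bases, and that the spaces and line breaks in the listing are purely cosmetic; `gc_percent` is a hypothetical helper name, and how the record rounds its own figure is not known.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used in the
    listing) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence here; only the first line of the
# listing is shown to keep the example short.
first_line = "ATGGGTACAC CGGTCGTCGA CGTGCACTCG CACGCCGTAC CGAAGGGCTG GCCCGACCTC"
print(round(gc_percent(first_line), 1))
```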
 
Protein sequence
MGTPVVDVHS HAVPKGWPDL GAACGGSGWP WLRVDSERAA MIMLGETEFR PVGVECWDPA 
TRLADMSTDG VDVQVVSPTP VFFCYDRPAV QAVKVARIFN DRMLEITAAA DGRLVPFCQV
PLQDPEAACA ELDRCLAAGH AGVEIGNHVG DLDLDDTGIV EFLTHCAEVG APVFVHPWDM
PGGPRLDRWM ARWLAGMPAE THLSVLAMIL GGVFDRVPET LRICFAHGGG SFPFWLGRAD
NAWHRRGDLV RGASEGPPGS YLDRFFVDSV VFDPAALRLL VDTMGADQVL VGSDYPYPLG
ERPVGAVVHR SDFLTADQRI SLLGGNALRF LGRAPG
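
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration and not the database's own procedure. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons), and it assumes the gene sequence is listed in coding orientation. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop listing whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGGGTACAC CGGTCGTCGA CGTGCACTCG"
print(translate(first_block))  # MGTPVVDVHS
```

The output matches the first ten residues of the protein sequence listed above.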