Gene Sare_4704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4704 
Symbol 
ID5707213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5325700 
End bp5326974 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content67% 
IMG OID641274102 
Productamidohydrolase 
Protein accessionYP_001539448 
Protein GI159040195 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0167174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCGG TACGTCGGCC GGACCCGTCG GCTGTATTCG CGGTGCGCGC GGCCCGCATG 
TTCGACGGAT TTGAGCTACA CACCGGACAT CCACTGGTGT TCGTGAAGAA GAGTCGGATC
GTGGGTATCG ACAAGTCCGG GGCCCATCCG GCGACCGAGG TGCCGGTTGT CGACCTTGGT
GATGCCACAC TACTGCCGGG TCTGATCGAC ACTCACGTGC ACCTGGCCTT CGATCCCGAG
GTGAGCGCCA AGCAGGAGAT CGTCACGGAC AGCGACGCGA CGATCGTGCG ACGGATGCGG
CGACACGCCG GGCAGCACCT GATGGCCGGC GTCACGACCG TGCGGGACCT CGGCGATCGC
GGCTATCTCA GCCTCGACGT ACGCGATTCT GCCGGCCAGG CTTCGGGTCT GTACCCGGAG
ATTCTGTGCG CCGGTCCGCC AATCACCAGA CACGGCGGTC ACTGTTGGTT CCTGGGGGGA
GAGGCCGACG GTGCCGACGC TATCCGGAAG GCCGTTGCGC ATCGCGTTGC ACGGGGCGTT
GACACGGTGA AGATCATGGC CACCGGCGGT GCGATCACTC CCGGATGGCG TCCGGACGAG
TCCCAGTACA ACGCCGAGGA GCTTCGGTGT GCCGCGGAGA CGGCGCACCG GTCCGGGGTG
CCCATCACCG CACACGCACA CGGTCCGCAG GGCATCGCTG ACGCCGTTGC CGGGGGCGCG
GACGGTGTCG AGCACTGCTC GTTTTTCACC AGGGATGGCA TCGAACCGGA CTGGGAACTG
GTCGATGCCA TGGCTGAGGC GGGAACGTAC GTGGGCGCCA CCGAGGCATG GCTTCCGGAG
GGCAAGATGC TGGCACCGCA TCTGGCTCAG CGTCTAGAAC AACGTACCCA GACCTTTGCC
CGGATGCACC GCGTGGGAGT GCGCCTGGTG TGTTGCTCGG ATGCGGGAGC GGGTCCCCGT
AAGCCACACG GGGTGCTGCC CCACGGCATC GTCCACCTCG GTGCGAACGG ATGGGCCAAC
GTTGAGGCCC TCAGGTCGGT GACGACCCTC GCTGCGGAAG CCTGTGCACT CGCCGACCGG
AAGGGACGGA TCGCGGTCGG ACACGACGCC GACCTGCTCG CCGTAGCAGG TAACCCACTC
GAACGGCTGA CGGACATGTT CCGGGTCTCC GCCGTCTGGC GGGGTGGCAC CCCGGTCGAC
CTGCGGACCG TCGGCAGTCG GGAGCGAAGA ACGGGGCCGG GCGGATCCAC GGTTGAGGGA
TTGGGCCGTT CGTGA
 
Protein sequence
MRPVRRPDPS AVFAVRAARM FDGFELHTGH PLVFVKKSRI VGIDKSGAHP ATEVPVVDLG 
DATLLPGLID THVHLAFDPE VSAKQEIVTD SDATIVRRMR RHAGQHLMAG VTTVRDLGDR
GYLSLDVRDS AGQASGLYPE ILCAGPPITR HGGHCWFLGG EADGADAIRK AVAHRVARGV
DTVKIMATGG AITPGWRPDE SQYNAEELRC AAETAHRSGV PITAHAHGPQ GIADAVAGGA
DGVEHCSFFT RDGIEPDWEL VDAMAEAGTY VGATEAWLPE GKMLAPHLAQ RLEQRTQTFA
RMHRVGVRLV CCSDAGAGPR KPHGVLPHGI VHLGANGWAN VEALRSVTTL AAEACALADR
KGRIAVGHDA DLLAVAGNPL ERLTDMFRVS AVWRGGTPVD LRTVGSRERR TGPGGSTVEG
LGRS