Gene Sare_0095 details


Gene Information

Locus tag: Sare_0095
Symbol:
ID: 5707065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 106526
End bp: 107461
Gene Length: 936 bp
Protein Length: 311 aa
Translation table: 11
GC content: 75%
IMG OID: 641269621
Product: urea amidolyase related protein
Protein accession: YP_001535021
Protein GI: 159035768
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
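
The coordinate and length fields in this record agree with one another. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state the convention, so this is an assumption) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(106526, 107461)
print(length)                     # 936
print(protein_length_aa(length))  # 311
```

Both values match the Gene Length and Protein Length fields of the record.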


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000550439
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGGCA TCCCACCGGC CGACCGGGTC GGGGCGGCCG ACACACCCGG CAGAAGTGCG 
GAGCCCGGCA TGGTCGAGGA GCCCGGCATG GTCGAGGTAG TGCGGGCCGG CGCTCTCACC
ACCGTGCAGG ATCTCGGCCG GCCCGGCTGG GCGCACCTCG GCGTACCGCG CTCCGGTGCC
CTCGACCCGA GCGCGCTCCG GTTGGCCAAC CGGCTGGTCG GCAACCCGGA GACCGCCGCC
GGTCTGGAGA TCACCCTCAC CGGCTGTGGG CTGCGGTTTC GCGGTGCCAC CACCGTCGCG
GTCACCGGGG CGGACGTCCC CGTGCGGGTC AATGATCGGC CCGGCGATGT AGGACGGCCG
CTCGCCGTGC CGGCGGGCGC GGTGCTGCGG GTCGGCCCAC CCCGCACCGG CCTGCGGTCC
TGGCTCGCGG TCGCCGGTGG GTTCGCCGTC GAACCGGTGC TCGGCAGCCG CGCCACGGAC
ACCCTTTCCG GGCTCGGCCC GCCCCTGCTG CGCGACGGCG ACCGGCTTCC CATGGGCGTG
CCAGCTGGGC CGCCCGCCCC GGTGGACGCC ACCGCGACCG TGCCGACGCC GGCCGAGGTG
CGGCTGGCAC TGCGCCTTGG CCCGCGGGCC GACTGGTTCA CGCCACTCGC GCTCGAACTG
CTGCTCGGCA CGGCCTACAC CCTCACTCCG CTCAGTAACC GCATCGGTGC TCGGCTGTCC
GGGGCGCCGC TGCCCCGCGC GGTGGTGGGG GAACTGCCCA GTGAGGGCCT CGTGCTCGGT
GCGGTGCAGG TGCCGGCGGA CGGCCAACCC CTGGTCTTCC TCGCCGACCA TCCGACCACC
GGTGGATACC CGGTCGTCGG GGTGGTGGTC GACGTGACCC CGCTTGCGCA GGCCCGGCCA
GGCACTACGG TGAGGTTCCA TGGATCTCAA CGCTGA
 
Protein sequence
MTGIPPADRV GAADTPGRSA EPGMVEEPGM VEVVRAGALT TVQDLGRPGW AHLGVPRSGA 
LDPSALRLAN RLVGNPETAA GLEITLTGCG LRFRGATTVA VTGADVPVRV NDRPGDVGRP
LAVPAGAVLR VGPPRTGLRS WLAVAGGFAV EPVLGSRATD TLSGLGPPLL RDGDRLPMGV
PAGPPAPVDA TATVPTPAEV RLALRLGPRA DWFTPLALEL LLGTAYTLTP LSNRIGARLS
GAPLPRAVVG ELPSEGLVLG AVQVPADGQP LVFLADHPTT GGYPVVGVVV DVTPLAQARP
GTTVRFHGSQ R
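
The protein sequence can be rederived from the gene sequence, and the GC content recomputed, with a few lines of standard-library Python. The sketch below is a minimal illustration, not the method used to produce this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). It strips the spaces and line breaks used in the 10-base layout above before processing.

```python
from itertools import product

# Codon table for NCBI translation table 11, in the usual T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGACCGGCA TCCCACCGGC CGACCGGGTC"))  # MTGIPPADRV
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields of the record.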