Gene Sare_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1000 
Symbol 
ID5704682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1124381 
End bp1126030 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content71% 
IMG OID641270515 
Producturocanate hydratase 
Protein accessionYP_001535902 
Protein GI159036649 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.208315 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGC CGATCCGTGC CACGCGCGGC ACCACCCGCA CCGCCCAGGG CTGGCCCCAG 
GAAGCCGCGC GGCGGATGCT GATGAACAAC CTCGACCCCG AGGTGGCCGA ACGTCCCGAG
GACCTGGTGG TATACGGCGG GACCGGGAAG GCGGCACGGG ACTGGCCGTC GTACCGCGCA
CTGCTGGACA CCCTCACCGA CCTGCGCGAC GACGAGACGA TGCTGGTGCA GTCCGGTCGA
CCGGTCGCGG TGATGCGAAC CCACGAATGG GCGCCACGGG TGCTGCTCGC CAACTCCAAC
CTGGTCGGAG ACTGGGCGAC CTGGCCGGAG TTCCGGCGCC TGGAACAGCT GGGCCTGACC
ATGTACGGGC AGATGACCGC CGGATCGTGG ATCTACATCG GCACCCAGGG GATCCTCCAG
GGCACCTACG AGACGTTCGC GGCCGTCGCC GCGAAGCGGT TCGGCGGATC CCTGGCCGGC
ACGCTGACGC TGACCGCCGG CTGCGGTGGG ATGGGCGGGG CGCAACCGCT CGCGGTGACC
ATGAACGGCG GCTCCTGCCT GATCGTGGAC GTCGACCGGT CCCGCCTCGA ACGCCGGGTG
CGCGAACACT ACCTGGACGA GGTCGCCGAC TCGCTCGACG ACGCCGTACA ACGGGCACTC
GCCGCCCGCG ACCAACGACG GGCACGCAGC GTCGGAGTGG TCGGCAACGC GGCCACCATC
TTCCCCGAGC TGCTTCGCCG CGGCGTCCCG GTGGACGTGG TGACCGACCA GACCAGCGCC
CACGACCCGC TGTCGTACCT GCCGGAAGGG GTCGAGCTGA CCGACGCCCG CGACTACGCG
GCGGCCAAGC CGGCCGAGTT CACCGACCGT GCCCGCGCGT CGATGGCCCG GCACGTCGAG
GCGATGGTCG GCTTCCTCGA CGCGGGCGCC GAGGTCTTCG ACTACGGCAA CTCGATCCGC
GGCGAGGCGC AGCTCGGTGG ATACTCGCGC GCCTTCGACT TCCCGGGTTT CGTGCCCGCC
TACATCCGTC CGCTGTTCTG CGCGGGCAAG GGCCCGTTCC GGTGGGCGGC GCTCTCCGGC
GACCCGGCCG ACATCGCCGC CACCGACCGG GCCATCCTCG ACCTCTTCCC GGAGAACGAA
CCGCTGGCCC GGTGGATCCG GATGGCCGGC GAACGGGTGG CGTTCCAGGG ACTACCAGCC
CGGATCTGCT GGCTCGGCTA CGGCGAACGA GACCGGGCCG GGGTGCGGTT CAACGAGATG
GTCGCCGCCG GGGAGTTGTC CGCACCGGTG GTCATCGGGC GCGACCACCT GGACTGCGGT
AGCGTCGCCA GCCCGTACCG GGAGACCGAG GCGATGGCCG ACGGCTCCGA TGCGATCGCC
GACTGGCCGC TGCTCAACGC ACTGGTGAAC ACGGCCAGTG GGGCCTCGTG GGTGTCCATC
CACCATGGTG GCGGGGTCGG GATCGGCCGG TCCATCCACG CCGGCCAGGT CTGCGTCGCC
GACGGCAGCG CCCTCGCCGG GCAGAAGATC GAACGGGTGC TCACCAACGA CCCGGCGATG
GGCGTCGTGC GACACGTCGA CGCCGGCTAC GACGAGGCCC GGCAGGTCGC CGAACGGACC
GGGCTACACA TCCCGATGAC AGCGGCGTAA
 
Protein sequence
MTQPIRATRG TTRTAQGWPQ EAARRMLMNN LDPEVAERPE DLVVYGGTGK AARDWPSYRA 
LLDTLTDLRD DETMLVQSGR PVAVMRTHEW APRVLLANSN LVGDWATWPE FRRLEQLGLT
MYGQMTAGSW IYIGTQGILQ GTYETFAAVA AKRFGGSLAG TLTLTAGCGG MGGAQPLAVT
MNGGSCLIVD VDRSRLERRV REHYLDEVAD SLDDAVQRAL AARDQRRARS VGVVGNAATI
FPELLRRGVP VDVVTDQTSA HDPLSYLPEG VELTDARDYA AAKPAEFTDR ARASMARHVE
AMVGFLDAGA EVFDYGNSIR GEAQLGGYSR AFDFPGFVPA YIRPLFCAGK GPFRWAALSG
DPADIAATDR AILDLFPENE PLARWIRMAG ERVAFQGLPA RICWLGYGER DRAGVRFNEM
VAAGELSAPV VIGRDHLDCG SVASPYRETE AMADGSDAIA DWPLLNALVN TASGASWVSI
HHGGGVGIGR SIHAGQVCVA DGSALAGQKI ERVLTNDPAM GVVRHVDAGY DEARQVAERT
GLHIPMTAA