Gene Strop_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1110 
Symbol 
ID5057557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1255767 
End bp1257416 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content72% 
IMG OID640473377 
Producturocanate hydratase 
Protein accessionYP_001157959 
Protein GI145593662 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGC CGATCCGCGC CGCGCGTGGC ACCACCCCCA CCGCCCGGGG CTGGCCCCAG 
GAAGCCGCAC GGCGGATGCT GATGAACAAC CTCGACCCGG AGGTGGCCGA ACGCCCCGAG
GACCTGGTCG TCTACGGCGG GACCGGGAAG GCCGCGCGGG ACTGGCCGTC GTACCACGCG
CTACTGAACA CCCTCACCGA CCTGCGCGAT GACGAGACGA TGCTGGTGCA GTCCGGCCGG
CCGGTCGCGG TGATGCGGAC CCACGAGTGG GCACCGCGGG TGCTGCTGGC CAACTCCAAT
CTGGTCGGCG ACTGGGCGAC CTGGCCGGAG TTCCGCCGGC TGGAGCAGCT CGGCCTGACC
ATGTACGGGC AGATGACCGC CGGATCGTGG ATCTACATCG GCACCCAGGG CATCCTTCAG
GGCACCTACG AAACGTTCGC GGCCGTCGCC GCGAAGCGGT TCGGCGGGTC GCTGGCCGGG
ACGCTGACCC TGACCGCCGG CTGCGGTGGG ATGGGCGGGG CCCAGCCGCT CGCGGTGACC
ATGAACAGCG GCGCCTGCCT GATCGTGGAT GTAGACCGGT CCCGCCTCGA ACGCCGGGTA
CGCGAGCGCT ACCTGGACGA GGTCGCCGAC TCACTCGACG ACGCGGTACA GCGGGCAACC
GCCGCGCGGA ACGAACGGCG GGCGCGCAGC ATCGGCGTGG TCGGCAACGC GGCCACCCTC
TTCCCCGAGC TGCTGCGCCG AGGCATCCCG GTGGACGTGG TCACCGACCA GACCAGCGCC
CACGACCCAC TGTCGTACCT GCCGGAGGGG GTTGAGCTGG CCGACGCCCG CGACTACGCG
GCGGCCAAGC CGGCCGAGTT CACCGACCGC GCCCGCGCGT CGATGGCCCG ACACGTCGAG
GCGATGGTCG GCTTCCTCGA CGCGGGCGCC GAGGTGTTCG ACTACGGCAA CTCGATCCGC
GGCGAGGCCC AGCTCGGTGG CTACGCGCGC GCCTTCGACT TCCCGGGATT CGTGCCCGCC
TACATCCGGC CGCTGTTCTG CGCGGGTAAG GGCCCGTTCC GGTGGGCGGC GCTCTCCGGC
GACCCGGCCG ACATCGCCGC CACCGACCGG GCCATTCTCG AACTCTTCCC GGAGAACGAG
CAGCTGGCCC GTTGGATCCG GATGGCCGGC GAACGGGTGG CGTTCCAGGG GCTGCCGGCC
CGGATCTGCT GGCTCGGCTA CGGCGAACGG GACCGGGCCG GGGTACGGTT CAACGAGATG
GTCGCCGCCG GAGAGCTCTC CGCGCCGGTG GTCATCGGCC GGGATCACCT GGACTGCGGT
AGCGTCGCCA GCCCGTACCG GGAGACCGAG GCGATGGCCG ACGGCTCCGA CGCCATCGCC
GACTGGCCGC TGCTCAACGC CCTGGTCAAC ACCGCCAGCG GGGCCTCCTG GGTCTCCCTG
CACCACGGCG GCGGGGTCGG GATCGGCCGC TCCATCCACG CCGGCCAGGT CTGCGTCGCC
GACGGCAGCG CCCTCGCCGG GCAGAAGATC GAACGAGTCC TGACCAACGA CCCGGCCATG
GGCGTCGTCC GGCACGTTGA CGCCGGCTAC GACGACGCCC GGCAGGTCGC CGAGCGGACC
GGCCTGCACA TCCCGATGGC GAAGTCGTGA
 
Protein sequence
MTQPIRAARG TTPTARGWPQ EAARRMLMNN LDPEVAERPE DLVVYGGTGK AARDWPSYHA 
LLNTLTDLRD DETMLVQSGR PVAVMRTHEW APRVLLANSN LVGDWATWPE FRRLEQLGLT
MYGQMTAGSW IYIGTQGILQ GTYETFAAVA AKRFGGSLAG TLTLTAGCGG MGGAQPLAVT
MNSGACLIVD VDRSRLERRV RERYLDEVAD SLDDAVQRAT AARNERRARS IGVVGNAATL
FPELLRRGIP VDVVTDQTSA HDPLSYLPEG VELADARDYA AAKPAEFTDR ARASMARHVE
AMVGFLDAGA EVFDYGNSIR GEAQLGGYAR AFDFPGFVPA YIRPLFCAGK GPFRWAALSG
DPADIAATDR AILELFPENE QLARWIRMAG ERVAFQGLPA RICWLGYGER DRAGVRFNEM
VAAGELSAPV VIGRDHLDCG SVASPYRETE AMADGSDAIA DWPLLNALVN TASGASWVSL
HHGGGVGIGR SIHAGQVCVA DGSALAGQKI ERVLTNDPAM GVVRHVDAGY DDARQVAERT
GLHIPMAKS