Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1191 |
Symbol | |
ID | 5704097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1344043 |
End bp | 1345122 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641270709 |
Product | amidohydrolase |
Protein accession | YP_001536090 |
Protein GI | 159036837 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.251504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0810275 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCTGC ACGTGCGCGG TGTGCTCCTG CCCGAGGACG AGGTCCGGGA CATCTGGCTG GTCGGCGATC GGGTGACCTT CGAGCCGGTG GCCGGCGCGG AGACCGTCGC CGACGGCGGC TTCCTGCTCC CCGGCCTGAC CGACGCGCAC TGCCACATCG GCATCGCCCG AGGCGGGGCG CCCGTCAGCT CGATCGACCA GGCCCGCGAC CTGGCCCGTA CCGACCGCGA CGCCGGGGTG CTCGCCATCC GCGACGCCGG TTCCCCGTAC CCGTACCCGG AACTCGACGA CGCCCCGGAC CTGCCGCGGC TGGCCCGCGC GGGTCGGCAC ATCGCCCCGC CGAAGCGGTA CCTGCGCAAC ATCGGGGTGG AGGTCGGTGC GGCCGAGGTG ACCGCTACCG TCACCGGGCA GGCCCGAGCC GGCAACGGCT GGGTCAAGCT GGTCGGCGAC TGGATCGATC GCGGCGTCGG CGACCTGGCC CCGGCCTGGG ACGCGGACAC CATGGCCGCT GCGGTCCGGG CCGCGCACGC CGCCGGGGCG CGTGCCGCGG TGCACACCTT CTCCGAGGCC GCCGTCGAGA TCATGGTGCG GGCCGGGGTC GACTCGGTGG AACACGGCAC CGGCCTGAGC CTGGACCTGA TCGACCTGAT GGCCCGGCAG GGCACCGCGC TCGTGCCCAC AATGATCAAC ATACGAACCT TCGGTGGGAT CGCCGACCAG GCGCGGGCGA AGTTCCCTGG CTACGCCGAC CACATGCTCG CCCTGCGGGA CCGCTTCCCC GACGTGGTCC GCGCCGCGTA CCAGGCCGGT GTGCCGATCT ACGTCGGCAC GGACGCCGGC GGCGGCATCG CGCACGGTCT CGCCGCCGAG GAGATGCTGT TGCTGCACGA ACGGGCCGGC ATGCCCGCCG AGGCCGTCCT GGCCGCCGCA TCCTGGCAGG CCCGCGAGTG GCTCGGGTTC CCCGGCCTGG TCGAGGGGGG CCTGGCCGAT CTGGTCGTCT ACCCGGAGGA CCCACGCCGA GACCTCCGTG TCGTCCGGTC CCCCACCCGC GTCGTCCTCC GCGGCCGCCT CGTCCGCTGA
|
Protein sequence | MVLHVRGVLL PEDEVRDIWL VGDRVTFEPV AGAETVADGG FLLPGLTDAH CHIGIARGGA PVSSIDQARD LARTDRDAGV LAIRDAGSPY PYPELDDAPD LPRLARAGRH IAPPKRYLRN IGVEVGAAEV TATVTGQARA GNGWVKLVGD WIDRGVGDLA PAWDADTMAA AVRAAHAAGA RAAVHTFSEA AVEIMVRAGV DSVEHGTGLS LDLIDLMARQ GTALVPTMIN IRTFGGIADQ ARAKFPGYAD HMLALRDRFP DVVRAAYQAG VPIYVGTDAG GGIAHGLAAE EMLLLHERAG MPAEAVLAAA SWQAREWLGF PGLVEGGLAD LVVYPEDPRR DLRVVRSPTR VVLRGRLVR
|
| |