Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4704 |
Symbol | |
ID | 5707213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5325700 |
End bp | 5326974 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641274102 |
Product | amidohydrolase |
Protein accession | YP_001539448 |
Protein GI | 159040195 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0167174 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCCGG TACGTCGGCC GGACCCGTCG GCTGTATTCG CGGTGCGCGC GGCCCGCATG TTCGACGGAT TTGAGCTACA CACCGGACAT CCACTGGTGT TCGTGAAGAA GAGTCGGATC GTGGGTATCG ACAAGTCCGG GGCCCATCCG GCGACCGAGG TGCCGGTTGT CGACCTTGGT GATGCCACAC TACTGCCGGG TCTGATCGAC ACTCACGTGC ACCTGGCCTT CGATCCCGAG GTGAGCGCCA AGCAGGAGAT CGTCACGGAC AGCGACGCGA CGATCGTGCG ACGGATGCGG CGACACGCCG GGCAGCACCT GATGGCCGGC GTCACGACCG TGCGGGACCT CGGCGATCGC GGCTATCTCA GCCTCGACGT ACGCGATTCT GCCGGCCAGG CTTCGGGTCT GTACCCGGAG ATTCTGTGCG CCGGTCCGCC AATCACCAGA CACGGCGGTC ACTGTTGGTT CCTGGGGGGA GAGGCCGACG GTGCCGACGC TATCCGGAAG GCCGTTGCGC ATCGCGTTGC ACGGGGCGTT GACACGGTGA AGATCATGGC CACCGGCGGT GCGATCACTC CCGGATGGCG TCCGGACGAG TCCCAGTACA ACGCCGAGGA GCTTCGGTGT GCCGCGGAGA CGGCGCACCG GTCCGGGGTG CCCATCACCG CACACGCACA CGGTCCGCAG GGCATCGCTG ACGCCGTTGC CGGGGGCGCG GACGGTGTCG AGCACTGCTC GTTTTTCACC AGGGATGGCA TCGAACCGGA CTGGGAACTG GTCGATGCCA TGGCTGAGGC GGGAACGTAC GTGGGCGCCA CCGAGGCATG GCTTCCGGAG GGCAAGATGC TGGCACCGCA TCTGGCTCAG CGTCTAGAAC AACGTACCCA GACCTTTGCC CGGATGCACC GCGTGGGAGT GCGCCTGGTG TGTTGCTCGG ATGCGGGAGC GGGTCCCCGT AAGCCACACG GGGTGCTGCC CCACGGCATC GTCCACCTCG GTGCGAACGG ATGGGCCAAC GTTGAGGCCC TCAGGTCGGT GACGACCCTC GCTGCGGAAG CCTGTGCACT CGCCGACCGG AAGGGACGGA TCGCGGTCGG ACACGACGCC GACCTGCTCG CCGTAGCAGG TAACCCACTC GAACGGCTGA CGGACATGTT CCGGGTCTCC GCCGTCTGGC GGGGTGGCAC CCCGGTCGAC CTGCGGACCG TCGGCAGTCG GGAGCGAAGA ACGGGGCCGG GCGGATCCAC GGTTGAGGGA TTGGGCCGTT CGTGA
|
Protein sequence | MRPVRRPDPS AVFAVRAARM FDGFELHTGH PLVFVKKSRI VGIDKSGAHP ATEVPVVDLG DATLLPGLID THVHLAFDPE VSAKQEIVTD SDATIVRRMR RHAGQHLMAG VTTVRDLGDR GYLSLDVRDS AGQASGLYPE ILCAGPPITR HGGHCWFLGG EADGADAIRK AVAHRVARGV DTVKIMATGG AITPGWRPDE SQYNAEELRC AAETAHRSGV PITAHAHGPQ GIADAVAGGA DGVEHCSFFT RDGIEPDWEL VDAMAEAGTY VGATEAWLPE GKMLAPHLAQ RLEQRTQTFA RMHRVGVRLV CCSDAGAGPR KPHGVLPHGI VHLGANGWAN VEALRSVTTL AAEACALADR KGRIAVGHDA DLLAVAGNPL ERLTDMFRVS AVWRGGTPVD LRTVGSRERR TGPGGSTVEG LGRS
|
| |