Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0307 |
Symbol | |
ID | 5703975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 341197 |
End bp | 342870 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641269833 |
Product | phenylacetic acid degradation protein paaN |
Protein accession | YP_001535228 |
Protein GI | 159035975 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02288] phenylacetic acid degradation protein paaN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.777048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000751962 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGAGA CCCCGCACCC CCTGTACGAC AGGCACGCTG ACACCCTCAA CCGGGCGCTG ACCGCGATCT CGCAGCGCGG GTACTGGTCC GCCTACCCGG AGTCGCCCAG CCCGCGGGTG TACGGCGAGA CCGCCGCCGC CGACGGCAAG GCGGCCTTCG AGGCGTACCT CGGTCGTGAC TTTCCCCTTG ACCAGGCCGG CGACGGGAGC ACGGTGGCGA CCGAGGTCAG CCCGTTCGGT GTTGACCTGG GAGTCCGCTA TCCGCACGCG GCGGCCGACC AACTCACCAC CGCCGCCACC ACCGCGCTGC CGTCATGGCG GGACGCCGGG CCACAGACCC GGGCGGGTGT CTGCCTGGAG ATTCTGGACC GGCTGCACCG CAACATCTTC GAGCTCGCCA ACGCGGTGCA GTTCACCAGC GGCCAGGCGT TCGTGATGGC CTTCCAGGCG GGCGGGGCGC ACGCGCTGGA CCGGGCGTTG GAGGCCCTCG CCTACGCGTA CGCGGAGATG AGCCGGCACC CGGGGACGGC GGGCTGGGAG AAGGCGGCGG GCAAGGGTGA CCCGTTGCGG ATGACCAAGA CCTTCCACGT GGTGCCACGG GGTGTCGCGT TGGTGATCGG CTGTAACACG TTCCCGACCT GGAACTCGTA CCCGGGGCTC TTCGCCTCGC TGGTCACCGG CAACCCGGTG GTCGTCAAAC CGCACCCGCG GGCGGTGCTT CCGCTCGCCG TCACGGTGCG CTATGCCCGC CAGGTGCTCG CCGAAGCCGG CTTCGATCCG AACCTGGTGC AGCTCGCCCC CGAAGCGCCG GATGAGAAGC TCGCCTCCGC CCTGGCCCGG CATCCGGCCG TGCGGATCGT CGACTTCACC GGCTCCACCG AGTACGGCGA CTGGCTGGAA GCCAACGCCC GGCAGGCGCA GGTCTACACC GAGAAGGCGG GCCTGAACAC GGTCGTCGTC GACTCCACCA ACGACTTCGT CGGGATGTGC CGCAATCTGG GTTTCACCCT GAGCCTGTAC AGCGGCCAGA TGTGCACCAC CTCGCAGAAC ATCCTGATCC CCCGCGACGG AATCGAGACC GACCAGGGGC GCAAGAGCTT CGACGAGGTT GCTGCCGGTA TCGCCGCGGC TGTGGGCAAG CTCACCGCCG ACCCGGCGCG TGGCGTCGAG CTGACCGGCG CGATCGTCAA CGACGGGGTG TTGGAGCGCC TCGACGAGGT GACCAAGGTT GGCGAGCCGG TGCTGGAGTC GCGTACCGTC CAGCACCCGG CCTTCCCCGG TGCGGTGGTA CGAACGCCGA CCATCGTCCG GCTGAATGCC GACGACACCG CCACCTACGC ACGGGAGTGG TTCGGCCCGA TCTCGTTCGT GATCGGGACC GACTCCACCG AACACAGCCT GGCGATCCTG CGGGACACGG TGGGCGAGAA GGGCGCGTTG ACCGCTGCGG TCTACTCGAC CGAGGACGCG GTGTTGGACG CGGCCGAGAC CGCGGCGATC GAGGTCGGGG TGCACCTGTC GGCGAATCTG ACCGGGGGTG TGTTCGTGAA CCAGTCGGCG GCCTTCTCGG ATTTCCACGG CAGCGGCGCC AACGCGGCGG CCAACGCGGC GCTCACCGAT GGCGCGTACG TCGCCAACCG GTTCCGCATC GTCCAGAGCC GTCGTCACGT CTGA
|
Protein sequence | MTETPHPLYD RHADTLNRAL TAISQRGYWS AYPESPSPRV YGETAAADGK AAFEAYLGRD FPLDQAGDGS TVATEVSPFG VDLGVRYPHA AADQLTTAAT TALPSWRDAG PQTRAGVCLE ILDRLHRNIF ELANAVQFTS GQAFVMAFQA GGAHALDRAL EALAYAYAEM SRHPGTAGWE KAAGKGDPLR MTKTFHVVPR GVALVIGCNT FPTWNSYPGL FASLVTGNPV VVKPHPRAVL PLAVTVRYAR QVLAEAGFDP NLVQLAPEAP DEKLASALAR HPAVRIVDFT GSTEYGDWLE ANARQAQVYT EKAGLNTVVV DSTNDFVGMC RNLGFTLSLY SGQMCTTSQN ILIPRDGIET DQGRKSFDEV AAGIAAAVGK LTADPARGVE LTGAIVNDGV LERLDEVTKV GEPVLESRTV QHPAFPGAVV RTPTIVRLNA DDTATYAREW FGPISFVIGT DSTEHSLAIL RDTVGEKGAL TAAVYSTEDA VLDAAETAAI EVGVHLSANL TGGVFVNQSA AFSDFHGSGA NAAANAALTD GAYVANRFRI VQSRRHV
|
| |