Gene Information |
Locus tag | Ndas_0031 |
Symbol | |
ID | 9243858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 40318 |
End bp | 42354 |
Gene Length | 2037 bp |
Protein Length | 678 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | phenylacetic acid degradation protein paaN |
Protein accession | YP_003677989 |
Protein GI | 297559015 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.300256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.45539 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGAAA CACTCCAGAG CTACGCGCGG GGAGCATGGT TCACCCCCGC GGACGAGGGC GCGCCCCTGG CCGACGCGAA CACCGGGGAG ACCGTCGCAC GGATCTCGTC CGACGGCCCC GACGTCGCGG GGATGGTCGA CCACGCGCGG ACCGTCGGCG GCCCCGCGCT GCGCGCGCTC ACCTTCCACC AGCGCGCGAA CATCCTCAAG GAGGTCGCCA AGCACCTGAC GGGGTACAAG GAGGAGTTCT ACGCCCTCTC CCACCGGACC GGCGCGACCG CACGGGACAC CGCCGTGGAC GTGGACGGCG GTTTCGGCAC GCTCTTCAGC TTCTCCAGCA AGGGCCGCCG CGAGCTGCCC AACTCCACCG TCATCCTGGA CGGGCCCCTG GAGCCGCTCG GCAGGCAGGG CACGTTCGTG GGCCAGCACG TGTACACGTC CCGACCGGGC GTGGCCGTGC AGATCAACGC CTTCAACTTC CCGGTGTGGG GCATGCTGGA GAAGTTCGCC CCCGCCTTCC TCGCGGGGAT GCCCAGCATC GTCAAGCCCG CCGGGCAGAC CGCCTACCTC ACCGTCGCGG TCGTGCGGCG CATGGTCGAG TCGGGCCTGC TGCCCGAGGG CTCCCTCCAG CTCCTCGTCG GCAGCCACCG GGGACTGCTC GACGCCCTGG GTCCGCAGGA CGTCGTGGGC TTCACGGGGT CCGCCGCCAC CGGCGCCATC CTGCGCAACC ACCCCAACGT GGTCAGCGGA GGCGTGCAGC TCAACGTCGA GGCCGACTCC CTCAACTGCT CGATCCTCGG CCCGGACGTG ACCGAGGAGG ACCCCGAGTT CGACCTGTAC GTCAAGCAGG TCGTCACCGA GATGACCGTC AAGGCCGGGC AGAAGTGCAC CGCCATCCGC CGCGTCATCG TGCCCGCGTC GATGGCGGAG GCCGTCACCG GGGCCCTGAC CGAACGCCTG GCCAGGGTGG TGGTCGGCGC GGCCGACCAC CCGGACACGC GGATGGGCGC CCTGGTCTCG CTCGACCAGC GCGAGGAGGT CCGCAAGGCG GTCAAGGCCC TGCGCGCCAC CAGCGAGCTC GTCTACGGCG ACCCGGAGAG TGTGGAGGTC GCCGGGGCCG ACGACGAGTC GGGCGCGTTC ATGTCCCCGA TCCTGCTGCG CGCCGAGCCC GGCGCCCGGG AGCCGCACGA GGTGGAGGCC TTCGGCCCCG TCAGCACCGT GATCACCTAC GACGGCGTCG CCGAGGCCGT GGAGCTGGCC GCCCGGGGCA GCGGCAGCCT GGTGGGGTCG CTCGTCACCC GCGACCCGGA CGTGGCGCGC GAGGTCGTCC TGGGGCTGGC GCCCTGGCAC GGGCGCATCC TGGTGCTCAA CCGCGACGAC GCCAAGGAGT CCACCGGCCA CGGCTCCCCG CTGCCCGTGC TGGTGCACGG CGGACCCGGC CGCGCGGGCG GCGGCGAGGA GATGGGCGGC GTGCGCGGCG TCAAGCACCA CATGCAGCGC ACCGCCGTGC AGGCCCCGCC GGACATGGTC ACCGCGATCA CCGGCCACTG GACCACCGGC TCCGAGCGGA CCGTGGGCGA CGTGCACCCG TTCCGCAAGG ACCTGTCCCA GCTGCGGATC GGCGACACGA TCCGGTCGGC GGAGCGCACC GTCACCCGGG CCGACATCGA CCACTTCGCC GAGTTCACCG GCGACACGTT CTACGCGCAC ACCGACGAGG AGGCCGCCGC CGCCAACCCG CTGTTCGGCG GGATCGTGGC GCACGGCTAC CTGGTGGTGT CACTGGCGGC GGGCCTGTTC 
GTGGACCCGG CCCCGGGCCC GGTGCTCGCC AACTTCGGCG TGGACAACCT GCGCTTCCTC ACCCCGGTGA AGGAGGACGC CACCATCCGG GTGACGCTGA CCGCCAAGCA GATCACCCCG CGCACGAACG CCGACTACGG CGAGGTGCGC TGGGACGCCC TGGTCACCGA CCAGGACGGC GAGGCCGTGG CCACCTACGA CGTGCTCACG CTGGTCGCCA AGGGCGGGGA GGGGTAG
|
Protein sequence | MPETLQSYAR GAWFTPADEG APLADANTGE TVARISSDGP DVAGMVDHAR TVGGPALRAL TFHQRANILK EVAKHLTGYK EEFYALSHRT GATARDTAVD VDGGFGTLFS FSSKGRRELP NSTVILDGPL EPLGRQGTFV GQHVYTSRPG VAVQINAFNF PVWGMLEKFA PAFLAGMPSI VKPAGQTAYL TVAVVRRMVE SGLLPEGSLQ LLVGSHRGLL DALGPQDVVG FTGSAATGAI LRNHPNVVSG GVQLNVEADS LNCSILGPDV TEEDPEFDLY VKQVVTEMTV KAGQKCTAIR RVIVPASMAE AVTGALTERL ARVVVGAADH PDTRMGALVS LDQREEVRKA VKALRATSEL VYGDPESVEV AGADDESGAF MSPILLRAEP GAREPHEVEA FGPVSTVITY DGVAEAVELA ARGSGSLVGS LVTRDPDVAR EVVLGLAPWH GRILVLNRDD AKESTGHGSP LPVLVHGGPG RAGGGEEMGG VRGVKHHMQR TAVQAPPDMV TAITGHWTTG SERTVGDVHP FRKDLSQLRI GDTIRSAERT VTRADIDHFA EFTGDTFYAH TDEEAAAANP LFGGIVAHGY LVVSLAAGLF VDPAPGPVLA NFGVDNLRFL TPVKEDATIR VTLTAKQITP RTNADYGEVR WDALVTDQDG EAVATYDVLT LVAKGGEG
|
| |
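The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database record. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the CDS span includes the stop codon (the gene sequence ends in TAG). The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record for locus tag Ndas_0031.
start_bp, end_bp = 40318, 42354

length = gene_length(start_bp, end_bp)
residues = protein_length(length)

print(length)    # 2037, matching the "Gene Length" field
print(residues)  # 678, matching the "Protein Length" field
```

Under these assumptions both computed values agree with the Gene Length (2037 bp) and Protein Length (678 aa) fields. The check applies only to unspliced genes such as this one, since a spliced gene's span would also include introns.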