Gene Information |
Locus tag | Ndas_3073 |
Symbol | |
ID | 9246929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3671495 |
End bp | 3672865 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | aminodeoxychorismate lyase |
Protein accession | YP_003680988 |
Protein GI | 297562014 |
COG category | |
COG ID | |
TIGRFAM ID | |
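The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions match the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the "Start bp" and "End bp" rows of this record.
length = gene_length_bp(3671495, 3672865)
print(length)                     # 1371
print(protein_length_aa(length))  # 456
```

Both results agree with the "Gene Length" and "Protein Length" rows.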
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.658866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.521991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGGAGG AGGCCCCGCA GGAGGACGAG GCTCCCGATG AGGAGCCCGA CGAGACCCCG GCCGAGGAGG CCCCCGAGGA GGAGCCCGAG CCCCGTCCCC GCCGCTCACG CCGCGCGTCC GCGGCCGACG ACGGTCGGCG CGGAGGCCGC CGCAGGCGCG GCCGCCGCGA GGAGCCGGAG GAGGAGCCCG AGGAGGAGGA CGAGTACGAG GAGCCCAACC TCGCCGACAT CGCCGAGGCC TACGGGGGCG GACGCAGCAG CCGCAAGAAG GCCAAGGAGC TCAAGAGGGC CCGCGCCAAC GCGGGCAAGG GCGGCAGGAA GCGCCGCCGC AGGAGCCGCG CCCTGACGAT CGTGCTGGCC CTGGTCCTGC TCCTGGTCGT GGCGGGCGGC GGCTACGCCG TCATCCGCAC CTACGTCCTG CCCGCCGACT TCGACGGCCA GGGCAGCGGC GAGACCGTGT TCGTCATCGA GCAGGGCGAC GCGGGCTCGG TCGTGGGAGA GAACCTCGCC GAGGCCGGGA TCGTGGCCAG CCCCCGCGCG TTCCTCAACG CGCTGGACGC CGTCCCCGAG GAGGAGCTCG GCTCCGGACT GGCCCCCGGC ACCTACTCCC TGGCCCAGGG CATGAGCGGC GAGGCCGCCG TGGCCGCCCT GCTCGACCCG GCCAGCCGCG TCGGCGGACG CGTCACCATC CCCGAGGGGC TGCGCACGGA CGGGATCTTC GAGAGGATCT CCGAGGCCAC CGACCTGAGC GTCGAGGAGC TGGACGCGGC CTACGCCCAG ACCGACGAAC TCGGCCTGCC CGACTACGCC ACCGAGGGGC CCGAGGGCTA CCTGTTCCCG TCCACCTACC GGTTCGACCC GGGCGCCGAC GCGCTCTCGG TGCTCAAGAC GATGGTCACC CAGCACACCC AGGTCGCCGA GGAGATCGAC CTGGAGGGCA GGGCCGAGGC GCTGGGCTAC GACGCCAACG AGGTCATGGC GATCGCGGCC ATCGTCCAGG CCGAGACCGG CACCAAGGAG GACATGCCCC TCATCTCCGC GGTCGTGCAC AACCGCCTGG AGGAGGGCAT GCAGCTCCAG ATGGACAGCA CGTGCTTCTA CGTCCTGGGT GAGGAGGGCA CCTTCCTCAA CGACGAGCAG CGCGCCTCCT GCGAGGCCGA CCCGCGCGGC TACAGCACCT ACGGCATGAC CGGGCTGCCC GCCGGGCCGT TCGTGGCCCC CGGACAGGAC GCCATCGAGG CGGCCCTGGA ACCGGCGGAC GAGGACTACC TCTACTTCGC GCTCGTCGAC CCCGAGAACG GTCACACCGG TTTCTCCACC ACCCTGGAGG AGCACAACCA GATGGTCGCC GAGAACCAGG CCGAGTGGTA G
Protein sequence | MEEEAPQEDE APDEEPDETP AEEAPEEEPE PRPRRSRRAS AADDGRRGGR RRRGRREEPE EEPEEEDEYE EPNLADIAEA YGGGRSSRKK AKELKRARAN AGKGGRKRRR RSRALTIVLA LVLLLVVAGG GYAVIRTYVL PADFDGQGSG ETVFVIEQGD AGSVVGENLA EAGIVASPRA FLNALDAVPE EELGSGLAPG TYSLAQGMSG EAAVAALLDP ASRVGGRVTI PEGLRTDGIF ERISEATDLS VEELDAAYAQ TDELGLPDYA TEGPEGYLFP STYRFDPGAD ALSVLKTMVT QHTQVAEEID LEGRAEALGY DANEVMAIAA IVQAETGTKE DMPLISAVVH NRLEEGMQLQ MDSTCFYVLG EEGTFLNDEQ RASCEADPRG YSTYGMTGLP AGPFVAPGQD AIEAALEPAD EDYLYFALVD PENGHTGFST TLEEHNQMVA ENQAEW
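The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the original record, of how one might clean such a string, compute its GC percentage, and check that it looks like a complete bacterial CDS. The helper names are hypothetical, and the start-codon set is only the commonly used subset for translation table 11 (ATG, GTG, TTG), chosen here as an assumption.

```python
# Common start codons under translation table 11 (subset, an assumption here).
START_CODONS = {"ATG", "GTG", "TTG"}
# Standard stop codons, which translation table 11 also uses.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and start/stop codons are present."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same block layout as the "Gene sequence" row.
toy = clean_sequence("GTGGAGGAGG AGTAG")
print(toy)                           # GTGGAGGAGGAGTAG
print(looks_like_complete_cds(toy))  # True
```

Applied to the full "Gene sequence" value, `gc_percent` should give a figure comparable to the "GC content" row, and the GTG start and TAG stop visible at the ends of that sequence would satisfy `looks_like_complete_cds`.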