Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2111 |
Symbol | |
ID | 9245961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2529911 |
End bp | 2531974 |
Gene Length | 2064 bp |
Protein Length | 687 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_003680042 |
Protein GI | 297561068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0378177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTCC TCCTCGTGGA CAACCACGAC TCCTACACCT ACAACCTCTT CCAGCTCCTG GCGCGCGTCC TCGGCCGGAC CCCGGCGGTG CTCACCCACG ACGACCCCCG CTGGGACCGG CTCGACCCGG CGGACCTGGA CGCCGTGGTC GTCTCACCGG GCCCGGGCCG CCCCCAGAAC CCCCGCGACC TGGGCCGCGT CCCCGAACTC CTGGCCCGCA CCCGGCTGCC CGTCCTGGGG GTGTGCCTGG GCCACCAGGC CATCGCCCAC CTGGCCGGGG CGGACGTGGT CTCGGCCCCC CAGCCCCGCC ACGGCCACCT GTCCCGGATC CGGCACACCG GCGCGGGCCT GTTCGCGGGC CTGCCCCAGG GCCTCACCGC CGTCCGCTAC CACTCCCTGG CCGTTCCCGA GCCCCTGCCC GAGACGCTGA CGGCCACCGC ATGGGCCGAG GACGGCGTGG TCATGGGCCT GGAGCACCGC ACGCTGCCCC GCTGGGGCGT GCAGTTCCAC CCCGAGTCGG TGGCCAGCGA GCACGGCGCG GAGCTGGTCG CGAACTTCCT GGACCTGGCC CGCCCGCACA CCGCCCTGGC GGCCGCGCCC CCGGGCCCGC CCCCGGAACG GGACGCGGCC GCCCGGGGCG GGAAGGCCCC GACGGCGGTG CCCGGAGAGC GCCCCGACGC CCCCGCGGCC GCGGGGGCTC CGACCGTCCG TGTGCGCGAA CTCCCCATGG CCGTGGACAC CGAGGCCGCC TTCGCCCGCC TGTACGGCGA CGCCGAGTAC GCGTTCTGGC TGGACAGCTC CCGTCCGGAG GGCCCCGCGC GCTTCTCCTT CCTCGGCGCC GCCACCGGCG AGGTGCTCAC CTACCGCGTC GGCGGCCCCG TCACCGTGCG CGCCGCGGAC GGCACCGAGC ACCGCGAGCC CGGCAGCGTC TTCGACGCCC TCGACCGCCG CCTCCCCCCG CCCTCTCCCT CCGGCCTGCC CTTCGACTTC ACAGGAGGGT ACGTCGGCTA CCTCGGCTAC GAGCTCAAGG CCGACTGCGG CGGCGACAGC GCCCACACCT CGTCCGAACC CGACGCGGTG TGGCTGCGCT GCGACCGGTT CGTGGCCGTC GACCACCACC GCGACCGCAC CTACGCGGTG TGCGCCGACG ACCTCCCCGG CGGCGACGCC TGGCTCGACG CGACCCTGAC CGCCCTCACC GACCTGCCGC CGCTGTCGGC ACCCGTCCCC GCCGCCCCGT CCGCCGACCT GGCCCGCCTG CTGGAGCGCC CGCGCGAGGA CTACGTGGCC GACGTCAAGG AGTGCCTGGC CGAGCTCACC GCCGGGGAGA GCTACGAGAT CTGCCTGACC AACCGGGTCC GCCTGCCCCA CGACGGGGGA GACCTGGACT CCTACCGCCG CCTGCGCGCG GCCAGCCCCG CGCCCTACGC CGCCCTGCTG CGGCTGGGGC CCACGAGCGT GCTCAGCGCC TCGCCCGAGC GCTTCTTGCG CGTGGACGGC GACCGCGCGG CCGAGAGCCG CCCCATCAAG GGCACCGCGC CGCGCCACGC CGACCCGGCC GCCGACGCGC GCGCGGCCCG GGAGCTGCGC ACCGGCGCCA AGACCCGCGC CGAGAACCTC ATGATCGTCG ACCTGCTCCG CAACGACCTG GGGCGGGTGT GCGAGGTCGG CAGCGTCGAG GTGCCCGCGT TCATGTACAC CGAGTCCTAC GCCACCGTGC ACCAGCTGGT CTCCACCGTG CGGGGCGTGC TGCGCCCGCG GGTGTCGGCC GTGGAGGCGG TCCGCGCCTG CTTCCCCGGC GGGTCGATGA CGGGCGCGCC CAAGGAGCGC ACCATGCGCA TCCTCGACCG CCTGGAGACC TCCGCGCGCG GGGTCTACTC CGGCGCGCTG GGCTACGTGT CCTCCTCCGG CGCGGCCGAC CTCAGCATCG TCATCCGCAC CGCCGTGCTC AGCGGCGGCG AGCTGACCAT CGGCGCGGGC GGAGCGGTCG TGCTGGACTC CGATCCCGAG GAGGAGTACG AGGAGATGCT GCTCAAGGCC GACGTGCCCG CCCGCGCCCG CTGA
|
Protein sequence | MRVLLVDNHD SYTYNLFQLL ARVLGRTPAV LTHDDPRWDR LDPADLDAVV VSPGPGRPQN PRDLGRVPEL LARTRLPVLG VCLGHQAIAH LAGADVVSAP QPRHGHLSRI RHTGAGLFAG LPQGLTAVRY HSLAVPEPLP ETLTATAWAE DGVVMGLEHR TLPRWGVQFH PESVASEHGA ELVANFLDLA RPHTALAAAP PGPPPERDAA ARGGKAPTAV PGERPDAPAA AGAPTVRVRE LPMAVDTEAA FARLYGDAEY AFWLDSSRPE GPARFSFLGA ATGEVLTYRV GGPVTVRAAD GTEHREPGSV FDALDRRLPP PSPSGLPFDF TGGYVGYLGY ELKADCGGDS AHTSSEPDAV WLRCDRFVAV DHHRDRTYAV CADDLPGGDA WLDATLTALT DLPPLSAPVP AAPSADLARL LERPREDYVA DVKECLAELT AGESYEICLT NRVRLPHDGG DLDSYRRLRA ASPAPYAALL RLGPTSVLSA SPERFLRVDG DRAAESRPIK GTAPRHADPA ADARAARELR TGAKTRAENL MIVDLLRNDL GRVCEVGSVE VPAFMYTESY ATVHQLVSTV RGVLRPRVSA VEAVRACFPG GSMTGAPKER TMRILDRLET SARGVYSGAL GYVSSSGAAD LSIVIRTAVL SGGELTIGAG GAVVLDSDPE EEYEEMLLKA DVPARAR
|
| |