Gene Information |
Locus tag | Ndas_4359 |
Symbol | |
ID | 9248234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5193252 |
End bp | 5194547 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glutamate-1-semialdehyde-2,1-aminomutase |
Protein accession | YP_003682254 |
Protein GI | 297563280 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
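The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but the numbers agree with it) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 5193252
end_bp = 5194547
gene_length_bp = 1296
protein_length_aa = 431


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# 5194547 - 5193252 + 1 = 1296 bp
print(span_length(start_bp, end_bp) == gene_length_bp)
# 1296 / 3 = 432 codons, minus the stop codon = 431 aa
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both checks print `True` for this record. The strand field (`-`) does not affect the length calculation; it only means the gene sequence is read from the reverse complement of the replicon.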
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.803676 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTACTC AACAGACTTC AGCAGAGCTT TTCCAGCGTG CCTCCGCCGT CGTGCCGGGT GGCGTGAACT CCCCCGTCCG CGCCTTCGGC GCCGTCGGCG GCACCCCGCC CTTCTTCGTC AAGGGTGAGG GCCCCTACCT CACCGACGCC GACGGCCGCC AGTACGTCGA CCTCGTGTGC TCGTGGGGGC CGTTGATCCT CGGCCACGCG GCCCCCGCCG TCGTCGAGGC CCTCCACGGC GCGGTCGACG CCGGGACCTC CTACGGCGCG CCGACCCCCG GCGAGGTCGA GCTGGCCGAG CTGATCGTCG AGCGCACCCC GGTGGAGAAG GTCCGCCTCG TCAACTCCGG CACCGAGGCC ACCATGTCCG CGATCCGCCT CGCGCGCGGC TTCACCGGGC GCAGCAAGAT CGTCAAGTTC GCGGGCAACT ACCACGGCCA CGTCGACGCC CTCCTGGCCT CCGCCGGTTC GGGCCTGGCC ACCTTCGCCC TGCCCGACTC CCCGGGCGTG ACCGGGGCCA GCGCCGCCGA CACCCTCGTG CTGCCCTACA ACGACCCCGA GGCCGTCGAG CGAGCCTTCG CCGAGCACGG CGACGAGATC GCCTGCGTGA TCGCCGAGGC CTGCCCCGCC AACATGGGCG TCGTCGCACC CCGGGACGGG TTCAACGCCC GGATCAAGGA GATCGCGCAC GCCAACGGCG CCCTCCTCAT CCTCGACGAG GTCCTCACCG GCTTCCGCGT CAGCGCCTCG GGCTGGTTCG GCCTGGAGGG CGTCGCCCCC GACCTCATGA CCTTCGGCAA GGTCATGGGC GGCGGCCTGC CCGCCGCCGC GTTCGGCGGA CGCGCCGAGA TCATGGACCG CCTCGCGCCG AACGGTCCCG TCTACCAGGC GGGCACCCTG TCCGGGAACC CGCTTGCCAC CGCCGCCGGC CTGGCCACAC TGCGGGGGGC CACCCCCGAG GTCTACGCCC GCATCGACGA GGTCTCCGCC CGGGTGGCGG CCGAGGTCTC CAAGGCGCTC GGCGAGGCCG GGGTCGTCCA CCGGCTCCAG AACGGCGGCA ACCTCTTCAC GGTGTTCTTC ACCGGCCAGG AGGCCGTCGA CTTCGACACC GCGCGCACCA CCGACACCGC GGTCTTCTCC GCGTTCTTCC ACGCCATGCT CGACCAGGGC GTGTACCTGC CGCCCGCCGC CTTCGAGGCC TGGTTCTTCT CCGCCGCGCA CGACGACGCC GCCGTGGACC GGGTGGTCTC GGCGCTGCCC AGGGCGGCCC GCGCCGCGGC CGAGGCCCAG GGCTGA
|
Protein sequence | MGTQQTSAEL FQRASAVVPG GVNSPVRAFG AVGGTPPFFV KGEGPYLTDA DGRQYVDLVC SWGPLILGHA APAVVEALHG AVDAGTSYGA PTPGEVELAE LIVERTPVEK VRLVNSGTEA TMSAIRLARG FTGRSKIVKF AGNYHGHVDA LLASAGSGLA TFALPDSPGV TGASAADTLV LPYNDPEAVE RAFAEHGDEI ACVIAEACPA NMGVVAPRDG FNARIKEIAH ANGALLILDE VLTGFRVSAS GWFGLEGVAP DLMTFGKVMG GGLPAAAFGG RAEIMDRLAP NGPVYQAGTL SGNPLATAAG LATLRGATPE VYARIDEVSA RVAAEVSKAL GEAGVVHRLQ NGGNLFTVFF TGQEAVDFDT ARTTDTAVFS AFFHAMLDQG VYLPPAAFEA WFFSAAHDDA AVDRVVSALP RAARAAAEAQ G
|
| |
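The relationship between the gene sequence, the protein sequence, translation table 11, and the GC content field can be reproduced with a short standard-library script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, the codon assignments and start-codon set are those of NCBI genetic code 11 as commonly published, and the input is assumed to contain only A, C, G and T. Note that the gene begins with `GTG`, which table 11 accepts as a start codon and which is translated as methionine (M) in the first position.

```python
from itertools import product

# Codon table in the conventional TCAG order; table 11 shares its
# amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for genetic code 11 (bacterial, archaeal, plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS (or an in-frame prefix of one) starting at its start codon."""
    seq = clean(seq)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a start codon in table 11")
    # The initiator codon is read as methionine regardless of its usual meaning.
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop the terminal stop codon if the full CDS was supplied.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
prefix = "GTGGGTACTC AACAGACTTC AGCAGAGCTT"
print(translate_cds(prefix))  # -> MGTQQTSAEL
print(f"{gc_fraction(prefix):.0%}")
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field, and passing it to `translate_cds` should give a protein comparable to the listed one.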