Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4023 |
Symbol | |
ID | 9247895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4813419 |
End bp | 4814915 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | inosine-5'-monophosphate dehydrogenase |
Protein accession | YP_003681926 |
Protein GI | 297562952 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.836954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCAGA GCCAGGAGTA CAGCGACTAC GGGGACAAAC TCCTTCCCCC CGGACTGACC TACGACGACG TCCTTCTGGT CCCGGCCTAC TCCGATCTCC AGCCGGGCGA GGTCGACACC ACCACCCGCC TGTCGCGCAA CCTCACGCTG CGGATCCCGC TGCTGTCCGC GGCGATGGAC ACCGTCACCG AGGCACGCAT GGCCGTGGCG ATGGCCCGCC AGGGCGGCGC GGGCGTCCTG CACCGCAACC TCTCCGTCGA GGACCAGGCC AGCCAGGTCG ACCTGGTCAA GCGCTCCGAG GCGGGCATGG TCACCGACCC GGTCACCTGT CAGCCCGAGG ACACCCTCGC CGAGGTCGAG CGCCTGTGCG CCCACTACCG GATCTCCGGC GTGCCCGTCA CCGACGGCGC CGGGATCCTG GTCGGCATCG TCACCAACCG GGACATGCGC TTCGAGAGCG ACCGGGGCCG CCTGGTCCGC GACGTGATGA CCACGGAGAA CCTGGTCACC GCCCCCGTGG GCGTCAGCCG GGAGCAGGCC TTCGACCTGC TGCGCAGGCA CAAGGTCGAG AAGCTGCCGC TGGTCGACGG CCAGAACCGG CTGCGCGGCC TGATCACCGT CAAGGACTTC ATCAAGAGCG AGCAGTACCC CGACGCCACC AAGGACGCAG ACGGCCGCCT CGTCGTCGGC GGCGCCGTGG GCGTGGGCGC CGAGGCGGAG GAGCGCGCCA AGCGGCTCGT GGAGGCCGGT GTCGACTTCA TCGTCGTGGA CACCGCCCAC GGCCACTCCT CGGGGCTCGC GGACATGATC GCCAAGCTCA AGGCCAACTC GCGCGCCGAC ATCGTCGCGG GCAACGTCGC CACCCGCGCC GGGGCGCAGC TGCTCATCGA CGCCGGGGCC GACGCCGTCA AGGTCGGTGT GGGCCCGGGC TCCATCTGCA CGACGCGGGT CGTGGCCGGT GTGGGCGCCC CGCAGCTCAC CGCCATCCTG GAGGCCGCCA AGGCGTGCGG CCCGGCGGGC GTCCCGCTCA TCGCCGACGG AGGGCTCCAG TACTCCGGTG AGATCGCCAA GGCCATCGCG GCCGGGGCCA GCACCGTGAT GCTCGGCAGC CTGCTCGCGG GCGTGGAGGA GAGCCCCGGT GAGCTGATCT TCATCAACGG CAAGCAGTTC AAGGCCTACC GGGGCATGGG CTCGCTCGGC GCCATGCGCG GCCGCTCCTT CTCCAAGGAC CGCTACGCGC AGGCCGACGT GGCCAGCGAG GACAAGCTGG TCCCCGAGGG CATCGAGGGC CAGGTGCCCT TCCGCGGCCC GCTTCAGGCG GTCGCCCACC AGCTCGTCGG CGGTCTGCAC CAGTCCATGT GGTACGCCGG GACCCGTACG CTGGACGAGC TGCGCGAGCG CGGCCAGCTC ATGCGCATCA CCAGCGCCGG TCTGCGCGAG AGCCACCCGC ACGACATCAA GATGACGGTC GAGGCGCCCA ACTACAACGC CCGCTGA
|
Protein sequence | MGQSQEYSDY GDKLLPPGLT YDDVLLVPAY SDLQPGEVDT TTRLSRNLTL RIPLLSAAMD TVTEARMAVA MARQGGAGVL HRNLSVEDQA SQVDLVKRSE AGMVTDPVTC QPEDTLAEVE RLCAHYRISG VPVTDGAGIL VGIVTNRDMR FESDRGRLVR DVMTTENLVT APVGVSREQA FDLLRRHKVE KLPLVDGQNR LRGLITVKDF IKSEQYPDAT KDADGRLVVG GAVGVGAEAE ERAKRLVEAG VDFIVVDTAH GHSSGLADMI AKLKANSRAD IVAGNVATRA GAQLLIDAGA DAVKVGVGPG SICTTRVVAG VGAPQLTAIL EAAKACGPAG VPLIADGGLQ YSGEIAKAIA AGASTVMLGS LLAGVEESPG ELIFINGKQF KAYRGMGSLG AMRGRSFSKD RYAQADVASE DKLVPEGIEG QVPFRGPLQA VAHQLVGGLH QSMWYAGTRT LDELRERGQL MRITSAGLRE SHPHDIKMTV EAPNYNAR
|
| |