Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4006 |
Symbol | |
ID | 9247878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4791095 |
End bp | 4792675 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | GMP synthase, large subunit |
Protein accession | YP_003681909 |
Protein GI | 297562935 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.147243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.371451 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGTTG GTGCTGAGAG CACGTTCGAC ACCGTTCTCG TCGTCGACTT CGGCGCGCAG TACGCGCAGC TGATCGCACG CCGGGTCCGC GAATGCCACG TGCACAGTGA GATCGTGCCC TCGACCATGC CCGTCGAGGA GATGCTCGCC AAGAAACCCA AGGCCATCAT CCTCTCCGGC GGACCGTCGT CGGTCTACGC CGAGGGGGCC CCGAACCTGG GCCCCGAGCT GTTCGAGACC GGGGTGCCCA CCTTCGGCAT CTGCTACGGC TTCCAGGCGA TGACCCGGGC CCTGGGCGGC ACCGTCGCCA GGACCGACCT CAGCGAGTTC GGCCGCACCG AGCTGTCGGC GGTCACCGAC TCCCTGCTGT TCGCCGGGAT GCCCGCCGAG CAGCTGGTCT GGATGTCGCA CGGCGACTCG GTGGTGGAGG CCCCCGAGGG CTTCGCCACG ACCGCCAGCA CCGCGGGGGC CCCGGTCGCC GCGTTCGAGC ACACCGGCCG CAACCTCTTC GGCGTCCAGT TCCACCCCGA GGTCCTGCAC ACCGAGCACG GCCAGGAGGT GCTGCGCCGC TTCCTGTACG AGGGCGCCGG GTGCCGCCCC ACCTGGACGA TGGTCAACAT CGTCGAGGAG CAGCTGGAGC GCATCCGCGA GGACATCGGC GACAAGCGCG TCATCTGCGC GCTGAGCGGC GGCGTGGACT CCGCGGTGGC CGGCGCGCTG GTGCAGCGCG CCGTCGGCGA CCAGCTGACC TGCGTCTTCG TGGACCACGG CCTGCTGCGC AAGGGCGAGG CCGAGCAGGT GGAGAAGGAC TTCGTCGCGA TCACGGGCGC CAAGCTCAAG GTCGTGGACG CCGAGGAGCG GTTCCTGTCC GCCCTCGCGG GGGTCTCCGA CCCCGAGGAG AAGCGCAAGA TCATCGGCCG CGAGTTCATC CGCGTGTTCG AGCAGGCGGC CCGCGAGGTC GTCGCCGAGA GCGGCGAGAC CGGCGCCGAG GTCGAGTTCC TGGTGCAGGG CACCCTCTAC CCCGACGTGG TGGAGTCCGG CGGCGGTACC GGGACCGCCA ACATCAAGTC CCACCACAAC GTGGGCGGGC TGCCCGACGA CCTCCAGTTC ACGCTGGTGG AACCGCTGCG CGAGCTGTTC AAGGACGAGG TGCGCAAGGT CGGCGAGGAG CTGGGCCTGC CCGCCGAGAT GGTCTGGCGC CAGCCCTTCC CCGGCCCCGG CCTGGGCATC CGCATCATCG GCGAGGTCAC CCGCGAACGC CTGGAGATCC TGCGCGAGGC CGACGCGATC GCCCGCGAGG AGCTGACCCG CGCCGGACTC GACCGCGACA TCTGGCAGTG CCCGGTGGTG CTGCTCGCCG ACGTGCGGTC GGTGGGCGTG CAGGGCGACG GGCGCACCTA CGGCCACCCG GTCGTGCTGC GCCCGGTCAG CAGCGAGGAC GCCATGACCG CCGACTGGTC GCGCGTGCCC TACGACGTGC TGGCCAGGAT CTCCAACCGC ATCACCAACG AGGTGCGCGA GATCAACCGG GTGGCGCTGG ACGTGACCAG CAAGCCCCCG GGCACCATCG AGTGGGAGTA G
|
Protein sequence | MSVGAESTFD TVLVVDFGAQ YAQLIARRVR ECHVHSEIVP STMPVEEMLA KKPKAIILSG GPSSVYAEGA PNLGPELFET GVPTFGICYG FQAMTRALGG TVARTDLSEF GRTELSAVTD SLLFAGMPAE QLVWMSHGDS VVEAPEGFAT TASTAGAPVA AFEHTGRNLF GVQFHPEVLH TEHGQEVLRR FLYEGAGCRP TWTMVNIVEE QLERIREDIG DKRVICALSG GVDSAVAGAL VQRAVGDQLT CVFVDHGLLR KGEAEQVEKD FVAITGAKLK VVDAEERFLS ALAGVSDPEE KRKIIGREFI RVFEQAAREV VAESGETGAE VEFLVQGTLY PDVVESGGGT GTANIKSHHN VGGLPDDLQF TLVEPLRELF KDEVRKVGEE LGLPAEMVWR QPFPGPGLGI RIIGEVTRER LEILREADAI AREELTRAGL DRDIWQCPVV LLADVRSVGV QGDGRTYGHP VVLRPVSSED AMTADWSRVP YDVLARISNR ITNEVREINR VALDVTSKPP GTIEWE
|
| |