Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3592 |
Symbol | |
ID | 9247461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4304188 |
End bp | 4307226 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | translation initiation factor IF-2 |
Protein accession | YP_003681499 |
Protein GI | 297562525 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.17436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGAAGG TCCGGGTATA TGAACTCGCC AAGGAGTTCG GTGTAGAGAG CAAGGCCGTT CTGGCCAAGC TCAACGAGAT GGGTGAATTC GTCCGTTCGG CGTCCTCGAC CATAGAGGCC CCCGTCGTAC GACGCCTCCA GGAAGCTTTC AACAACGGTT CCGACGGCGG TCGCAAGGGT TCGGCTCCCA GGCCCGGCCC GCGTCCCGCC GCGGACAACG GTGCCGCCCC GAAGCCGGGT CCGGCCCCCA AGCCCGGCCC CAGGCCGAGC CCCGCGCCCA AGCCGAGCCC GCAGAGCGAG GCCGCTCCGC GTGCGAACGC GCCCAAGCCC GGCCCCAAGG CTCCCCGCGG CGAGGCCGGC GCCCGCCCCG GCGGCGCCCC CAAGCCCGGT GCCCCGGCGG GCCCCGGCGG CCGTTCCGAG GGCGGTCGTC CCGAGGGTGG CCGTCCCGAG GGTGGCCGTC CCGACCGCGC CCAGCGCGGC GGCGACCGTC CCGAGCGCGG TGAGCGCCCC GAGCGCGGCG ACCGTTCCGA GCGCGGCGGG CCCCGTCCCG GCCCGCGCCC GTCCGGTCCC CGTCCGGGCA ACAACCCGTT CGCGTCCAAC GCCTCCGGCA TGGGCTCCAA GTCGGCCGCC CCGCGGCCCG GCGGCGGTGC TCCGCGTCCG GGCGGTCCCC GTCCGGGTCC GCGTCCCGGT CCGCAGGGCG GCGGCGAGCA GCGCGCCCCG CGTCCCGGCG GTGCGGCGGG CGCTCCGCGT CCGGGCGGTC CCCGTCCGGG TCCGCGTCCC GGTCCGCAGG GCGGCGGCGA GCAGCGCGCC CCGCGTCCCG GCGGTGCGGC GGGCGCTCCG CGTCCGGGCG GTGCCGCGGG TCCCCCGCGT CCCGGTCCCC GGCCCGGCGG TCCGCGTCCC AGCCCGATGA ACATGCCCTC GTCCCGTCCG GCCGCACCCG CGGGTGGCGG TCGCGGCGGT GGCGGCGGCC GTCCCGGCGG CGGCGGTGGC CGTGGCCGCG GGGGAGCCCC CGCGGGTTCC GGTCCGCGTC CCGGCGGTGT CGGCGGTGGC GGCGGTCGTC CGGGTGGCTT CGGTGGCCGT CCCGGCGGCG GTGGCCGCGG TCGCGGCGGC GGTACGGCCG GTGCGTTCGG ACGCCCCGGC GGCCGTCCCG GACGTGCCCG CAAGTCCAAG AAGCAGCGGC GCCAGGAGTT CCACGACTTC CAGGCCCCGT CGTTCGGCGG CGTCAAGATC CCGAGCGGCA ACGACCAGAC CATCCGCCTG TCCCGCGGCG CCTCGCTGTC GGACTTCGGC GACAAGATCG ACGTCAACCC GGCGTCGCTC GTCCAGGTGA TGATGCACCT GGGTGAGATG GTCACCGCGA CCCAGTCCCT TCCGGACGAG ACCCTGATGC TGCTGGGCGA GGAGCTCAAG TACAGGATCG AGGTCGTCAG CCCGGAGGAC GAGGACCGCG AGCTGCTGGA GTCCTTCTCC ATCGAGTTCG GCGAGGACGA GGGCACCGCG GAGGACCTGC GTCCGCGTCC GCCGGTGGTC ACCGTCATGG GTCACGTCGA CCACGGTAAG ACCCGACTGC TGGACACCAT CCGCAAGACC AACGTCGTCA GCGGCGAGGC CGGCGGTATC ACCCAGCACA TCGGTGCCTA CCAGGTCGCC ACCGAGGTGG ACGGCGAAGA GCGCAAGATC ACCTTCATCG ACACCCCGGG TCACGAGGCG TTCACCGCCA TGCGTGCCCG CGGTGCCAAG GTCACCGACA TCGCGGTGCT GGTCGTGGCC GCCGACGACG GCGTCAAGCC GCAGACGGCG GAGGCCATCG ACCACGCCAA GGCGGCCGAG GTGCCGATCG TGGTCGCGGT CAACAAGATC GACGTCGAGG GCGCCGACCC GCAGCGGGTC CGCGCGCAGC TCACCGAGTA CGGCCTGGTG GCCGAGGAGT ACGGCGGCGA CGTCCAGTTC GTGGACATCT CCGCACTCAA GGGCGACAAC ATCGACGCCC TGCTCGAGTC GATCGTGCTG ACCTCCGACG CAGCGCTCGA TCTCCAGGCC AACCCGGAGA TGGACGCGCA GGGTCTGGCC ATCGAGGCCT ACCTGGACCG CGGTCGCGGT TCCATGGCCA CGGTGCTGGT CCAGCGCGGC ACGCTCAACG TCGGTGACTC GATCGTCTGC GGTGACGCCT TCGGCCGCGT CCGCGCCATG CTCGACGAGC ACGGCCAGAA CGTGCAGAGC GCCGAGCCGT CCCGTCCGGT GCAGGTGCTG GGTCTGACCA ACGTTCCCAG CGCCGGTGAC AACTTCCTGG TCGTCAAGGA CGACCGGGTG GCGCGGCAGA TCGCCCAGCA GCGCGAGGCG CGCGAGCGCT TCGCCCAGCA GGCGCGTTCG GCCCGCCGGG TCACCCTCGA CAACTGGCAG AACGCCCTGA AGGAGGGCGA GCGCTCCGAG CTGCTCCTCC TCATCAAGGG TGACATGTCC GGTTCCGTCG AGGCGCTGGA GGAGTCCCTG CTCAAGATCG ACGCCGGTGG CGACGAGGTG AGCATCCGGG TCATCGGGCG CGGTGTCGGT GCGATCACGC AGAACGACAT CAACCTGGCG GCCTCCGCCG AGGCGATCAT CGTCGGCTTC AACGTTCGGC CCGAGGGCAA GAACAGCGAG CTGGCCGACC GCATGGGCGT GGACATCCGG TACTACTCGG TGATCTACCA GGCCATCGAC GAGGTCGAGG CCGCGGTCAA GGGTCTGCTC AAGCCGATCT ACGAAGAGGT CCAGCTCGGC AGCGCGGAGA TCCGCGAGGT CTTCAAGGTG CCGCGGATCG GCAACATCGC CGGTTCGATC GTCCGCAGCG GCCTGATCCG CCGCAACTCC AAGGCGCGGC TCATCCGCGA CGGGGTCGTC ATCTCCGAGA ACCTCAACGT GGAGTCCCTG CGTCGGTTCA AGGACGACGC GACCGAGGTC CGCGAGGGCT TCGAGTGCGG TATCGGCGTC GGCTACAACG ACCTGCGTGT CGAGGACGTC ATCGAGACGT ACGAGATGCA GGAGAAGCCC CGCGACTGA
|
Protein sequence | MAKVRVYELA KEFGVESKAV LAKLNEMGEF VRSASSTIEA PVVRRLQEAF NNGSDGGRKG SAPRPGPRPA ADNGAAPKPG PAPKPGPRPS PAPKPSPQSE AAPRANAPKP GPKAPRGEAG ARPGGAPKPG APAGPGGRSE GGRPEGGRPE GGRPDRAQRG GDRPERGERP ERGDRSERGG PRPGPRPSGP RPGNNPFASN ASGMGSKSAA PRPGGGAPRP GGPRPGPRPG PQGGGEQRAP RPGGAAGAPR PGGPRPGPRP GPQGGGEQRA PRPGGAAGAP RPGGAAGPPR PGPRPGGPRP SPMNMPSSRP AAPAGGGRGG GGGRPGGGGG RGRGGAPAGS GPRPGGVGGG GGRPGGFGGR PGGGGRGRGG GTAGAFGRPG GRPGRARKSK KQRRQEFHDF QAPSFGGVKI PSGNDQTIRL SRGASLSDFG DKIDVNPASL VQVMMHLGEM VTATQSLPDE TLMLLGEELK YRIEVVSPED EDRELLESFS IEFGEDEGTA EDLRPRPPVV TVMGHVDHGK TRLLDTIRKT NVVSGEAGGI TQHIGAYQVA TEVDGEERKI TFIDTPGHEA FTAMRARGAK VTDIAVLVVA ADDGVKPQTA EAIDHAKAAE VPIVVAVNKI DVEGADPQRV RAQLTEYGLV AEEYGGDVQF VDISALKGDN IDALLESIVL TSDAALDLQA NPEMDAQGLA IEAYLDRGRG SMATVLVQRG TLNVGDSIVC GDAFGRVRAM LDEHGQNVQS AEPSRPVQVL GLTNVPSAGD NFLVVKDDRV ARQIAQQREA RERFAQQARS ARRVTLDNWQ NALKEGERSE LLLLIKGDMS GSVEALEESL LKIDAGGDEV SIRVIGRGVG AITQNDINLA ASAEAIIVGF NVRPEGKNSE LADRMGVDIR YYSVIYQAID EVEAAVKGLL KPIYEEVQLG SAEIREVFKV PRIGNIAGSI VRSGLIRRNS KARLIRDGVV ISENLNVESL RRFKDDATEV REGFECGIGV GYNDLRVEDV IETYEMQEKP RD
|
| |