Gene Ndas_3592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3592 
Symbol 
ID9247461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4304188 
End bp4307226 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content72% 
IMG OID 
Producttranslation initiation factor IF-2 
Protein accessionYP_003681499 
Protein GI297562525 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.17436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGAAGG TCCGGGTATA TGAACTCGCC AAGGAGTTCG GTGTAGAGAG CAAGGCCGTT 
CTGGCCAAGC TCAACGAGAT GGGTGAATTC GTCCGTTCGG CGTCCTCGAC CATAGAGGCC
CCCGTCGTAC GACGCCTCCA GGAAGCTTTC AACAACGGTT CCGACGGCGG TCGCAAGGGT
TCGGCTCCCA GGCCCGGCCC GCGTCCCGCC GCGGACAACG GTGCCGCCCC GAAGCCGGGT
CCGGCCCCCA AGCCCGGCCC CAGGCCGAGC CCCGCGCCCA AGCCGAGCCC GCAGAGCGAG
GCCGCTCCGC GTGCGAACGC GCCCAAGCCC GGCCCCAAGG CTCCCCGCGG CGAGGCCGGC
GCCCGCCCCG GCGGCGCCCC CAAGCCCGGT GCCCCGGCGG GCCCCGGCGG CCGTTCCGAG
GGCGGTCGTC CCGAGGGTGG CCGTCCCGAG GGTGGCCGTC CCGACCGCGC CCAGCGCGGC
GGCGACCGTC CCGAGCGCGG TGAGCGCCCC GAGCGCGGCG ACCGTTCCGA GCGCGGCGGG
CCCCGTCCCG GCCCGCGCCC GTCCGGTCCC CGTCCGGGCA ACAACCCGTT CGCGTCCAAC
GCCTCCGGCA TGGGCTCCAA GTCGGCCGCC CCGCGGCCCG GCGGCGGTGC TCCGCGTCCG
GGCGGTCCCC GTCCGGGTCC GCGTCCCGGT CCGCAGGGCG GCGGCGAGCA GCGCGCCCCG
CGTCCCGGCG GTGCGGCGGG CGCTCCGCGT CCGGGCGGTC CCCGTCCGGG TCCGCGTCCC
GGTCCGCAGG GCGGCGGCGA GCAGCGCGCC CCGCGTCCCG GCGGTGCGGC GGGCGCTCCG
CGTCCGGGCG GTGCCGCGGG TCCCCCGCGT CCCGGTCCCC GGCCCGGCGG TCCGCGTCCC
AGCCCGATGA ACATGCCCTC GTCCCGTCCG GCCGCACCCG CGGGTGGCGG TCGCGGCGGT
GGCGGCGGCC GTCCCGGCGG CGGCGGTGGC CGTGGCCGCG GGGGAGCCCC CGCGGGTTCC
GGTCCGCGTC CCGGCGGTGT CGGCGGTGGC GGCGGTCGTC CGGGTGGCTT CGGTGGCCGT
CCCGGCGGCG GTGGCCGCGG TCGCGGCGGC GGTACGGCCG GTGCGTTCGG ACGCCCCGGC
GGCCGTCCCG GACGTGCCCG CAAGTCCAAG AAGCAGCGGC GCCAGGAGTT CCACGACTTC
CAGGCCCCGT CGTTCGGCGG CGTCAAGATC CCGAGCGGCA ACGACCAGAC CATCCGCCTG
TCCCGCGGCG CCTCGCTGTC GGACTTCGGC GACAAGATCG ACGTCAACCC GGCGTCGCTC
GTCCAGGTGA TGATGCACCT GGGTGAGATG GTCACCGCGA CCCAGTCCCT TCCGGACGAG
ACCCTGATGC TGCTGGGCGA GGAGCTCAAG TACAGGATCG AGGTCGTCAG CCCGGAGGAC
GAGGACCGCG AGCTGCTGGA GTCCTTCTCC ATCGAGTTCG GCGAGGACGA GGGCACCGCG
GAGGACCTGC GTCCGCGTCC GCCGGTGGTC ACCGTCATGG GTCACGTCGA CCACGGTAAG
ACCCGACTGC TGGACACCAT CCGCAAGACC AACGTCGTCA GCGGCGAGGC CGGCGGTATC
ACCCAGCACA TCGGTGCCTA CCAGGTCGCC ACCGAGGTGG ACGGCGAAGA GCGCAAGATC
ACCTTCATCG ACACCCCGGG TCACGAGGCG TTCACCGCCA TGCGTGCCCG CGGTGCCAAG
GTCACCGACA TCGCGGTGCT GGTCGTGGCC GCCGACGACG GCGTCAAGCC GCAGACGGCG
GAGGCCATCG ACCACGCCAA GGCGGCCGAG GTGCCGATCG TGGTCGCGGT CAACAAGATC
GACGTCGAGG GCGCCGACCC GCAGCGGGTC CGCGCGCAGC TCACCGAGTA CGGCCTGGTG
GCCGAGGAGT ACGGCGGCGA CGTCCAGTTC GTGGACATCT CCGCACTCAA GGGCGACAAC
ATCGACGCCC TGCTCGAGTC GATCGTGCTG ACCTCCGACG CAGCGCTCGA TCTCCAGGCC
AACCCGGAGA TGGACGCGCA GGGTCTGGCC ATCGAGGCCT ACCTGGACCG CGGTCGCGGT
TCCATGGCCA CGGTGCTGGT CCAGCGCGGC ACGCTCAACG TCGGTGACTC GATCGTCTGC
GGTGACGCCT TCGGCCGCGT CCGCGCCATG CTCGACGAGC ACGGCCAGAA CGTGCAGAGC
GCCGAGCCGT CCCGTCCGGT GCAGGTGCTG GGTCTGACCA ACGTTCCCAG CGCCGGTGAC
AACTTCCTGG TCGTCAAGGA CGACCGGGTG GCGCGGCAGA TCGCCCAGCA GCGCGAGGCG
CGCGAGCGCT TCGCCCAGCA GGCGCGTTCG GCCCGCCGGG TCACCCTCGA CAACTGGCAG
AACGCCCTGA AGGAGGGCGA GCGCTCCGAG CTGCTCCTCC TCATCAAGGG TGACATGTCC
GGTTCCGTCG AGGCGCTGGA GGAGTCCCTG CTCAAGATCG ACGCCGGTGG CGACGAGGTG
AGCATCCGGG TCATCGGGCG CGGTGTCGGT GCGATCACGC AGAACGACAT CAACCTGGCG
GCCTCCGCCG AGGCGATCAT CGTCGGCTTC AACGTTCGGC CCGAGGGCAA GAACAGCGAG
CTGGCCGACC GCATGGGCGT GGACATCCGG TACTACTCGG TGATCTACCA GGCCATCGAC
GAGGTCGAGG CCGCGGTCAA GGGTCTGCTC AAGCCGATCT ACGAAGAGGT CCAGCTCGGC
AGCGCGGAGA TCCGCGAGGT CTTCAAGGTG CCGCGGATCG GCAACATCGC CGGTTCGATC
GTCCGCAGCG GCCTGATCCG CCGCAACTCC AAGGCGCGGC TCATCCGCGA CGGGGTCGTC
ATCTCCGAGA ACCTCAACGT GGAGTCCCTG CGTCGGTTCA AGGACGACGC GACCGAGGTC
CGCGAGGGCT TCGAGTGCGG TATCGGCGTC GGCTACAACG ACCTGCGTGT CGAGGACGTC
ATCGAGACGT ACGAGATGCA GGAGAAGCCC CGCGACTGA
 
Protein sequence
MAKVRVYELA KEFGVESKAV LAKLNEMGEF VRSASSTIEA PVVRRLQEAF NNGSDGGRKG 
SAPRPGPRPA ADNGAAPKPG PAPKPGPRPS PAPKPSPQSE AAPRANAPKP GPKAPRGEAG
ARPGGAPKPG APAGPGGRSE GGRPEGGRPE GGRPDRAQRG GDRPERGERP ERGDRSERGG
PRPGPRPSGP RPGNNPFASN ASGMGSKSAA PRPGGGAPRP GGPRPGPRPG PQGGGEQRAP
RPGGAAGAPR PGGPRPGPRP GPQGGGEQRA PRPGGAAGAP RPGGAAGPPR PGPRPGGPRP
SPMNMPSSRP AAPAGGGRGG GGGRPGGGGG RGRGGAPAGS GPRPGGVGGG GGRPGGFGGR
PGGGGRGRGG GTAGAFGRPG GRPGRARKSK KQRRQEFHDF QAPSFGGVKI PSGNDQTIRL
SRGASLSDFG DKIDVNPASL VQVMMHLGEM VTATQSLPDE TLMLLGEELK YRIEVVSPED
EDRELLESFS IEFGEDEGTA EDLRPRPPVV TVMGHVDHGK TRLLDTIRKT NVVSGEAGGI
TQHIGAYQVA TEVDGEERKI TFIDTPGHEA FTAMRARGAK VTDIAVLVVA ADDGVKPQTA
EAIDHAKAAE VPIVVAVNKI DVEGADPQRV RAQLTEYGLV AEEYGGDVQF VDISALKGDN
IDALLESIVL TSDAALDLQA NPEMDAQGLA IEAYLDRGRG SMATVLVQRG TLNVGDSIVC
GDAFGRVRAM LDEHGQNVQS AEPSRPVQVL GLTNVPSAGD NFLVVKDDRV ARQIAQQREA
RERFAQQARS ARRVTLDNWQ NALKEGERSE LLLLIKGDMS GSVEALEESL LKIDAGGDEV
SIRVIGRGVG AITQNDINLA ASAEAIIVGF NVRPEGKNSE LADRMGVDIR YYSVIYQAID
EVEAAVKGLL KPIYEEVQLG SAEIREVFKV PRIGNIAGSI VRSGLIRRNS KARLIRDGVV
ISENLNVESL RRFKDDATEV REGFECGIGV GYNDLRVEDV IETYEMQEKP RD