Gene Ndas_5121 details

Gene Information

Locus tag: Ndas_5121
Symbol:
ID: 9249014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 273711
End bp: 275825
Gene Length: 2115 bp
Protein Length: 704 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: translation elongation factor G
Protein accession: YP_003683007
Protein GI: 297564034
COG category:
COG ID:
TIGRFAM ID:
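
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute to the protein length; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 273711
end_bp = 275825

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2115 0 704
```

Under these assumptions the computed values agree with the listed gene length of 2115 bp and protein length of 704 aa.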


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.63953
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCTA AGACTGTTCT TGACCTGGCC AAGGTCCGCA ACATCGGGAT CATGGCCCAC 
ATTGACGCGG GCAAGACCAC CACCACCGAG CGCATCCTGT TCTACACGGG TGTCAAGCAC
AAGCTCGGCG AGACGCACGA CGGCGCCTCG ACAATGGACT GGATGAAGGA AGAGCAGGAA
CGGGGTATCA CCATTACCTC GGCTGCCATC ACCACCCATT GGAACGACAA CACGATCAAC
ATCATCGACA CGCCCGGCCA CGTCGACTTC ACGGTCGAGG TCGAGCGGTC GTTGCGTGTG
CTCGACGGTG CGGTCGCGGT GTTCGACGCC AAGGAGGGCG TCGAGCCCCA GTCGGAGCAG
GTCTGGCGTC AGGCCGACCG CTACGGCGTC CCGCGTATCT GCTTCGTCAA CAAGATGGAC
AAGATCGGCG CTGAGTTCCA GCGCTGCGTG GACATGTTCC GCGAGCGTCT GGGCGCCAAC
GCCATGCCGA TCCAGCTGCC CATCGGCGCT GAGAGCGACT TCAAGGGTGT CATCGACCTC
GTGCTGATGA AGGCCTACGT CTGGAACGAC GAGGCCGCGC TCGGCGAGAT GTACGACACG
ACCGAGATCC CCGAGAGCCA CGCCGACGCC GCCCGCGAGG CCCGCGACCA GTTCATCGAG
ACGCTGGCCG AGGCCGACGA CGAGATCATG GAGATGTACC TGGAGGGCCA GGAGCCCACC
GTGGAGCAGC TCATCCCCGC GATCCGCCGC GCGACCATCG CCGGTACGGC CGTCCCGGTC
GTGTGCGGCA CCGCGTTCAA GAACAAGGGC GTGCAGCCCC TGCTCGACGC GGTCACCGCC
TACCTCCCCT CGCCCCTGGA CGTCGAGGCC ATCGAGGGCC ACGACCCCAA GGACGAGAGC
GAGGAGACCA AGCTCGTCCG CAAGCCGAGC AACGACGAGC CGCTGTCGGC CCTGGTCTTC
AAGATCGCGA GCGACCCGCA CCTGGGCAAG CTCTCCTACG TGCGCGTCTA CTCCGGCGTT
CTCCAGACGG GCACCCAGGT GCTCAACAGC CTCAAGGGCC GCAAGGAGCG CATCGGCAAG
ATCTACCGCA TGCACTCCAA CAAGCGCGAG GAGATCGCCG AGGTCGGCGC CGGCGACATC
GTCGCCGTCA TGGGCCTGAA GGACACCACG ACCGGTGAGA CCCTGTGCGA CCAGGCGAAC
CCGATCGTGC TGGAGTCCAT GACCTTCCCG GCTCCGGTCA TCGAGGTGGC CATCGAGCCC
AAGACCAAGA GCGACCAGGA GAAGCTGGGC ATCGCGATCC AGCGCCTCGC GGACGAGGAC
CCCTCGTTCC AGGTCGCCAC GGACGACCAG ACCGGTCAGA CCGTGATCTC GGGCATGGGC
GAGCTGCACC TCGAGGTGCT CGTCAACCGC ATGCGCGACG AGTTCAAGGT CGAGGCGAAC
ATCGGCAAGC CGCAGGTGGC CTACCGCGAG ACCATTCGCA AGAAGGTCGA GGGGCACACC
TACACCCACA AGAAGCAGAC CGGTGGGTCG GGCCAGTTCG CCAAGGTCAA GATCGACATC
GAGCCCCTGG AGACCGAGAG CGGCGACGCC TCGGGCTACG AGTTCGTCAA CGCCGTCACC
GGTGGCCGCA TCCCGAGGGA GTACATCCCC TCGGTCGACG CCGGCTGCCA GGAGGCCGCC
GAGCTGGGCG TGCTCGCGCA CTACCCGCTC GTCGGCGTCA AGGTGACGCT CCAGGACGGC
CAGTACCACG AGGTCGACTC CTCCGAGATG GCCTTCAAGA CCGCCGGTTC GATCGCCTTC
AAGGAGGCGG TCAAGCTGGC CAAGCCGACT CTCCTGGAGC CGGTCATGGC TGTCGAGGTC
ACCACCCCCG AGGAGTACAT GGGTGACGTG ATCGGCGACC TGAACTCCCG CCGTGGACAG
ATCCAGTCCA TGGACGAGCG TTCCGGCGTC CGTATCGTCA AGGCCCAGGT GCCCCTCTCC
GAAATGTTCG GCTACGTGGG TGACCTGCGC AGCCGCACGC AGGGTCGAGC CAACTACTCG
ATGGTGTTCG ACTCCTACGC GGAGGTTCCG TCCGCTGTCG CCCAAGAAAT TGTGGCGAAG
GTCCGCGGCG AATAG
 
Protein sequence
MAAKTVLDLA KVRNIGIMAH IDAGKTTTTE RILFYTGVKH KLGETHDGAS TMDWMKEEQE 
RGITITSAAI TTHWNDNTIN IIDTPGHVDF TVEVERSLRV LDGAVAVFDA KEGVEPQSEQ
VWRQADRYGV PRICFVNKMD KIGAEFQRCV DMFRERLGAN AMPIQLPIGA ESDFKGVIDL
VLMKAYVWND EAALGEMYDT TEIPESHADA AREARDQFIE TLAEADDEIM EMYLEGQEPT
VEQLIPAIRR ATIAGTAVPV VCGTAFKNKG VQPLLDAVTA YLPSPLDVEA IEGHDPKDES
EETKLVRKPS NDEPLSALVF KIASDPHLGK LSYVRVYSGV LQTGTQVLNS LKGRKERIGK
IYRMHSNKRE EIAEVGAGDI VAVMGLKDTT TGETLCDQAN PIVLESMTFP APVIEVAIEP
KTKSDQEKLG IAIQRLADED PSFQVATDDQ TGQTVISGMG ELHLEVLVNR MRDEFKVEAN
IGKPQVAYRE TIRKKVEGHT YTHKKQTGGS GQFAKVKIDI EPLETESGDA SGYEFVNAVT
GGRIPREYIP SVDAGCQEAA ELGVLAHYPL VGVKVTLQDG QYHEVDSSEM AFKTAGSIAF
KEAVKLAKPT LLEPVMAVEV TTPEEYMGDV IGDLNSRRGQ IQSMDERSGV RIVKAQVPLS
EMFGYVGDLR SRTQGRANYS MVFDSYAEVP SAVAQEIVAK VRGE
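
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that NCBI translation table 11 assigns the same amino acids to sense codons as the standard code (the tables differ in which codons may act as start codons), so the standard codon table is used here. The `translate` helper and its names are hypothetical, written for illustration only; it is not the tool used to produce this record, and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
# Assumption: used as a stand-in for translation table 11, whose
# sense-codon assignments match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGGCTGCTA AGACTGTTCT TGACCTGGCC AAGGTCCGCA ACATCGGGAT CATGGCCCAC"
print(translate(first_line))  # MAAKTVLDLAKVRNIGIMAH
```

The output matches the first 20 residues of the protein sequence listed above. Passing the last line of the gene sequence, `GTCCGCGGCG AATAG`, yields `VRGE`, which matches the end of the protein, with the closing `TAG` read as a stop codon.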