Gene Ndas_4025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4025 
Symbol 
ID9247897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4816273 
End bp4817883 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content71% 
IMG OID 
Productchaperonin GroEL 
Protein accessionYP_003681928 
Protein GI297562954 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGA TCCTGGAGTT CGAGGACGAC GCCCGTCGCG CCCTCGAACG GGGCGTCGAC 
CGCCTCGCCA ACGCCGTCAA GGTGACGCTC GGCCCGCGCG GTCGCAACGT CGTCATCGAC
AAGAAGTTCG GCGCCCCCAC CATCACGAAC GACGGCGTGA CCGTCGCCCG TGAGGTCGAG
CTGGACGACC CCTACGAGAA CCTGGGCGCC CAGCTGGTCA AGGAGGTCGC CACCAAGACC
AACGACGCGG CCGGTGACGG CACCACCACC GCGACCGTCC TGGCCCAGGC CCTCGTCCGC
GAGGGTCTGC GCAGCGTGGC CGCCGGCGCC TCCCCGATGT CCCTGAAGAA GGGCATCGAC
GCCGCCGCCG CCAAGGTGTC GGAGATCCTC CTGGAGCGCG CCCGCCCCGT CGAGGAGCGC
GCGGACATCG CCTACGTCGC CACCAACTCC GCCCAGGACG CCCAGATCGG CGACCTGATC
GCCGAGGCGT TCGACAAGGT CGGCAAGGAC GGCGTCATCA CGGTGGAGGA GGCCCCGACC
TTCGGTCTGG ACCTGGACTT CACCGAGGGC CTCCAGTTCG ACAAGGGCTA CGTCTCGCCC
TACTTCGTCA CCGACGGCGA CCGCCAGGAG GCGGTGCTGG AGGACGCGCT GATCCTGATC
AACCAGGGCA AGATCAGCAG CCTCAACGAC CTGCTGCCCG TGCTGGAGAA GGTCGTCCAG
AGCAAGAAGC CCCTGCTCAT CATCGCCGAG GACATCGACG GTGACGCCCT GGGCGCCCTG
GTGCTCAACA AGATCCGCGG CACCCTCAAC GTCGCCGCGG TCAAGGCGCC CGGCTTCGGC
GAGCGCCGCA AGGCCATGCT CCAGGACATC GCGGTCCTCA CCGGCGGCCA GGTCGTCGCC
GAGGAGGTCG GCCTGACCCT GGAGAACGTG GACCTCGACG CGCTGGGCGG CGCCCGCCGC
GTCACGATCA CCAAGGACGC CACCACCATC GTGGACGGCG CCGGGGAGCA GTCCGAGGTC
GAGGACCGCG TCCGCCAGAT CCGCAAGGAG ATCGAGGCGA GCGACTCCGA CTGGGACCGC
GAGAAGCTCC AGGAGCGCCT GGCCAAGCTC GCGGGCGGTG TCTCCGTCCT GCGCGTGGGC
GCCGCCACCG AGGTGGAGCT CAAGGAGAAG AAGCACCGCC TGGAGGACGC CATCTCGGCG
ACCCGCGCGG CCATCGAGGA GGGCATCGTC GCCGGCGGCG GCGCCTCCCT GGTGCACGCT
TCCAAGGCGC TGGACTCGGA CGACCTCGGC CTGACGGGCG ACGAGGCCAC CGGTGTGGCG
ATCGTGCGCC GCGCCCTGGT CGAGCCCGCC CGGTGGATCG CGGAGAACGC GGGCGCCGAG
GGCTACGTGG TCACCCACCG CGTGTCGGAG CTGGAGGTCG GCCACGGCTA CAACGCCGCG
ACCGGCACCT ACGGCGACCT GACCTCGCAG GGCATCCTCG ACCCGGTCAA GGTGTCCCGC
TCCGCCGTGC AGAACGCCGC CTCCATCGCG GGCATGCTGC TGACCACCGA GGTGCTGGTG
GCGGACAAGC CCGAGGACGA TGAGGACGAC GGGCACGGCC ACAGCCACTA G
 
Protein sequence
MPKILEFEDD ARRALERGVD RLANAVKVTL GPRGRNVVID KKFGAPTITN DGVTVAREVE 
LDDPYENLGA QLVKEVATKT NDAAGDGTTT ATVLAQALVR EGLRSVAAGA SPMSLKKGID
AAAAKVSEIL LERARPVEER ADIAYVATNS AQDAQIGDLI AEAFDKVGKD GVITVEEAPT
FGLDLDFTEG LQFDKGYVSP YFVTDGDRQE AVLEDALILI NQGKISSLND LLPVLEKVVQ
SKKPLLIIAE DIDGDALGAL VLNKIRGTLN VAAVKAPGFG ERRKAMLQDI AVLTGGQVVA
EEVGLTLENV DLDALGGARR VTITKDATTI VDGAGEQSEV EDRVRQIRKE IEASDSDWDR
EKLQERLAKL AGGVSVLRVG AATEVELKEK KHRLEDAISA TRAAIEEGIV AGGGASLVHA
SKALDSDDLG LTGDEATGVA IVRRALVEPA RWIAENAGAE GYVVTHRVSE LEVGHGYNAA
TGTYGDLTSQ GILDPVKVSR SAVQNAASIA GMLLTTEVLV ADKPEDDEDD GHGHSH