Gene Ndas_4199 details


Gene Information

Locus tag: Ndas_4199
Symbol:
ID: 9248073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5015330
End bp: 5016961
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: chaperonin GroEL
Protein accession: YP_003682097
Protein GI: 297563123
COG category:
COG ID:
TIGRFAM ID:
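
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5015330
END_BP = 5016961
GENE_LENGTH_BP = 1632
PROTEIN_LENGTH_AA = 543


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1632
    print(protein_length(GENE_LENGTH_BP))  # 543
```

Under these assumptions, 5016961 - 5015330 + 1 gives 1632 bp, and 1632 / 3 gives 544 codons, the last of which is the stop codon, leaving 543 residues.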


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCCA AACTGATCGC GTTCGACGAG GAGGCCCGTC GCGGCCTTGA GCGCGGCATG 
AACCAGCTCG CTGACGCCGT CAAGGTCACG CTCGGCCCCA AGGGCCGCAA CGTCGTCCTG
GAGAAGAAGT GGGGCGCCCC CACGATCACC AACGACGGTG TCTCCATCGC CAAGGAGATC
GAGCTCGAGG ACCCGTGGGA GAAGATCGGG GCCGAGCTGG TCAAGGAGGT CGCCAAGAAG
ACCGACGACG TCGCGGGTGA CGGCACCACC ACCGCCACCG TGCTCGCCCA GGCGCTCGTT
CGCGAGGGCC TGCGCAACGT CGCCGCCGGC GCCAACCCGA TCAGCCTCAA GCGCGGCATC
GAGTCCGCCG TCGCGCGCAT CAACGAGGAG CTCGGCAACC TCTCCAAGGA CATCGAGACC
AAGGAGCAGA TCGCCTCCAC CGCCTCGATC TCCGCCGGCG ACCCCCAGAT CGGCGAGACC
ATCGCCGAGG CCATGGACAA GGTCGGCAAG GAAGGCGTCA TCACGGTCGA GGAGGGCCAG
ACCTTCGGGC TGGAGCTCGA GCTCGCCGAG GGCATGCGCT TCGACAAGGG CTACATCTCG
CCCTACTTCG CCACCGACCT GGAGCGCATG GAGACGGTCC TCGAGGACCC CTACATCCTC
ATCGTCAACT CCAAGATCTC GAACAACAAC GAGTTCCTGC CGGTTGTCGA GAAGGTCCTC
CAGGCCGGCC GCCCGCTGGT CGTCATCGCC GAGGACCTGG AGGGCGGCGC CCTCCAGACG
CTGGTCGTCA ACAAGATCCG CGGCACCTTC AAGTCCGTCG CCGTCAAGGC CCCGGGCTTC
GGCGACCGCC GCAAGGCCCA GCTCGGCGAC ATCGCCGTGC TGACCGGTGG CCAGGTCATC
ACCGAGGAGG TCGGCCTCAA GCTGGAGAGC ACCGAGCTCG ACATGCTCGG CCGCGCCCGC
AAGGTCGTCG TCACCAAGGA CGAGACCACC ATCGTGGACG GCGCCGGTGA CGCCACCGCG
ATCGCCGGTC GCGTGAACGA GATCCGCAAC GAGATCGAGC GCACCGACTC CGACTACGAC
CGCGAGAAGC TCCAGGAGCG TCTCGCCCGC CTGGCCGGCG GCGTGGCCGT CATCAAGGCC
GGTGCCGCCA CCGAGGTGGA GCTCAAGGAG CGCAAGCACC GCATCGAGGA CGCCGTCCGC
AACGCCAAGG CCGCGGTCGA GGAGGGCATC CTGCCCGGCG GTGGTGTCGC TCTGCTCCAG
GCCAGCGTCC CGGCCTTCGA GAAGCTGGAG CTGGAGGGCG ACGAGGCCAT CGGCGCCGAC
ATCGTGCGCC GCGCCATCGC CGAGCCGCTC AAGCAGATCG CGATCAACGC CGGCCTCGAG
GGCGGCGTCG TGGCGGAGAA GGTCAAGAAC CTGGAGCCCG GGTTCGGCCT GAACGCCGCC
ACCGGCGAGT ACACCGACCT GTTCAAGGAC GGCGTCATCG ACCCGACCAA GGTCACCCGC
TCGGCTCTGC AGAACGCGGC TTCCATCGCC GGTCTGTTCC TGACCACCGA GGCCGTCATC
GCCGAGAAGC CGGAGAAGGC CGCCGCTCCC GCCGGCGACC CGACCGGTGG CATGGGCGGC
ATGGACTTCT AG
 
Protein sequence
MAAKLIAFDE EARRGLERGM NQLADAVKVT LGPKGRNVVL EKKWGAPTIT NDGVSIAKEI 
ELEDPWEKIG AELVKEVAKK TDDVAGDGTT TATVLAQALV REGLRNVAAG ANPISLKRGI
ESAVARINEE LGNLSKDIET KEQIASTASI SAGDPQIGET IAEAMDKVGK EGVITVEEGQ
TFGLELELAE GMRFDKGYIS PYFATDLERM ETVLEDPYIL IVNSKISNNN EFLPVVEKVL
QAGRPLVVIA EDLEGGALQT LVVNKIRGTF KSVAVKAPGF GDRRKAQLGD IAVLTGGQVI
TEEVGLKLES TELDMLGRAR KVVVTKDETT IVDGAGDATA IAGRVNEIRN EIERTDSDYD
REKLQERLAR LAGGVAVIKA GAATEVELKE RKHRIEDAVR NAKAAVEEGI LPGGGVALLQ
ASVPAFEKLE LEGDEAIGAD IVRRAIAEPL KQIAINAGLE GGVVAEKVKN LEPGFGLNAA
TGEYTDLFKD GVIDPTKVTR SALQNAASIA GLFLTTEAVI AEKPEKAAAP AGDPTGGMGG
MDF
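
The gene and protein sequences above can be related to each other by translating the nucleotide sequence codon by codon. The following is a minimal sketch rather than the database's own method: it builds a codon table from the standard amino acid assignments (translation table 11 uses the same assignments for elongation and differs only in which codons may act as starts), strips the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The helper names `clean_sequence`, `gc_content` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# shared by translation table 11 for elongation).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence as displayed above.
FIRST_LINE = "ATGGCAGCCA AACTGATCGC GTTCGACGAG GAGGCCCGTC GCGGCCTTGA GCGCGGCATG"
LAST_LINE = "ATGGACTTCT AG"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MAAKLIAFDEEARRGLERGM
    print(translate(LAST_LINE))   # MDF
```

The first 60 nucleotides translate to the first 20 residues of the protein sequence, and the final 12 nucleotides (which are in frame, since the preceding 1620 bases are a multiple of three) give the C-terminal MDF followed by the TAG stop codon. Passing the full gene sequence to `gc_content` would allow the reported GC content to be compared against the sequence itself.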