Gene Ndas_3397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3397 
Symbol 
ID9247262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4062139 
End bp4063413 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content60% 
IMG OID 
ProductATP-dependent Clp protease, ATP-binding subunit ClpX 
Protein accessionYP_003681308 
Protein GI297562334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.828638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACGCA TCGGCGACGG CGGGGACCTG TTGAAGTGCT CGTTCTGCGG CAAGAGCCAG 
AAGCAGGTGA AGAAGCTCAT CGCCGGCCCC GGCGTGTACA TCTGCGATGA GTGCATTGAT
CTGTGCAACG AGATCATCGA AGAGGAACTC GCCGACGCCA CCGAGCTCAC CTGGGACTCG
TTGCCCAAGC CCCGTGAGAT CTACGAGTTC CTGGACTCGT ACGTGATCGG TCAGGAGCAG
GCCAAGAAGG CGTTGTCGGT GGCGGTGTAC AACCACTACA AGCGGGTGCG TTCGGAGGGG
GATCGTCCGG GCGAGGAGGA TGTGGAGATC GCCAAGTCCA ACATCTTGCT GTTGGGGCCG
ACGGGGTCGG GCAAGACGTT GTTGGCGCAG ACTCTGGCGC GGATTCTGAA CGTGCCGTTC
GCGATCGCGG ACGCGACGGC GCTGACGGAG GCGGGGTATG TGGGGGAGGA TGTGGAGAAC
ATCCTGCTCA AGTTGATCCA GGCGGCTGAC TATGACGTGA AGAAGGCCGA GACGGGGATC
ATCTACATCG ACGAGGTCGA CAAGGTCGCG CGTAAGAGTG AGAATCCCTC GATCACTCGG
GATGTGTCGG GTGAGGGGGT GCAGCAGGCG TTGTTGAAGA TCTTGGAGGG GACGACGGCG
AGTGTGCCTC CGCAGGGGGG TCGGAAGCAT CCGCATCAGG AGTTCATTCA GATCGACACG
ACGAATGTGT TGTTCATCTG TGGTGGTGCG TTCGCGGGGT TGGAGAAGTT GATTGAGTCG
CGGACGGGTC AGCAGGGGAT GGGGTTCAAC GCGGTGTTGC GTCCCAAGGG TGAGCTGGGT
GGTTCGGCGT TGTTCGGTGA GGTGATGCCG GAGGATCTGT TGAAGTTCGG GATGATTCCG
GAGTTCGTGG GTCGGTTGCC GGTGATCACG AGTGTGCATG ATCTGGATCG TGAGGCGTTG
ATTCGGATTT TGACGGAGCC GCGGAACGCG TTGGTGAAGC AGTATCAGCG GTTGTTCGAG
TTGGATGGTG TGGAGTTGGA GTTCACGCCG GATGCTTTGA ACGCGATTGC TGAGCAGGGG
ATTATTCGGG GTACGGGTGC GCGTGGTTTG CGGGCGATTA TCGAGGAGGT GTTGTTGTCG
GTGATGTATG AGGTGCCTTC TCGTGAGGAT GTGGGGCAGG TGATCATTAC GCGGGAGACC
GTGATCGACA ACGTCAACCC CACCATCGTC CCGCGTGCCC AGCTGCGCCG GGCCCGCCAG
GAGAAGTCCG CCTAG
 
Protein sequence
MARIGDGGDL LKCSFCGKSQ KQVKKLIAGP GVYICDECID LCNEIIEEEL ADATELTWDS 
LPKPREIYEF LDSYVIGQEQ AKKALSVAVY NHYKRVRSEG DRPGEEDVEI AKSNILLLGP
TGSGKTLLAQ TLARILNVPF AIADATALTE AGYVGEDVEN ILLKLIQAAD YDVKKAETGI
IYIDEVDKVA RKSENPSITR DVSGEGVQQA LLKILEGTTA SVPPQGGRKH PHQEFIQIDT
TNVLFICGGA FAGLEKLIES RTGQQGMGFN AVLRPKGELG GSALFGEVMP EDLLKFGMIP
EFVGRLPVIT SVHDLDREAL IRILTEPRNA LVKQYQRLFE LDGVELEFTP DALNAIAEQG
IIRGTGARGL RAIIEEVLLS VMYEVPSRED VGQVIITRET VIDNVNPTIV PRAQLRRARQ
EKSA