Gene Ndas_1138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1138 
Symbol 
ID9244988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1392983 
End bp1394602 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content79% 
IMG OID 
ProductATP-binding region ATPase domain protein 
Protein accessionYP_003679085 
Protein GI297560111 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.127728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGTTC ACCCGTCCGA CCGACAACGG CGGCCACCGC GCCCCGGACG ACAGGCCCTG 
TTCGCCCTGG TCGCCGTGAC CGCGCTGGCG GCCCCAGCCT GGTCCTGGGC GGTCGTCACG
GCCCACCCCG ACGCCCGACA GAGCGTGGCC CTGGCCGTCG GAGCGGCCGG AGCCGCCCTG
TGCGCCGCCG TCACCGCGGC GGCGCACCAG GCCGCCGCGG CCCGCGCCGC GCGCGAGCAC
GGCACCCGGG TCGACGCCAC CGCCCAGGCC CTCGAACGCG AGGCCGAGCA CCTGGTGGAC
CAGCTCCTGC CCGCCCTCAC CGAGGGCCTG CGCGCGGGCC GCACCGCGCG CGAGGTCCTC
GCCGAGCACC CCCAGCCCCG CCACGTCGCG CTGCACCGGC TCACCCACGC CGTCACCGCC
GAGCTCGACA CCGCGCGCGT GCGCGCCGCC GACGCCCTCG CCCGGTGCAC CGCGCTGGAG
GAGCAGATCG CCGACCTCGA CCGCGTCGGC CTGCCGCTCA TGGTCGCCCG CGTGCGCGAG
GACCGCGTCG GCGCCGCCGA GACCCTGCGG GAGGAACTGC CCCCCGTCCA CGGCGCCCTC
GCCCGCCTGC GCGCGCACAC GCTGGAGGAG CTGCTGGCCG CCGCCCGCCG TTCCGCCTCG
GCCATGGAGA CCGCCGCCGC CTCCGGCGCC CGCATCCAGG CCCACCTGAC CTCCCTGCTG
GCCAGGCTCC GCGAACTCCA GGACCGCTAC GGCGACAACC CCGGTGTCTT CGGCGACCTC
CTGGACGTCG ACCACGGGGT CTCCCGCACG GGCCGCCTCG CCGACGGCCT CGTCGTCCTG
GCCGGGGGCC GCTCGGGCCG CCGCTGGACC CGGCCCATCG TCATGGAGAG CGTCCTGCGC
GGCGCCATGG GCCGCATCAA CGCCTACCGC CGCGTGCGCC TGCACAACAC CAGCACCGCC
TCCATCGCCG GGCACGCCGC CGAGGGCGTC ATGCAGGCCC TGGCCGAACT CATGGACAAC
GCCGCCAACT TCTCCGCCCA CGGCACCGAG GTGCACGTCT ACGTCCAGGA GGAGGACACC
GGCCTGTCCG TCACCGTCGA GGACAGCGGA CTGGGCATGC GCGTGCGCGA GCGCAGGCTC
GCCGAGAGCC TGGTCACCGA GCCCCGCGAC CTGTCCACGC TGCGCGGCAC CCGCACGGGC
CTGGCGGTCG TGGGCCGCCT CGCCCACAAG CACGCGCTCG GCGTCAGCTT CCGTCCCTCG
GCCCGCGGCG GCGTGGGCGT CGTCGTCCTC GTCCCGCCGC ACCTGGTCAC CGAGTCCCAG
CCCGCCCCCG GCGGCCCGCG CGCCGGGCAC CGCCCCCGGC GCTCCGCCGG GCCCGCGCCC
GCCGCGGGAC CGGGCCCCGG TGCCGGTACG GCGGCCCCCG GGCCCGACAG CCCCGCCGCG
CCCGCCCGCG CCGCCTCCGG GCTGCCCCGG CGGCGGCGCG GCCAGACCCT CGCCGCGGCG
CTGCGGGAGG AGCCCGACCA GCTCTCCGCC GGTCCGGTCA CGCCCGGACC CGGCGGCGAC
CCCGGCACCA GGTTCGCGTC CTTCCGCAAC GCACGCCAGC CACAACGAGC CGAAGAGTAG
 
Protein sequence
MPVHPSDRQR RPPRPGRQAL FALVAVTALA APAWSWAVVT AHPDARQSVA LAVGAAGAAL 
CAAVTAAAHQ AAAARAAREH GTRVDATAQA LEREAEHLVD QLLPALTEGL RAGRTAREVL
AEHPQPRHVA LHRLTHAVTA ELDTARVRAA DALARCTALE EQIADLDRVG LPLMVARVRE
DRVGAAETLR EELPPVHGAL ARLRAHTLEE LLAAARRSAS AMETAAASGA RIQAHLTSLL
ARLRELQDRY GDNPGVFGDL LDVDHGVSRT GRLADGLVVL AGGRSGRRWT RPIVMESVLR
GAMGRINAYR RVRLHNTSTA SIAGHAAEGV MQALAELMDN AANFSAHGTE VHVYVQEEDT
GLSVTVEDSG LGMRVRERRL AESLVTEPRD LSTLRGTRTG LAVVGRLAHK HALGVSFRPS
ARGGVGVVVL VPPHLVTESQ PAPGGPRAGH RPRRSAGPAP AAGPGPGAGT AAPGPDSPAA
PARAASGLPR RRRGQTLAAA LREEPDQLSA GPVTPGPGGD PGTRFASFRN ARQPQRAEE