Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1138 |
Symbol | |
ID | 9244988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1392983 |
End bp | 1394602 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | ATP-binding region ATPase domain protein |
Protein accession | YP_003679085 |
Protein GI | 297560111 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.127728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGTTC ACCCGTCCGA CCGACAACGG CGGCCACCGC GCCCCGGACG ACAGGCCCTG TTCGCCCTGG TCGCCGTGAC CGCGCTGGCG GCCCCAGCCT GGTCCTGGGC GGTCGTCACG GCCCACCCCG ACGCCCGACA GAGCGTGGCC CTGGCCGTCG GAGCGGCCGG AGCCGCCCTG TGCGCCGCCG TCACCGCGGC GGCGCACCAG GCCGCCGCGG CCCGCGCCGC GCGCGAGCAC GGCACCCGGG TCGACGCCAC CGCCCAGGCC CTCGAACGCG AGGCCGAGCA CCTGGTGGAC CAGCTCCTGC CCGCCCTCAC CGAGGGCCTG CGCGCGGGCC GCACCGCGCG CGAGGTCCTC GCCGAGCACC CCCAGCCCCG CCACGTCGCG CTGCACCGGC TCACCCACGC CGTCACCGCC GAGCTCGACA CCGCGCGCGT GCGCGCCGCC GACGCCCTCG CCCGGTGCAC CGCGCTGGAG GAGCAGATCG CCGACCTCGA CCGCGTCGGC CTGCCGCTCA TGGTCGCCCG CGTGCGCGAG GACCGCGTCG GCGCCGCCGA GACCCTGCGG GAGGAACTGC CCCCCGTCCA CGGCGCCCTC GCCCGCCTGC GCGCGCACAC GCTGGAGGAG CTGCTGGCCG CCGCCCGCCG TTCCGCCTCG GCCATGGAGA CCGCCGCCGC CTCCGGCGCC CGCATCCAGG CCCACCTGAC CTCCCTGCTG GCCAGGCTCC GCGAACTCCA GGACCGCTAC GGCGACAACC CCGGTGTCTT CGGCGACCTC CTGGACGTCG ACCACGGGGT CTCCCGCACG GGCCGCCTCG CCGACGGCCT CGTCGTCCTG GCCGGGGGCC GCTCGGGCCG CCGCTGGACC CGGCCCATCG TCATGGAGAG CGTCCTGCGC GGCGCCATGG GCCGCATCAA CGCCTACCGC CGCGTGCGCC TGCACAACAC CAGCACCGCC TCCATCGCCG GGCACGCCGC CGAGGGCGTC ATGCAGGCCC TGGCCGAACT CATGGACAAC GCCGCCAACT TCTCCGCCCA CGGCACCGAG GTGCACGTCT ACGTCCAGGA GGAGGACACC GGCCTGTCCG TCACCGTCGA GGACAGCGGA CTGGGCATGC GCGTGCGCGA GCGCAGGCTC GCCGAGAGCC TGGTCACCGA GCCCCGCGAC CTGTCCACGC TGCGCGGCAC CCGCACGGGC CTGGCGGTCG TGGGCCGCCT CGCCCACAAG CACGCGCTCG GCGTCAGCTT CCGTCCCTCG GCCCGCGGCG GCGTGGGCGT CGTCGTCCTC GTCCCGCCGC ACCTGGTCAC CGAGTCCCAG CCCGCCCCCG GCGGCCCGCG CGCCGGGCAC CGCCCCCGGC GCTCCGCCGG GCCCGCGCCC GCCGCGGGAC CGGGCCCCGG TGCCGGTACG GCGGCCCCCG GGCCCGACAG CCCCGCCGCG CCCGCCCGCG CCGCCTCCGG GCTGCCCCGG CGGCGGCGCG GCCAGACCCT CGCCGCGGCG CTGCGGGAGG AGCCCGACCA GCTCTCCGCC GGTCCGGTCA CGCCCGGACC CGGCGGCGAC CCCGGCACCA GGTTCGCGTC CTTCCGCAAC GCACGCCAGC CACAACGAGC CGAAGAGTAG
|
Protein sequence | MPVHPSDRQR RPPRPGRQAL FALVAVTALA APAWSWAVVT AHPDARQSVA LAVGAAGAAL CAAVTAAAHQ AAAARAAREH GTRVDATAQA LEREAEHLVD QLLPALTEGL RAGRTAREVL AEHPQPRHVA LHRLTHAVTA ELDTARVRAA DALARCTALE EQIADLDRVG LPLMVARVRE DRVGAAETLR EELPPVHGAL ARLRAHTLEE LLAAARRSAS AMETAAASGA RIQAHLTSLL ARLRELQDRY GDNPGVFGDL LDVDHGVSRT GRLADGLVVL AGGRSGRRWT RPIVMESVLR GAMGRINAYR RVRLHNTSTA SIAGHAAEGV MQALAELMDN AANFSAHGTE VHVYVQEEDT GLSVTVEDSG LGMRVRERRL AESLVTEPRD LSTLRGTRTG LAVVGRLAHK HALGVSFRPS ARGGVGVVVL VPPHLVTESQ PAPGGPRAGH RPRRSAGPAP AAGPGPGAGT AAPGPDSPAA PARAASGLPR RRRGQTLAAA LREEPDQLSA GPVTPGPGGD PGTRFASFRN ARQPQRAEE
|
| |