Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1954 |
Symbol | |
ID | 9245804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2377887 |
End bp | 2378948 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | ATPase associated with various cellular activities AAA_3 |
Protein accession | YP_003679887 |
Protein GI | 297560913 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.25859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000487969 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGAGA GCGGAAACGG CTCAGCGCCC TCCGCGGGCC CGGACAGCGC TCCGGGTGAA CCGACCCGGC TGCTGCGCGC CGCGCTGCAC GAGGTGTCCA GGGTGATCGT GGGCCAGGAG CGCATGGTGG AGCGCTCCCT GATCGCCCTG GTCGCGGGAG GGCACTGCCT GATCGAGGGC GTACCCGGAA TCGCCAAGAC CCTGGCGGTG TCCACCCTGG CCAAGGTCAC CGGGGGTTCG TTCGCCCGTG TGCAGTTCAC CCCGGACCTG GTGCCCTCCG ACATCGTCGG CACGCGCATC TACCATCCCT CCACCGAGCA GTTCGACGTG GAGCTGGGGC CGGTCTTCGC CAACTTCGTG CTCGCCGACG AGATCAACCG GGCCCCGGCC AAGGTGCAGT CGGCCCTGTT GGAGGTCATG GCCGAGCGGC AGGTGTCCCT CGGCGGGACC ACCCACCCGC TCCCCTCCCC GTTCATCGTC ATCGCCACAC AGAACCCGGT GGAGTCCGAG GGCGTGTACC CGCTGCCCGA GGCCCAGCGC GACCGGTTCC TGATGAAGGT CAACGTCCGC CATCCGCGCG CGCACGAGGA GATGGAGATC CTGCGCCGGA TGTCGACCAC CCCGCCCACC GCGCACCAGG TGCTCGACCC CGTCACCCTG AGCGAGCTGC AGTCCGACGC CCAGCGCGTC CACGTCCACC AGCTCATCGC CGACTACGTG GTGCGGCTGG TCATGGCCAC GCGCGAGCCG GAGAACCACC AGATGCCGGA CCTGCGCCAG GTGCTGGAGA TGGGGGCCAG CCCCCGTGCC ACCCTGGGGC TGGTCTCGGC CGCCCGGGCC CTGGCCCTGC TGCGCGGGCG CGACTACGTG CTCCCCGACG ACGTGCGGGT GCTCGCCCAC GACGTCATCG CCCACCGCCT GGTGCTGACC TTCGACGCCC TGGCCGACGG GATCACCGCC GAACAGGTGG TGGACCGCAT CCTGTCGACC GTGCTGGCGC CGCGGGTGAT CTGGGACGAG CCGCCCAGCG GGGAGACCGC CTCCTTCGCC TCCGCGAGGT GA
|
Protein sequence | MAESGNGSAP SAGPDSAPGE PTRLLRAALH EVSRVIVGQE RMVERSLIAL VAGGHCLIEG VPGIAKTLAV STLAKVTGGS FARVQFTPDL VPSDIVGTRI YHPSTEQFDV ELGPVFANFV LADEINRAPA KVQSALLEVM AERQVSLGGT THPLPSPFIV IATQNPVESE GVYPLPEAQR DRFLMKVNVR HPRAHEEMEI LRRMSTTPPT AHQVLDPVTL SELQSDAQRV HVHQLIADYV VRLVMATREP ENHQMPDLRQ VLEMGASPRA TLGLVSAARA LALLRGRDYV LPDDVRVLAH DVIAHRLVLT FDALADGITA EQVVDRILST VLAPRVIWDE PPSGETASFA SAR
|
| |