Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1205 |
Symbol | |
ID | 9245055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1464938 |
End bp | 1467829 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | AAA ATPase containing von Willebrand factor type A (vWA) domain protein |
Protein accession | YP_003679150 |
Protein GI | 297560176 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.854661 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGTCCC CCCTCTTCGA CTTCCGTGCT GCCCTCTCCG GGGAGGAGCC GGATCCGACC GAGACCGAGG AGGCCACCAC CCTCCAGGAG CAGACCACCA ACACGAACCG GTCGGAGTCC GCCCCGATCA GCGTGGACCT GACCAGCGGC CTGCTCATGC TCCGGCACGC CGGCTCCTCG GTCGAGGTGG ACACCGCCAC CCGGGAGGCG GTGCTCGACC CGGGCGACTC CGAGGTGAGC GTCGGCGACG ACGGCGTGAT GCTGTTCGGC ACGCAGCAGA CCGGGGACAC CGGCTCCCCG CACGTACTCG CCGACCAGGG GGTCGCGCGG GTCGTCACGG TGTCCGCGCC CGAGTCCGGG GACGTCTCCG GACTGGAGCC CCACACGGAG AGCCGGTTCG AACCCGCGCA CCCGGGAACA CCGGCGCTCC CGGTCGAACC GGCGGCCAGG TCCGTCGGGG TGCCCGCCAA CGCGGAGCCG GTCCCCGAGG GCGAGACCAC GGAGCCCGTA CCGCTGGAGC CCATGCTCGT CGCGGAGCCG CTCCAGCGCA TGGAACCGAT GCAGCCGCTG GAACCGGTGA TCCCCGCCGA GCCGGCGATC CTCGCCGAGC CCGCCCAGCG GGTGGAGGCG GCGGAGCCCC TGGTGGGCAG GGAGGCCGAG CAGGGTGAGC TGCTGGAGCC GCTGACCCCC GTGGACACCG AGAACACGAA GGTGGTCTCC GGCGTGGAGC CCATCTCCGC CGACGCAGAC GACCCCGCTC TCGGCGAGAC CGGCTCCGAG ATCCCCGGGG CTCCGGTCGT GACCCTGGAC CCCGAGACGG GGGTCACCAC CGTCGAGGCG GGCGAGCTGA AGTTCGTCCT GGACTCGGAC GAGGGCACGA TCAGCATCGA GCCCGGAGAC AGGAACGTTC CCCTCGACCC CGAGGCGGCA CCGGTCACCG TCGAGGCCGG ACACCTGGGC CTGACCATCG ACCCGGCCAA GGGGACCGTC GGGATCGAGA CGGGCGGTTC CGGTGAGGGC GGGGAGAACC TGCCGACCAC GATCGAGATC GGCGACCTGA AGCTCACCCT GGACCCCGAG ACGGGCGAGG TCTCCCTCGA CCCCGGCAGC GGGGACGTGG AGGTCGATCC CGAGACCGGT CTGATCACCG TGTCCGAGGA CCTCGACGGC GAGGAGGGCT CCGGCGACGA GGACAGCGAC GACCGCCCCG GCGCAGACCA ACCCGGCCAA GACGAAGAGA ACGACCGCCC CGGCAGCGAC GGACCCGGCC AGGACCAAGC CTCCGACCAC GACCGCCCCG GCGCAGACCA ACCCGGCCAA GACGAGGAGA ACGACCGCCC CGGCAGCGAC GGACCCGGCC AGGACCAAGC CTCCGACCAC GACCGCCCCG GCGCAGACCA ACCCGGCCAA GACGAAGAGA ACGACCGCCC CGGCAGCGAC GGACCCGGCG AGGACGAGGC CTCAGAGGAC GACCAGTCCG CAGGCGACGG ACCCGGCACA CCCGGCCAAG ACGACTCCTC CAACCACGAC CACACCGGCG GAGACCACAC CGGCCAGGAC GAAGCCTCCG AAGACGACCA CTCCACAGGC AACGGACCCG GCACACCCGG CCAAGACGAA GAGAACGACC GCCCCGGCAG CGACGGACCC GGCCAGGACC AGGCCTCCGA CCACGACCGC CCCGGCGCAG ACCAACCCGG CCAAGACGAA GAGAACGACC GCCCCGGCGG TGACGGACCC GGCCAGGACC AGGCCTCCGA CCACGACCGC CCCGGCGCAG ACCAACCCGG CCAAGACGAT GAGCAGGACT CCCCCGAACA CGTGGAGAGC CACCGCCCCG GCGCGAACGA CCCGGGCAGC GGCGGGACTC CGGGAGTGCC CGGGGGCACT CCCCCGCCCA CGGTCGCGGC GGGGACCCAC CGGGACGACC ACATCACCGG CGGAGACGAC GACTCAGACA CCGCCGGGGA CGACAGCGGT CCTGAGCACC TGGATCCGAC CGTCTCGTTC GACCCCTCGG TCGTGGACAC CTCCGAAACG CCCGACGAGG AGACCGGGGG CGGTGAGGAC ACTTCCGGGA GCGGCGGTGA AGAGGGACCC GGCAACGAGG AGGACACTGC CGGGGACGAA GAGGACACCG AGGGCGACGA CGAAGAGGAC GGCGAGGACA CCGCCGGGGA CGAAGAGGAC TCCGAAGGAG ACGACGAAGA GGACGGCGAG GACACCGAGG ACACCGAGGA CACCGAGGAC GACGAGGACG ACGAGGACGA CGAGGACGAC GAGGACGACG ACGAGGACGA CGACGAGGAC GACGACGAGG ACGACGAGGA GGACGGCGAG GGAGACGAGG ACGACGAGGA GGGCGACGGC GAGGACACCG GCACCGGGGA CAGCGGAAGC GGCGACGAGG GTGCGGGCTC CGGCACTGGT GACAAGGGCA CCAAGATCGA CCTCGACCGG ATCAGGGATT TCCGCTCGAA GATCGTGGAG CCCCTCCAGA AGGACCTCAA CATCCTGGTC ACCGAACTCT CCGTGTACGA AGGGGTGGCC GAAAGCGGTT ACCAGCGGCT CGGAGACCCC GGCATCCTCG AAGAGGCCGG GCGGCTCACG AGCAAGATCG ACAGTTCGAT GAGCGGGATC TACGACTTCA TCTACGGCCT GAACCAGGAA CTGATCGAGA TCGACCGCAG GCTCTCGGAG AACCTCATCG TCTTCGCCAA CCTGGAGGAC GAGCAGAACC TCACCGCCCA GGAAGTCCAC AACCTGATCT ATTCGAGCCA GAGCGGTTCG GGTGGCTCGG GCGGCTCGGG CGGCTCGGGC GGCTCGGGCG GCTCGGGTGG TTCCGGTAAC GGTAACGGTA ACGGCCAGGA CGAGGACAGC AGCGAAACCT AG
|
Protein sequence | MESPLFDFRA ALSGEEPDPT ETEEATTLQE QTTNTNRSES APISVDLTSG LLMLRHAGSS VEVDTATREA VLDPGDSEVS VGDDGVMLFG TQQTGDTGSP HVLADQGVAR VVTVSAPESG DVSGLEPHTE SRFEPAHPGT PALPVEPAAR SVGVPANAEP VPEGETTEPV PLEPMLVAEP LQRMEPMQPL EPVIPAEPAI LAEPAQRVEA AEPLVGREAE QGELLEPLTP VDTENTKVVS GVEPISADAD DPALGETGSE IPGAPVVTLD PETGVTTVEA GELKFVLDSD EGTISIEPGD RNVPLDPEAA PVTVEAGHLG LTIDPAKGTV GIETGGSGEG GENLPTTIEI GDLKLTLDPE TGEVSLDPGS GDVEVDPETG LITVSEDLDG EEGSGDEDSD DRPGADQPGQ DEENDRPGSD GPGQDQASDH DRPGADQPGQ DEENDRPGSD GPGQDQASDH DRPGADQPGQ DEENDRPGSD GPGEDEASED DQSAGDGPGT PGQDDSSNHD HTGGDHTGQD EASEDDHSTG NGPGTPGQDE ENDRPGSDGP GQDQASDHDR PGADQPGQDE ENDRPGGDGP GQDQASDHDR PGADQPGQDD EQDSPEHVES HRPGANDPGS GGTPGVPGGT PPPTVAAGTH RDDHITGGDD DSDTAGDDSG PEHLDPTVSF DPSVVDTSET PDEETGGGED TSGSGGEEGP GNEEDTAGDE EDTEGDDEED GEDTAGDEED SEGDDEEDGE DTEDTEDTED DEDDEDDEDD EDDDEDDDED DDEDDEEDGE GDEDDEEGDG EDTGTGDSGS GDEGAGSGTG DKGTKIDLDR IRDFRSKIVE PLQKDLNILV TELSVYEGVA ESGYQRLGDP GILEEAGRLT SKIDSSMSGI YDFIYGLNQE LIEIDRRLSE NLIVFANLED EQNLTAQEVH NLIYSSQSGS GGSGGSGGSG GSGGSGGSGN GNGNGQDEDS SET
|
| |