Gene Ndas_1205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1205 
Symbol 
ID9245055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1464938 
End bp1467829 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAAA ATPase containing von Willebrand factor type A (vWA) domain protein 
Protein accessionYP_003679150 
Protein GI297560176 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.854661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGTCCC CCCTCTTCGA CTTCCGTGCT GCCCTCTCCG GGGAGGAGCC GGATCCGACC 
GAGACCGAGG AGGCCACCAC CCTCCAGGAG CAGACCACCA ACACGAACCG GTCGGAGTCC
GCCCCGATCA GCGTGGACCT GACCAGCGGC CTGCTCATGC TCCGGCACGC CGGCTCCTCG
GTCGAGGTGG ACACCGCCAC CCGGGAGGCG GTGCTCGACC CGGGCGACTC CGAGGTGAGC
GTCGGCGACG ACGGCGTGAT GCTGTTCGGC ACGCAGCAGA CCGGGGACAC CGGCTCCCCG
CACGTACTCG CCGACCAGGG GGTCGCGCGG GTCGTCACGG TGTCCGCGCC CGAGTCCGGG
GACGTCTCCG GACTGGAGCC CCACACGGAG AGCCGGTTCG AACCCGCGCA CCCGGGAACA
CCGGCGCTCC CGGTCGAACC GGCGGCCAGG TCCGTCGGGG TGCCCGCCAA CGCGGAGCCG
GTCCCCGAGG GCGAGACCAC GGAGCCCGTA CCGCTGGAGC CCATGCTCGT CGCGGAGCCG
CTCCAGCGCA TGGAACCGAT GCAGCCGCTG GAACCGGTGA TCCCCGCCGA GCCGGCGATC
CTCGCCGAGC CCGCCCAGCG GGTGGAGGCG GCGGAGCCCC TGGTGGGCAG GGAGGCCGAG
CAGGGTGAGC TGCTGGAGCC GCTGACCCCC GTGGACACCG AGAACACGAA GGTGGTCTCC
GGCGTGGAGC CCATCTCCGC CGACGCAGAC GACCCCGCTC TCGGCGAGAC CGGCTCCGAG
ATCCCCGGGG CTCCGGTCGT GACCCTGGAC CCCGAGACGG GGGTCACCAC CGTCGAGGCG
GGCGAGCTGA AGTTCGTCCT GGACTCGGAC GAGGGCACGA TCAGCATCGA GCCCGGAGAC
AGGAACGTTC CCCTCGACCC CGAGGCGGCA CCGGTCACCG TCGAGGCCGG ACACCTGGGC
CTGACCATCG ACCCGGCCAA GGGGACCGTC GGGATCGAGA CGGGCGGTTC CGGTGAGGGC
GGGGAGAACC TGCCGACCAC GATCGAGATC GGCGACCTGA AGCTCACCCT GGACCCCGAG
ACGGGCGAGG TCTCCCTCGA CCCCGGCAGC GGGGACGTGG AGGTCGATCC CGAGACCGGT
CTGATCACCG TGTCCGAGGA CCTCGACGGC GAGGAGGGCT CCGGCGACGA GGACAGCGAC
GACCGCCCCG GCGCAGACCA ACCCGGCCAA GACGAAGAGA ACGACCGCCC CGGCAGCGAC
GGACCCGGCC AGGACCAAGC CTCCGACCAC GACCGCCCCG GCGCAGACCA ACCCGGCCAA
GACGAGGAGA ACGACCGCCC CGGCAGCGAC GGACCCGGCC AGGACCAAGC CTCCGACCAC
GACCGCCCCG GCGCAGACCA ACCCGGCCAA GACGAAGAGA ACGACCGCCC CGGCAGCGAC
GGACCCGGCG AGGACGAGGC CTCAGAGGAC GACCAGTCCG CAGGCGACGG ACCCGGCACA
CCCGGCCAAG ACGACTCCTC CAACCACGAC CACACCGGCG GAGACCACAC CGGCCAGGAC
GAAGCCTCCG AAGACGACCA CTCCACAGGC AACGGACCCG GCACACCCGG CCAAGACGAA
GAGAACGACC GCCCCGGCAG CGACGGACCC GGCCAGGACC AGGCCTCCGA CCACGACCGC
CCCGGCGCAG ACCAACCCGG CCAAGACGAA GAGAACGACC GCCCCGGCGG TGACGGACCC
GGCCAGGACC AGGCCTCCGA CCACGACCGC CCCGGCGCAG ACCAACCCGG CCAAGACGAT
GAGCAGGACT CCCCCGAACA CGTGGAGAGC CACCGCCCCG GCGCGAACGA CCCGGGCAGC
GGCGGGACTC CGGGAGTGCC CGGGGGCACT CCCCCGCCCA CGGTCGCGGC GGGGACCCAC
CGGGACGACC ACATCACCGG CGGAGACGAC GACTCAGACA CCGCCGGGGA CGACAGCGGT
CCTGAGCACC TGGATCCGAC CGTCTCGTTC GACCCCTCGG TCGTGGACAC CTCCGAAACG
CCCGACGAGG AGACCGGGGG CGGTGAGGAC ACTTCCGGGA GCGGCGGTGA AGAGGGACCC
GGCAACGAGG AGGACACTGC CGGGGACGAA GAGGACACCG AGGGCGACGA CGAAGAGGAC
GGCGAGGACA CCGCCGGGGA CGAAGAGGAC TCCGAAGGAG ACGACGAAGA GGACGGCGAG
GACACCGAGG ACACCGAGGA CACCGAGGAC GACGAGGACG ACGAGGACGA CGAGGACGAC
GAGGACGACG ACGAGGACGA CGACGAGGAC GACGACGAGG ACGACGAGGA GGACGGCGAG
GGAGACGAGG ACGACGAGGA GGGCGACGGC GAGGACACCG GCACCGGGGA CAGCGGAAGC
GGCGACGAGG GTGCGGGCTC CGGCACTGGT GACAAGGGCA CCAAGATCGA CCTCGACCGG
ATCAGGGATT TCCGCTCGAA GATCGTGGAG CCCCTCCAGA AGGACCTCAA CATCCTGGTC
ACCGAACTCT CCGTGTACGA AGGGGTGGCC GAAAGCGGTT ACCAGCGGCT CGGAGACCCC
GGCATCCTCG AAGAGGCCGG GCGGCTCACG AGCAAGATCG ACAGTTCGAT GAGCGGGATC
TACGACTTCA TCTACGGCCT GAACCAGGAA CTGATCGAGA TCGACCGCAG GCTCTCGGAG
AACCTCATCG TCTTCGCCAA CCTGGAGGAC GAGCAGAACC TCACCGCCCA GGAAGTCCAC
AACCTGATCT ATTCGAGCCA GAGCGGTTCG GGTGGCTCGG GCGGCTCGGG CGGCTCGGGC
GGCTCGGGCG GCTCGGGTGG TTCCGGTAAC GGTAACGGTA ACGGCCAGGA CGAGGACAGC
AGCGAAACCT AG
 
Protein sequence
MESPLFDFRA ALSGEEPDPT ETEEATTLQE QTTNTNRSES APISVDLTSG LLMLRHAGSS 
VEVDTATREA VLDPGDSEVS VGDDGVMLFG TQQTGDTGSP HVLADQGVAR VVTVSAPESG
DVSGLEPHTE SRFEPAHPGT PALPVEPAAR SVGVPANAEP VPEGETTEPV PLEPMLVAEP
LQRMEPMQPL EPVIPAEPAI LAEPAQRVEA AEPLVGREAE QGELLEPLTP VDTENTKVVS
GVEPISADAD DPALGETGSE IPGAPVVTLD PETGVTTVEA GELKFVLDSD EGTISIEPGD
RNVPLDPEAA PVTVEAGHLG LTIDPAKGTV GIETGGSGEG GENLPTTIEI GDLKLTLDPE
TGEVSLDPGS GDVEVDPETG LITVSEDLDG EEGSGDEDSD DRPGADQPGQ DEENDRPGSD
GPGQDQASDH DRPGADQPGQ DEENDRPGSD GPGQDQASDH DRPGADQPGQ DEENDRPGSD
GPGEDEASED DQSAGDGPGT PGQDDSSNHD HTGGDHTGQD EASEDDHSTG NGPGTPGQDE
ENDRPGSDGP GQDQASDHDR PGADQPGQDE ENDRPGGDGP GQDQASDHDR PGADQPGQDD
EQDSPEHVES HRPGANDPGS GGTPGVPGGT PPPTVAAGTH RDDHITGGDD DSDTAGDDSG
PEHLDPTVSF DPSVVDTSET PDEETGGGED TSGSGGEEGP GNEEDTAGDE EDTEGDDEED
GEDTAGDEED SEGDDEEDGE DTEDTEDTED DEDDEDDEDD EDDDEDDDED DDEDDEEDGE
GDEDDEEGDG EDTGTGDSGS GDEGAGSGTG DKGTKIDLDR IRDFRSKIVE PLQKDLNILV
TELSVYEGVA ESGYQRLGDP GILEEAGRLT SKIDSSMSGI YDFIYGLNQE LIEIDRRLSE
NLIVFANLED EQNLTAQEVH NLIYSSQSGS GGSGGSGGSG GSGGSGGSGN GNGNGQDEDS
SET