Gene Information |
Locus tag | Ndas_3738 |
Symbol | |
ID | 9247607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4486495 |
End bp | 4489338 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | preprotein translocase, SecA subunit |
Protein accession | YP_003681642 |
Protein GI | 297562668 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
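The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions match the figures in this record but are not stated by it, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(4486495, 4489338)
print(length)                     # 2844
print(protein_length_aa(length))  # 947
```

Strand does not affect the calculation: the gene lies on the minus strand, but the length depends only on the span between the two coordinates.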
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.361446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.764216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCAGGGA TACTCGACAA GCTCCTGCGC GCGGGTGAGG GAAAGATCCT GCGCCGACTC AACAAGCTGA AGGACCAGGT CAACTCGCTC GAGGAAGACT ACGTCGACCT CACCGACGAG GAGCTGCGCG ACCTCACCGG GGAGTACAAG GAGCGCTACG AGGACGGGGA GTCCCTGGAC GACCTGCTCC CGGAGGCCTT CGCCACCGTC CGGGAGGCGG CCAAGCGCAC CCTGGGCCAG CGCCACTTCG ACGTCCAGAT CATGGGCGGC GCCGCGCTGC ACCTCGGCAA CATCGCCGAG ATGAAGACCG GTGAGGGCAA GACCCTGACC GGCACCCTCG CGGTCTACCT CAACGCGCTC GCGGGCAAGG GCGTCCACGT CATCACGACC AACGACTACC TGGCCAAGCG CGACGCCCAG AACATGGGCC GCATCTACCA CTTCCTCGGC CTCGAGGTCG GCGTCGTCGG ACCGCAGATG ACCGCGGCCG ACCGCCGCAG CGCCTACCAG GCCGACATCA CCTACGGCAC GAACAACGAG TTCGGCTTCG ACTACCTGCG CGACAACATG GCGCTCTCGC TCAAGGACAC CGTCCAGCGC GAGCACTACT TCGCCCTGGT CGACGAGGTC GACTCCATCC TCATCGACGA GGCCCGCACC CCGCTGATCA TCAGCGGCCC CTCGGAGCAG AACTCCCGGT GGTACGCCGA GTTCGCCAAG ATCGCCCCGC GCCTCAAGCG CGAGGAGGAC TACGAGGTCG ACGAGAAGAA GCGCACCGTC GGCATCACCG AGTCCGGCGT GGCCAAGGTC GAGGACTGGC TGGGCATCGA CAACCTCTAC GAGGCCGTCA ACACCCCGCT CATCAGCTTC CTCAACAACT CGCTCAAGGC CAAGGAGCTG TACAAGCGGG ACAAGGAGTA CATCGTCAAG GACGGTGAGG TCCTCATCGT CGACGAGTTC ACCGGCCGCG TCCTCGCGGG CCGCCGCTAC AACGAGGGCA TGCACCAGGC CATCGAGGCC AAGGAGCGCG TCAAGATCAA GGACGAGAAC CAGACTCTCG CCAAGGTCAC CCTCCAGAAC TACTTCCGCC TCTACGAGAA GCTCGCCGGC ATGACCGGTA CCGCCCAGAC CGAGGCGGCG GAGTTCACCC AGACCTACAA CGTCGGCGTG GTGCCCATCC CCACCAACAA GCCGATGGTC CGCCAGGACG TCAAGGACGT CGTCTACAAG CACGAGGACG CCAAGTTCCA GGCGCTCGCC GAGGACATCG CCGAGCGCCA CGAGGCCGGG CAGCCCGTCC TGGTCGGTAC CACCAGCGTC GAGAAGTCCG AGCTGCTGTC CCGGATGCTC AAGCGCGAGG GCGTGCCCCA CGAGGTCCTC AACGCCAAGA ACCACGCCCG TGAGGCCGCG ATCATCGCCC GCGCGGGCAA GCTCGGCGCG GTCACCGTCG CCACCAACAT GGCCGGTCGC GGTACCGACA TCATGCTGGG CGGCAACCCC GACTTCCTCG CCGACGAGGA GCTCCAGGGC CGCGGACTGA GCCCGCTGGA GACCCCCGAG GAGTACGAGG CCGCCTGGCC CGAGGCGCTG GAGAAGGCCA AGGCCGACTA CGAGGAGGAG CACGAGAAGG TCGTCGAGGC CGGCGGCCTG TACGTGCTCG GCACCGAGCG CCACGAGTCG CGGCGCATCG ACAACCAGCT CCGCGGCCGC TCCGGCCGCC AGGGCGACCC CGGCATCTCC CGCTTCTACC TCTCCCTCCA GGACGACCTG CTGCGCCTGT TCAACAGCAG CCGTCTGGAG 
GCGTTCATGA ACCAGCTGAA CATCCCGGAC GACCAGCCCA TCGAGTCCGG CATGGTCAGC AAGGCGATCG CCTCCGCGCA GGGCCAGGTC GAGACGCAGA ACTTCGAGAT CCGCAAGAAC GTCCTCAAGT ACGACGAGGT CCTCAACCGC CAGCGCAAGG TCATCTACGC CGAGCGCCGC AAGGTCCTGG AGGGCCAGGA CCTGCGCGAC CAGGTCATGA GCATGCTGGA GGAGGTCCTG CGCGGCTACG TCGTGGAGGA GACCGCCAGC GGCGACCCCA GCGACTGGGA CCTCGACAAG CTCTGGCGGG CGTTCAAGCA GGTCTACCCG ATCAGCTTCA CCGTCGACGA GTTCATCGAG GAGAACGGCG ACCTGCACAC GCTGACCACC GAGGTGATCG CGGACCGCGT GGTGGAGGAC GCCAACACCG CCTACGAGGC CCGCGAGGCC GAGCTGGGCG AGGAGGCCAT GCGCGAGGTC GAGCGCCGGG TCATCCTCCA GGTCATGGAC CGCAAGTGGC GTGAGCACCT CTACGAGATG GACTACCTCC AGGAGGGCAT CGGGCTGCGC GCCATGGCGC AGCGCAACCC GCTGATCGAG TTCCAGCGCG AGGGATTCGA CATGTTCCAG CAGATGCTGG AGGCCATCAA GGAGGAGTCC GTCGGCTACC TCTTCAACGT CGAGGTGCAG GTCCGCAAGA AGGAGGAGCC CGCCCTCACC ACCGCCGCCG CTGCCAAGAC CGCCGCGGCA GTCGGCGGCT CCGCCAGCGC GACCGCCGTC GCGACCGCGG TGGAGGAGGA CGAGGAGGTC CAGGACGCCG CCTCCCCGGT GGAGGAGGCC GAGCCCGCCG AGGACGTGGT CGTCCCCGGC TTCGGCGAGG GCCAGCCCAA CCGGCTCCAG TACTCCGCGC CGAGTGAGGA CGGCACCGTC GAGCGCCACA GCGAGACGGC CGACGAGTAC TCCGGCACGG CGCGCAACGC GCCCTGCCCG TGCGGCTCCG GCAAGAAGTA CAAGAAGTGC CACGGCGACC CGGCCGCCAA GTAG
|
Protein sequence | MPGILDKLLR AGEGKILRRL NKLKDQVNSL EEDYVDLTDE ELRDLTGEYK ERYEDGESLD DLLPEAFATV REAAKRTLGQ RHFDVQIMGG AALHLGNIAE MKTGEGKTLT GTLAVYLNAL AGKGVHVITT NDYLAKRDAQ NMGRIYHFLG LEVGVVGPQM TAADRRSAYQ ADITYGTNNE FGFDYLRDNM ALSLKDTVQR EHYFALVDEV DSILIDEART PLIISGPSEQ NSRWYAEFAK IAPRLKREED YEVDEKKRTV GITESGVAKV EDWLGIDNLY EAVNTPLISF LNNSLKAKEL YKRDKEYIVK DGEVLIVDEF TGRVLAGRRY NEGMHQAIEA KERVKIKDEN QTLAKVTLQN YFRLYEKLAG MTGTAQTEAA EFTQTYNVGV VPIPTNKPMV RQDVKDVVYK HEDAKFQALA EDIAERHEAG QPVLVGTTSV EKSELLSRML KREGVPHEVL NAKNHAREAA IIARAGKLGA VTVATNMAGR GTDIMLGGNP DFLADEELQG RGLSPLETPE EYEAAWPEAL EKAKADYEEE HEKVVEAGGL YVLGTERHES RRIDNQLRGR SGRQGDPGIS RFYLSLQDDL LRLFNSSRLE AFMNQLNIPD DQPIESGMVS KAIASAQGQV ETQNFEIRKN VLKYDEVLNR QRKVIYAERR KVLEGQDLRD QVMSMLEEVL RGYVVEETAS GDPSDWDLDK LWRAFKQVYP ISFTVDEFIE ENGDLHTLTT EVIADRVVED ANTAYEAREA ELGEEAMREV ERRVILQVMD RKWREHLYEM DYLQEGIGLR AMAQRNPLIE FQREGFDMFQ QMLEAIKEES VGYLFNVEVQ VRKKEEPALT TAAAAKTAAA VGGSASATAV ATAVEEDEEV QDAASPVEEA EPAEDVVVPG FGEGQPNRLQ YSAPSEDGTV ERHSETADEY SGTARNAPCP CGSGKKYKKC HGDPAAK
|
| |
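The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch, not part of the record, of how one might strip the block spacing, compute a GC fraction, and check that a gene sequence starts and ends like a complete CDS. The helper names are hypothetical. The start codon set is only a commonly used subset of the initiators allowed under translation table 11; it includes GTG, which opens this gene and is read as the initial M of the protein.

```python
# Assumed subset of table 11 initiator codons; the full table allows more.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record layout."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 with a start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence above, used only as a short sample.
head = clean_sequence("GTGCCAGGGA TACTCGACAA")
print(len(head), gc_fraction(head))  # 20 0.55

# Hypothetical toy CDS (not from the record): GTG start, TAG stop.
toy = clean_sequence("GTGCCA GGGTAG")
print(looks_like_complete_cds(toy))  # True
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would allow a comparison against the GC content reported in the record. The sample above covers only the first 20 bases and is not expected to reproduce that figure.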