Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1230 |
Symbol | |
ID | 9245080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1528467 |
End bp | 1531841 |
Gene Length | 3375 bp |
Protein Length | 1124 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | OmpA/MotB domain protein |
Protein accession | YP_003679175 |
Protein GI | 297560201 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.172002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00819992 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACCCAGA TCCGACGGCT GAGCGCCCTC CTGCTCGCAG CGGCGATCCT GCTCGGCATC CCCTACCTGC TCCTGTGGCA TCTGCCCTGG CCGGAGCTGC CGGACTCCTG GGCCGTGGCC GCGGCCCACC TGCGCGGCTG GCGTCTGCCG CCGGGTGTGT TCACCGCCCT GCTCATCACC GTGCTGTGGG CCCTGTGGGG TCTGTACGCG GCCGGACTGG TCGTGGAGAC CTTCGCCCGC CTGCGCGGCA TGCCCCGCAG GCTGCGCCCC CTCGGGCCGC TCCAGGTGGT GGCGGCGGCC GCCGTGGGAA CCGTGGCGGT CGCCCCGGCC ACCGCGTTCG CCGACACCGT CACCACGCAG GAGGACCGCG AGGCGGACGG GCAGCAGGAC GCCGACGCGC CCCCCGCCCC CGAGGAGGCC CCGTCCGCGC AGCCCGTCGA GCGGGTGCGC ACCGTCTCCG GTTTCGGCGT CGGCTCGGCC GAGCTCACCG AGCAGATGCG CGACGACCTC GCACCGGTCG CGGAGATGAT CGACTCCTAC GGCGACACCG AGACGCCCGT GCGCATCACC GGCCACACCG ACCCCAGCGG GAACGCCCAG ACCAACCAGG AGCTGTCCGA GCGCCGGGCC CAGGCGGTCG CCGACCACCT CGACGAGGTC CTGGGCGAGG AGGCCCCGGA GATGGAGGTG GAGGGCGTCG GCTCCGAGCA GCCCCGCGAG GGCGGCGCCG CCGCCCAGCG CAGGGCCGAG GTCGCCTACA CCGTCGTTCC GCAGCCGACC TCCCTCCAGT CGCGTGTGAT GGCCGCCGAG ACGCCGGACC CCGGTGCCGG TGCCGGGGAG ACCGCTGCCC CCGTCGAGGA GGACGACCCC GTGGAGGCGG CGGCCGCTGC CGCCGCCGAG GCCGCCGGGG ACGGCGAGAA CGTGCTGGTC GTGGAGATCC CCGACGGCGC CGTCACCGGA GCGGTGGCGT TCGCGGGCCT GGCCGGGGGA TACCTGCTGG GCAAGCGGGG CGTCCGGATG CCCGGCGTCA GCCTGAGCCT GCCCCGCCTG CCGAGGGCGC CCCGCCGCCT GGCCCTGACC GCCCCGCCGC CCCGCCCCAC CCCCGGGGAC GAGATCGACG AGCGGGTGAC GGTGGAACTC GACCACGTGC CCGGCCTCGG CATCACCGGG CCGGGTGCGG CGGGGGCCGC CCGGCGCCTC ATCGTCAACG CGCTCGACCG CCTCAACGGC GACGCCGTGC GCGTGCTGCT CACCGAGGCC GAGGCCGCCC GCCTGGTGGG CGACCGGGGC CGCGACCTGC TGCGCGCGCA TCCCTGCGAG CCCGTGCGGA TGGTCGGCAC GATGGAGGAG GCCCTGACCG TCCTCCAGCG CGAGCTGCAC CAGATCGTCG AGGAACCGGG GGAGCGGCCG CACACGCCCC TGGCCCTGGT GACCTCGCCC ACCTCCGAGC ACGAGACCGC GCTGTCGGGT CTGCTGCTGC ACGGGCAGCA GCAGGGCATC ACGGCCGTGG TCCTGGGCCG CTGGCCGCTC GGGGGCAGCT GCGTCATCGA GGAGGACGGT CTCATCACCG AGACCAGCCC TCCGCTCAAC CCGGTCTTCC ACTGCTCCTG GCCCGGCGCG ACCGCCGAGG AGGTCATGGA CGCCGTCCGC GCCTACCGCC ACTCCTCGCC CGCCCTGGAG GAGACCGGTG ACCGGGTGCC GAGCCCGGAG GCCGCCGCGG AGACCGTCCC GGACGTCGCG CTGGAGGAGG CCGGTTCCTT CGCGGAGCCG TTCCTCGGCG CCCTGGGGGA GCAGACCGCT GTTGAGACGG CGGAGGCGAA CGAGCCGGTA GAGACGGCTG AGTCGACCGA GTCGACCGAG TCGACCGAGT CAGTCGATGC GGTTGATGCC GCTGTGGGCG AGAAGGACCA GAAGAGGGCC GAGGCGGCCG CGAAGCCCAG GAGGACCAAG GGGATCAGGA AGGCCAAGGC GGCCGACGCC TCGGAGGAGG GCTTCTGGGA CGCCGACGTC TGGAACACCG ACTTCTGGGA GACCGACGCG CCCGCGGCCG GGAAGGCCGA CACCGACCGG TCCGAGACCG AGTCGGCCGA GGCCGCGCCG TCCGTGCCCG AGCGGCCAGG TGCACTGTGG TCCGAGGCCG AGCCGGACGA CACGGCCCGC TTCGGGGCCG CGTCGGAGGT CTCCACGCGG CCGGACACCC GGTCGGACGG TTCCGTCATC GGGCGGCCCG CCGTGGAGGA GAGCACGACG CGGCCTTCCT CGTCCGCACC CTCCGTTTCC GAGGAGGCGG TTTCCGGATC GTCCGCGGAC GCGGAACCCG CGGGCACCGG AGCGGACCGC TCCGCCCCCG AGCCGTCCGC CGCCGAGGAG GCCGAGCCCC GGCCGTCCGC GTTCGAACAG CTCGCCGCCG AGGAACAGGT GTCCCGGGAG ACCGTGACGG GACCGTCCGA GTCCGCGCGG CCCACCGCCT CCGCACCCGC TCCGGAGAAG GCCGCCGCCG AGGAGTCCGC GCCCGAGCAG AGTGCGTCGG AGCGGTCCGC CCGGGAGGAG GGCGCCCCCG TGGCCTCCCC GTCCGAACGG CCCTCCGCCG AACAGTACGC CGCCGAGCTG TCCGCGCCCG CGCCGTCCGC GACCCGGGAA CAGCGCGCGC CCGAGGAGAC CGCGTCCGAG CCCGCCGAGC CCCGGGAGAA CGCGTCCGCC CAGGTCACCG CTCCGGCACG GACGGAACAG CCGAAGGAGA ACGCACCCGC GGAGCCCCCA GCCCAGCGCC CCACGTCCGA GGAGGCCGTC CCCGCCAGGC CCGCCGCCGA GCGGCGTGAG CCGGAGCCCG AGCCCGCCAG GCCCGCCCCC GGGCGTCCGA GCACCGGCAC GGTTCCGTCG GTGGTCGCCT CTCCAGCCCG CGCCCAACGC CCGTCCCAGC AGACGCGCAC GGCGCCCGCC CAGCAGGAGA CGGCCTCCGA GGAAGAGACC GCCCCCGTCC GCAAGGCCGC CAAACGCGCC GTGCGCTCCC GCCTCCCGAA GCCGAGCGGG ACCGCGTCCG CTCCGGCGGC CCAGGCGTCC CCGTCCGCCC CGGCGGCGCG GTCCGAGCAG CTCCCGCTCC CGGACCAGCC CGCCCGCCCC GAGCGGGCCC GCGCCCGCGC CCAGAACGCG GCCGCGTCCG GCGCCGAGCC CCTCACCAGC CGCGCGGAGC GCATGGAGAG GCGCACCAGG GCCGAACGCA GGAACGCCAC CGCCCCCGCC GGAGCCGTCG CCGAGCACCC CGTGCCGGAC ACCGAGGGCG GGGAGTCGGA CGGGTCCCAG CCGTCCCGGC CCCTGCTCCG CAAGCCCAAG AAGGCGGGAC GCGGAAGGAG CTGGCGCCCC AGGGACAACT CCTGA
|
Protein sequence | MTQIRRLSAL LLAAAILLGI PYLLLWHLPW PELPDSWAVA AAHLRGWRLP PGVFTALLIT VLWALWGLYA AGLVVETFAR LRGMPRRLRP LGPLQVVAAA AVGTVAVAPA TAFADTVTTQ EDREADGQQD ADAPPAPEEA PSAQPVERVR TVSGFGVGSA ELTEQMRDDL APVAEMIDSY GDTETPVRIT GHTDPSGNAQ TNQELSERRA QAVADHLDEV LGEEAPEMEV EGVGSEQPRE GGAAAQRRAE VAYTVVPQPT SLQSRVMAAE TPDPGAGAGE TAAPVEEDDP VEAAAAAAAE AAGDGENVLV VEIPDGAVTG AVAFAGLAGG YLLGKRGVRM PGVSLSLPRL PRAPRRLALT APPPRPTPGD EIDERVTVEL DHVPGLGITG PGAAGAARRL IVNALDRLNG DAVRVLLTEA EAARLVGDRG RDLLRAHPCE PVRMVGTMEE ALTVLQRELH QIVEEPGERP HTPLALVTSP TSEHETALSG LLLHGQQQGI TAVVLGRWPL GGSCVIEEDG LITETSPPLN PVFHCSWPGA TAEEVMDAVR AYRHSSPALE ETGDRVPSPE AAAETVPDVA LEEAGSFAEP FLGALGEQTA VETAEANEPV ETAESTESTE STESVDAVDA AVGEKDQKRA EAAAKPRRTK GIRKAKAADA SEEGFWDADV WNTDFWETDA PAAGKADTDR SETESAEAAP SVPERPGALW SEAEPDDTAR FGAASEVSTR PDTRSDGSVI GRPAVEESTT RPSSSAPSVS EEAVSGSSAD AEPAGTGADR SAPEPSAAEE AEPRPSAFEQ LAAEEQVSRE TVTGPSESAR PTASAPAPEK AAAEESAPEQ SASERSAREE GAPVASPSER PSAEQYAAEL SAPAPSATRE QRAPEETASE PAEPRENASA QVTAPARTEQ PKENAPAEPP AQRPTSEEAV PARPAAERRE PEPEPARPAP GRPSTGTVPS VVASPARAQR PSQQTRTAPA QQETASEEET APVRKAAKRA VRSRLPKPSG TASAPAAQAS PSAPAARSEQ LPLPDQPARP ERARARAQNA AASGAEPLTS RAERMERRTR AERRNATAPA GAVAEHPVPD TEGGESDGSQ PSRPLLRKPK KAGRGRSWRP RDNS
|
| |