Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3795 |
Symbol | |
ID | 9247666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4556517 |
End bp | 4559540 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function UPF0182 |
Protein accession | YP_003681699 |
Protein GI | 297562725 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.214749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0401617 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCTTCC GATCGCCCGG CGCACCACCT GCGCGTATGC CTCGCCGATC AAGGTTGCTC GCGCCAGTCG CGGCGACCGT GGTCGTCATC ATCGCGGGGC TGATGCTCGC CGCGAACTTC TGGACCGACT TCAAGTGGTT CGAATCCGTC GGCTACACCT CGGTCTTCCT GACCGAACTG TGGACCCGCG TGCTGCTGTT CGCCGTCGCC GGACTGGTGA TGGCCGTCAT CGTCGGCGCC AGCATCTTCT TCGCCTACCG CTCCCGGCCC GGCATCCGAC CGATGAGCCT GGAGCAGCAG GGCCTGGACC GCTACCGGCA GTCCATCGAC CCGCACCGCA AACTGTTCTT CTGGATCGCC GTGGGCGCCC TCGCGCTGCT GGCCGGAGCC GCGGCCAGCG GTGACTGGCG GTCCTACCTC CAGTTCGTGA ACAGCACCGA CTTCGGGGTG AACGACCCCG AGTTCGGCAT GGACGTGGCC TTCTTCGCGT TCACCTACCC GTTCCTGCGC ATCCTGCTCG GCTACCTGTA CGCGGCGGTC ATCCTGGCGT TCATCGCCGC GGTGATCGTG CACTACCTGT ACGGCGGGGT CCGCCTCCAG AACGACTCCG GCCAGCGCGC CACCCCCTCG GCCCGCGTGC ACCTGTCGGT GCTGCTGGGC CTGTTCGTGC TGCTGCGCGG CGGTTCCTAC TGGCTGGACC GCTACGGCCT GGTCTTCTCC GAGCGCGGCT ACACCTTCGG CGCCTCGTAC ACCGACGTGA ACGCCGTCAA GACCGCGCTG CTCATCCTCA CGGTCATCTC GGTCATCTGC GCCGTGCTGT TCTTCGCCAA CATCTACTTC AAGAACATCA TGGTGCCCAT GGCCAGCCTC GGACTGCTGG TGCTGTCCGC GGTGCTGGTC GGCGTGGCCT ACCCCGAGAT CGTCCAGCGC TTCCAGGTCG CGCCCAACGA GCAGCGCCTG GAGAGCCCCT ACATCGAGCG CAACATCGAG TACACGCGCC AGGCCTACGG CATCGACGAC GCCGAGGTGC AGGCCTACGA CGCCACCACC GAGCTCAGCG CCCAGGACCT GGTGGAGGAG TCCCAGGACA CCACCGTGCG CCTGGTCGAC CCCTCCGTCG TCTCGCAGAC CTTCCAGCAG ATGCAGCAGG TCCGCGGCTT CTACCAGTTC CCCGAGGTGC TGGAGGTCGA CCGCTACCCG GACTCCGAGG GCAACCTGAT CGACACGATC GTGGCCGTCC GAGAGCTGGA CGGGCCGCCC GCCGACCAGG ACAACTGGCT CAACCGGCAC CTGATCTACA CCCACGGCTA CGGCATGGTC GCCGCCGCGG GCACCCAGAT CGACGCCGAG GGGCGCCCGG TCTTCACCGA GTACAACATC CCTCCGCGCG GTGAGCTGAG CGACGTCGTC GGCGAGTACG AGCCGCGCAT CTACTACGGC CGCGAGGGCG CCGAGTACGC GATCGTGCAG GCCGAGGAGG AGTACGACTA CCCCCTCGAC GCCGAGGAGG AGACCGGCGA CGTCCCGACC CCGGAGGACG CGGTCGAGCC GGAGGTCTCC CCGAGCCCGG ACGAGGCCCG CGCCCCGTCC GACGCGGACC AGGAGGCCTC GGAGCAGACC GCAGAGGAGG CGCCCGCCGA GGGCGGCGGC GAGGGCGCCG GAGGCGGTGG GGAGGGCAGC GACTCGCAGG CCTACAACCG CTACGACGGC GACGGCGGCG TCCAGCTGGC CAGCTTCTTC GACCGGATCC TGTACGCGAT CAAGTACCAG GAACCCAACA TCCTGCTCAA CAGCGCCATC ACCAACGACT CGCGCATCAT CTACGAGCGC GACCCGGTGG AGCGCGTGGA GAAGGTGGCC CCGTACCTGA CCACGGACAG CCGACCCTAC CCGGCGGTCG TCGACGGCCG GGTCGTGTGG ATCGTGGACG CCTACACCAC GTCCGACGGC TACCCGTACG CCAACCGGAT CGACTTCACC CAGGCGGTGA CCGACACCTT CACCGACGGC TCGGCCCAGC AGGTGGGCGC GCTGCCGGGC AACGAGGTCA ACTACATCCG CAACTCGGTG AAGGCCACCG TCGACGCCTA CGACGGCACC GTCACCCTGT ACGCGTGGGA CGAGGCGGAC CCGGTCCTCC AGACCTGGAT GGACGCCTTC CCCGGGACCG TCGTCGGCAG GGACCAGATG AGCGAGGAGC TCGTCGACCA CCTGCGCTAC CCGGACGACC TGTTCAAGGT GCAGCGCCAG ATCATGCGCG AGTATCACGT GACGGACGCC GCGGCCTACT ACGGCGGTCA GGACTTCTGG TCGGTCCCCA GCGACCCGAC CAGCGAGACC GATGCCCCCG AGCCGCCCTA CCGGCAGACC ATCCAGTACC CGGGTGAGGA GTCCACCTTC TCGCTGACCA GCACGTTCGT GCCGCGCGGC CGTGAGAACC TGGCGGCCTT CATGGCGGTG GACAGCGATC CGCGCTCCGA GGAGTACGGC CAGCTCAAGC TGCTGGAGCT GCCGCGGAGC ACGGTGATCC TCGGCCCGGG GCAGGTGCAG AACGCCTTCG ACGCCGACGC CGACGTCCGC GAGGTGCTGC TGCCGCTGGA GCAGTCCAAC GCGGAGGTGA CGCGGGGCAA CCTGCTCACG CTGCCCTTCG CCGGGGGCCT GCTCTACGTC GAGCCGCTGT ACGTGCAGGC GGGCGGCGGC GGAGCCGCCT CCTTCCCGCT GCTCCAGCAG GTCATGGTCG GCTTCGGTGA CGAGGTGGCC ATCGGCAACA GCCTGCCGGA CGCGCTCAGC AACCTCTTCG ACGGAGAGGG CGCGGCCCCC GAGGAGGGCG TCGAGCAGAC TCCCGAGGAG GAGTCCGAGG CCGGTGGCGG CGGCGGAGGC GGTGGTGGGG GCAACGAGGA CCTCACCGAG TCCCTCAACG AGGCCGTCGA GGCCTGGGAA GAGGCCCAGC AGTCCAACGA GGAGGCCAAC GACCGCCTGC GCGAGGCGCT GGAGGACATC CAGCAGCAGC TGGACGAGAA CTGA
|
Protein sequence | MSFRSPGAPP ARMPRRSRLL APVAATVVVI IAGLMLAANF WTDFKWFESV GYTSVFLTEL WTRVLLFAVA GLVMAVIVGA SIFFAYRSRP GIRPMSLEQQ GLDRYRQSID PHRKLFFWIA VGALALLAGA AASGDWRSYL QFVNSTDFGV NDPEFGMDVA FFAFTYPFLR ILLGYLYAAV ILAFIAAVIV HYLYGGVRLQ NDSGQRATPS ARVHLSVLLG LFVLLRGGSY WLDRYGLVFS ERGYTFGASY TDVNAVKTAL LILTVISVIC AVLFFANIYF KNIMVPMASL GLLVLSAVLV GVAYPEIVQR FQVAPNEQRL ESPYIERNIE YTRQAYGIDD AEVQAYDATT ELSAQDLVEE SQDTTVRLVD PSVVSQTFQQ MQQVRGFYQF PEVLEVDRYP DSEGNLIDTI VAVRELDGPP ADQDNWLNRH LIYTHGYGMV AAAGTQIDAE GRPVFTEYNI PPRGELSDVV GEYEPRIYYG REGAEYAIVQ AEEEYDYPLD AEEETGDVPT PEDAVEPEVS PSPDEARAPS DADQEASEQT AEEAPAEGGG EGAGGGGEGS DSQAYNRYDG DGGVQLASFF DRILYAIKYQ EPNILLNSAI TNDSRIIYER DPVERVEKVA PYLTTDSRPY PAVVDGRVVW IVDAYTTSDG YPYANRIDFT QAVTDTFTDG SAQQVGALPG NEVNYIRNSV KATVDAYDGT VTLYAWDEAD PVLQTWMDAF PGTVVGRDQM SEELVDHLRY PDDLFKVQRQ IMREYHVTDA AAYYGGQDFW SVPSDPTSET DAPEPPYRQT IQYPGEESTF SLTSTFVPRG RENLAAFMAV DSDPRSEEYG QLKLLELPRS TVILGPGQVQ NAFDADADVR EVLLPLEQSN AEVTRGNLLT LPFAGGLLYV EPLYVQAGGG GAASFPLLQQ VMVGFGDEVA IGNSLPDALS NLFDGEGAAP EEGVEQTPEE ESEAGGGGGG GGGGNEDLTE SLNEAVEAWE EAQQSNEEAN DRLREALEDI QQQLDEN
|
| |