Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3077 |
Symbol | |
ID | 9246933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3679149 |
End bp | 3680816 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | protein of unknown function DUF262 |
Protein accession | YP_003680992 |
Protein GI | 297562018 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.392398 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAGC TCGAAGCCCA CGAGGTCCCC CTGCACAAGG TCTTCTCCAG CGACTACGAC TTTCGCATCC CCGACTACCA GCGCCCCTAC GCCTGGGAGG CCGAGGACGC CCTCCAGCTT CTCGACGACC TCAAGGAAGC TCTGGAACGC GACAGGGAAG AGCCCTACTT CCTCGGATCG GCTGTGCTGG TCAAGAGCAA GGAATCGGCC ATTGCCGAGG TCATCGACGG CCAGCAGCGC CTGACGACCC TCACCATCCT GTTCGCGATC CTGCGTGACC AGACCAAGGA CGCGGAGCTG CGTACCGAGC TGGAGAAGAT GGTGGTGGAG CCGGGCAGTA AGATGCTCAA GCTGGACCCC AAACCCCGGT TGGCGCTACG GCCGAAGGAC GTGGAGTTCT TCCGCGAGCA CGTGCAGACG ACTGGTTCCG TTCCCGGTCT GCTTGGCCTT CCCCGTACGG CTCTGAAGAC CAGTGCTCAG GAGGCGGTAC AGACCAACGC GAAGGTTCTG TCCCGTGCGC TTGAGGGGTG GTCCGACGAG CGTCGGTTGG AACTCGCCGG AATGCTCAGC GCGCGAACCT ACCTCGTCGT GGTCAGCACT CCAGACCTGA ACAGCGCACA CCGTATCTTC AGCGTCATGA ACGCCCGAGG ACTCGACCTG TCCCCGGCCG ACATCTTCAA AGCGAGGATC ATCGGTGACC TGGATCCGAA GCTCAGCAGC ATGTACGCGG CCAAGTGGGA GGACGCCGAG GAGTCGCTGG GACGCGACGA CTTCGCCGAT ACCTTCCTCC ATTTGAGGCT GATCTTCTCG GGTGAACGTG CTCGGCGGGA GCTGTTGCTG GAGTTTCCCA AACAGGTCCT CTCGCGCTAT CTGCCGGGCA ACGGCGCGGA GTTCATCGAT GACGTCCTGA TTCCCTACAC CGACGCCTAC GCTCAGATCC GCGATCAGAG CCACTCCTTC CCAGCCGGGG CGGACAAGGT CAGTGCCTGG TTCAAGCGCT TGGAGCAGCT CGACAACAGC GACTGGCGGC CGGCCGCGCT CTGGGCGGTG CGTCACCACC GCCACGACCC CGACTGGCTC GACCAGTTCT TCCGCCGCCT GGAGCGGCTG GCTGCCAGTA TGTTCATCCG CCGGGTCTAT CGGACACCCC GGATACAGCG CTACGTCGAA CTCGTACGTG AGCTCAACTC TGGTAAGGGC TTGGACGCGC GTTCCTTCGA ACTCAGTGAA GAGGAGAAGC GCGCGACCCG GGCCGAACTC GACGGTGAAC TCTATCTGTC CACCAAAGTC CGTCGCTGCG TCCTGCTCCG CCTCGATGAG ATCCTCGCGG ACGAGTCCGG GGTCGTCTAC GAGTACGAGA CCATCACGGT TGAGCACGTC CTTCCACAGA ATCCGGCCCC GGGATGGACG TCCTCCTTCA ACCAGGAACA GCGCGACTAC TGGACTCACC GCGTCGGTAA TCTTGTTCTA CTCAACCGGA GGAAGAACTC ACAGGCACAG AACTACGGCT TCCTCAGGAA GAAGGAGAAG TACTTCATGG GGAAGGGCGG AGTGGTGACT TTCGCACTCA CCAGCCAGGT GCTCACCCAC TCCGAGTGGA CCCCTGAGGT GATCCAAGAG CGTCAGGAAC GGCTGGTCGA GACGCTGGCT CGGGAATGGG ATCTGTGA
|
Protein sequence | MQQLEAHEVP LHKVFSSDYD FRIPDYQRPY AWEAEDALQL LDDLKEALER DREEPYFLGS AVLVKSKESA IAEVIDGQQR LTTLTILFAI LRDQTKDAEL RTELEKMVVE PGSKMLKLDP KPRLALRPKD VEFFREHVQT TGSVPGLLGL PRTALKTSAQ EAVQTNAKVL SRALEGWSDE RRLELAGMLS ARTYLVVVST PDLNSAHRIF SVMNARGLDL SPADIFKARI IGDLDPKLSS MYAAKWEDAE ESLGRDDFAD TFLHLRLIFS GERARRELLL EFPKQVLSRY LPGNGAEFID DVLIPYTDAY AQIRDQSHSF PAGADKVSAW FKRLEQLDNS DWRPAALWAV RHHRHDPDWL DQFFRRLERL AASMFIRRVY RTPRIQRYVE LVRELNSGKG LDARSFELSE EEKRATRAEL DGELYLSTKV RRCVLLRLDE ILADESGVVY EYETITVEHV LPQNPAPGWT SSFNQEQRDY WTHRVGNLVL LNRRKNSQAQ NYGFLRKKEK YFMGKGGVVT FALTSQVLTH SEWTPEVIQE RQERLVETLA REWDL
|
| |