Gene Information |
Locus tag | Ndas_3506 |
Symbol | |
ID | 9247375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4208888 |
End bp | 4210531 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | protein of unknown function DUF187 |
Protein accession | YP_003681413 |
Protein GI | 297562439 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
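The coordinate and length fields above are related by simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 4208888
END_BP = 4210531


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1644 547
```

Both values agree with the Gene Length and Protein Length fields in the record.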
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.339986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCGAAAC CGCACCGGAA CCGTCCCGCT CATCCGCCCG CCCGGGGGGC GGGCCGGGTC TCCCGAGCCG TCGCCCTGGC CGCAGCGCTC ACCGCGTCGC TGCTCCACGC GCCGCACGCG GCGGCCGCCG CTCCGGCCGT CCCGGCGACG AACACCTCCG CCGAGCCGTG CGCGGTCGAT CCGCAGGCGC CGCCCAAGCG GCAGATGCGC GCCGAGTGGA TCTCCTCGGT GGTCAACATC GACTGGCCCA GCGAGCAGGG CCTCTCACCC GAGCGGCAGA AGGCCGAGCT GATCGACCTG TACGACCGGG CGAAGGCCGA CGGGCTCAAC GCCGTGTTCG TGCAGATCCG GCCGACCGCC GACGCGTTCT GGCCCTCACC GCACGAGCCC TGGTCGGAGT GGCTCACCGG CACGCAGGGC ACCGACCCCG GGTACGACCC GCTGGCGTTC GCGGTCGAGG AGGCGCACGC CCGCAACCTG GAGTTCCACG GCTGGTTCAA CCCCTACCGG GTGGCCATGC ACGACGACCC CTCGCGCCTG GTGGCCGACC ACCCGGCGCG GGTCAACCCG GACTGGGTGT TCGCCTACGG CGGCAAGCTC TACTACGACC CGGGCATCCC CGAGGTGCGC GCGTTCGTCG TCGAGGCGAT GATGCACGCG GTCGAGAACT ACGACCTGGA CGGCGTCCAC TTCGACGACT ACTTCTACCC CTACCCGGTG GCGGGCGAGA CGGTGCCCGA CCAGGACACG TTCGCCGAGT ACGGCGGAGA GTTCGGCGAC ATCGGGGACT GGCGGCGCGA CAACGTCAAC CGCATGGTCC AGGAGATGGA CGAGGCGGTG CACACCGCCA AGCCGCACGT GAAGTTCGGC ATCAGCCCGT TCGGGATCTG GCGCAACGAC ACCAGCGACC CGAACGGCTC CGACACCGGC GGGTTCGAGT CCTACAGCCA GATCTACGCC GACAGCCGCA GGTGGGTGCG CGAGGGCTGG GTGGACTACA TCAACCCGCA GGTCTACTGG GAGATCGGCC TGCCCGTGGC CGACTACGCG GTGCTCGTCC CCTGGTGGGA GCAGGTCACC GAGGGCACCG ACGTACACCT CTACATCGGC CAGGCCGCCT ACAAGGTCGG CAACGCGGGC GCCTGGTCCG ACCCGGACGA ACTCTCCCGG CACCTGGACC TGAACCGCGA GTACCCGGGC GTGGACGGGG ACGTCTACTT CAGCGCGAAC TCGCTGCGCA CCAACGCCAG GGACGCCATG GACGTCGTGG TCGAACAGCA CTACGCCAAC CCCGCCCTGA TCCCGGTCAA GGAGGACCTG GGCGGGGCCG CTCCCGCACC GCCGGTGGTC ACCGCCGCCG CCCGGGCCGA CGGCGGCACC GAGCTGACCA TCCGCCCCGG ACGCGGCGGC AGGCCCGCCT ACTACGCGGT CTACGAGCTG GAGGGCGCAC CCGGACGGCA GGAGGTCCCG TGCGAAGTGC AGGACGCGCG CGCACTGATC GGCACGGTGC GCGCGGCCGA GGACGGTGGC GAGACGGTCT TCACCGCGCC CGGCGGCGGT GACGTGACCT ACTACGTGAC CGCCCTGGAC CGGCTGCACC ACGAGAGCAC GACGAGCAAC CCGAGGCATG TGCCCCGGGG CTGA
|
Protein sequence | MAKPHRNRPA HPPARGAGRV SRAVALAAAL TASLLHAPHA AAAAPAVPAT NTSAEPCAVD PQAPPKRQMR AEWISSVVNI DWPSEQGLSP ERQKAELIDL YDRAKADGLN AVFVQIRPTA DAFWPSPHEP WSEWLTGTQG TDPGYDPLAF AVEEAHARNL EFHGWFNPYR VAMHDDPSRL VADHPARVNP DWVFAYGGKL YYDPGIPEVR AFVVEAMMHA VENYDLDGVH FDDYFYPYPV AGETVPDQDT FAEYGGEFGD IGDWRRDNVN RMVQEMDEAV HTAKPHVKFG ISPFGIWRND TSDPNGSDTG GFESYSQIYA DSRRWVREGW VDYINPQVYW EIGLPVADYA VLVPWWEQVT EGTDVHLYIG QAAYKVGNAG AWSDPDELSR HLDLNREYPG VDGDVYFSAN SLRTNARDAM DVVVEQHYAN PALIPVKEDL GGAAPAPPVV TAAARADGGT ELTIRPGRGG RPAYYAVYEL EGAPGRQEVP CEVQDARALI GTVRAAEDGG ETVFTAPGGG DVTYYVTALD RLHHESTTSN PRHVPRG
|
| |
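The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the record. The following is a minimal sketch, not the database's own pipeline: it assumes the codon assignments of table 11 match the standard code, that any of the table 11 start codons (including the TTG used here) is read as methionine at the first position, and that the 10-base blocks are separated only by whitespace. All helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 is assumed to share
# them and to differ only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate_cds(gene_seq: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at '*'."""
    dna = clean(gene_seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon: {codons[0]}")
    residues = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGGCGAAAC CGCACCGGAA CCGTCCCGCT"))  # prints: MAKPHRNRPA
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence (after applying `clean` to both) gives a quick consistency check, and `gc_fraction` can be compared in the same way with the GC content field.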