Gene Information |
Locus tag | Ndas_4727 |
Symbol | |
ID | 9248609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5610874 |
End bp | 5612124 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF1205 |
Protein accession | YP_003682619 |
Protein GI | 297563645 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
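The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts a single terminal stop codon; the function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 5610874
END_BP = 5612124
GENE_LENGTH_BP = 1251
PROTEIN_LENGTH_AA = 416


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5612124 - 5610874 + 1 gives 1251 bp, and 1251 / 3 gives 417 codons, one of which is the stop codon, leaving 416 amino acids.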
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGTGC TGTTCGCGAC GCAGGCGGAG CGGACCCACT TCCTGGGGCT GGTGCCCCTG GCGTGGGCGC TGCGCGCCGC GGGCCACGAG GTCCGGGTGG CCAGCCAGCC CCATCTGGAG TCCGTCGCGG CGGGGGCGGG GCTGCCCTTC ACCGCCGTGG GCCGCGACCA CCACTTCGCC AGGATCAGCC GGGACGGCCG GGCGGACCGG CACCTGGACT TCGACATGGC CGAGGACCGC GACGAGGTCC TGACCTGGGA CCACCTCCTC CAGGGCTACC GCGGGGTGGT CACCTGGTGG TGGCGGACGG TCAACGACCC CATGTTCGAC GACCTGGTCG CCCTCTGCCG CGAGTGGCGC CCCCACCTGG TCGTGTGGGA GCCGTCCACC TACGCCGCCC CCGTGGCGGC CCAGGCGTGC GGTGCCGCAC ACGTGCGCCA CCTGTGGAGC GTGGACCTGT TCAGCAGGGT CCGCCGGACC TTCCTGGCGC GGATGGGCGA GCAGCCCGCC TCACAGCGGG AGGACCCCCT GGCCGCGTGG CTGGGGACCA GGGCGGCCCG GTACGGCGTG GACTTCTCCG AGACCCTGGT CCACGGCCAG GCCACCGTCG AGCAGGTGCC CTCCCCGCTC AGGGTGGACA CGCCCGCGCA CCTGGAGTAC CTGCCGGTGC GCTACGTGCC CTACAACGGA CGCGCCGTCG TCCCCCACTG GCTGCGTACA CAACCCGACC GCCCCCGGAT CGGACTCAGC CTCGGCACGA CGGCGGCGTT GCGCCTGGGC GGCTACACGG TCGACGTCGC GACCCTCCTG GAGGGTCTGG CCGAGCTGGA CGTGGAGGTG GTGGCCACCC TGCCCGCCAG TGAGCAGGCC AAGCTCGGCG CCGTCCCCGG CAACGCCCGC CTGGTCGAGT ACGTGCCCCT GCACGCCCTG GCCCCCACCT GCGCCGCCAT GGTCACCCAC GGCGGCCCCG GCACCGTCCT GACCGGCCTC GCCCACGGAG TCCCCCAACT CCTGTCACCC AACGCCCGGA TGTTCGACAT CCCGGTCCTC GCGGGGCTGG TGGAGGAGGC CGGGGCGGGC AGGGTCGTGG ACCCCGACCG CCTGGACGCC GCCACCGTCG CCGCAGGCGT GCGCACCCTC CTGGAGGACC CCCGCCACAC AAGCGCCGCC CGCGCCCTGC GCGCACGCAT GGACGCCATG CCCACCCCCG CCGACCTCGC CCACACCCTC GCCGGCCTCA CCCGCACCTG A
|
Protein sequence | MRVLFATQAE RTHFLGLVPL AWALRAAGHE VRVASQPHLE SVAAGAGLPF TAVGRDHHFA RISRDGRADR HLDFDMAEDR DEVLTWDHLL QGYRGVVTWW WRTVNDPMFD DLVALCREWR PHLVVWEPST YAAPVAAQAC GAAHVRHLWS VDLFSRVRRT FLARMGEQPA SQREDPLAAW LGTRAARYGV DFSETLVHGQ ATVEQVPSPL RVDTPAHLEY LPVRYVPYNG RAVVPHWLRT QPDRPRIGLS LGTTAALRLG GYTVDVATLL EGLAELDVEV VATLPASEQA KLGAVPGNAR LVEYVPLHAL APTCAAMVTH GGPGTVLTGL AHGVPQLLSP NARMFDIPVL AGLVEEAGAG RVVDPDRLDA ATVAAGVRTL LEDPRHTSAA RALRARMDAM PTPADLAHTL AGLTRT
|
| |
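The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the strand is "-"), and it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical, and only short excerpts of the sequence are used here.

```python
# Standard amino-acid assignments in TCAG codon order (used by table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, as grouped in the record.
    print(translate("ATGAGGGTGC TGTTCGCGAC GCAGGCGGAG"))
    # Last 15 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("CTCACCCGCACCTGA"))
```

The two excerpts correspond to the first ten residues (MRVLFATQAE) and the last four residues (LTRT) of the listed protein sequence.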