Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2594 |
Symbol | |
ID | 9246445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3089597 |
End bp | 3091024 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF162 |
Protein accession | YP_003680518 |
Protein GI | 297561544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.487179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0287829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGA CGTTCCTGGG CATCCCGCCC TTTCCCGAGG CCGCCGCGGG CGCGGTCAAC GACGCCCGCC TGCGGCACAA CCTGCGCAGG GCCACGCACA CCATCCGCGA CAAGCGCGCC TCCGTCGTCG ACGAGCTGCA CGAGGACTGG CAGCGCCTGC GGGCGGAGGG CGCCGCGGTC AAGGAGCACA CCCTGCGCCA CCTGGACCAC TACCTGGAAC AGCTGGAGGA GTCCGTCCAC CGGGCGGGCG GACGGGTGCA CTGGGCCTCC GACGCGGCCG AGGCCAACGA GATCGTCACC CGGCTGGTCC GGGACACCGG CGAGACCGAC GTGGTCAAGG TCAAGTCGAT GGCCACCCAG GAGATCGAAC TCAACGACGC CCTCGCCGCC GCCGGGATCA CCGCCTACGA GACCGACCTG GCCGAGCTGA TCGTGCAGCT GGGCGAGGAC CTGCCCTCGC ACATCCTGGT GCCCGCCATC CACCTGGGCC GCGCCCAGAT CCGCGAGATC TTCCTGGAGC AGATGGCCGA GTGGGGCGTG CCCGCGCCCC GGGGCCTGAC CGACGACCCC CGGGCGCTGG CCGAGGCGGC CCGCGTGCAC CTGCGCGAAC GCTTCCTGCG CACCCGCACC GCGATCTCGG GGGCCAACTT CGCGGTGGCC GACAGCGGCA CGCTCGTGGT GCTGGAGTCG GAGGGCAACG GGCGCATGTG CCTGACCCTG CCCCGGACGC TGATCTCCGT GGTCGGCATC GAGAAGATCG TGCCGACCTG GTCGGACCTG GAGGTGTTCC TCCAGTTGCT GCCGCGTTCC TCCACCGGCG AGCGGATGAA CCCCTACACC TCCACCTGGA CCGGGGTGAC GCCGGGCGAC GGCCCCCAGG AGTTCCACCT GGTGCTGCTG GACAACGGCC GCACCGACGT GCTGGCCGAC ACCGTGGGCC GCCAGGCGCT GCGCTGCATC CGCTGCTCGG CGTGCCTGAA CACCTGCCCG GTCTACGAGC GCACCGGCGG GCACTCCTAC GGTTCGGTCT ACCCGGGCCC GATCGGCGCG ATCCTCACCC CGCAGCTGCG GGGGATGTCC TCGCCGGTGG ACGAGGCGCT GCCCTACGCG TCCTCGCTGT GCGGGGCCTG CTACGAGGTG TGCCCGGTGG CCATCGACAT CCCCGAGGTG CTGGTGCACC TGCGCGAGGA GGTCGTGGAG CGCTCCGGGC ACGCGGGGGA GAAGGCGCTC ATGGCGGGCG CCGAGGCGGT GCTGTCCTCC CCGCGGACGC TGGGCGCGGT CCAGCGGGCG GCGGGGCTGG GGCGCCGCGC GGTCCCGCGC CACCTGCCGG GCCTGGCGGG GGCCTGGACC GACACCAGGG ACGTGCCCGA CGTCCCGGCC GAGTCCTTCC GCCAGTGGTG GGACAGGCGC GAGGGGGAGG GCCGATGA
|
Protein sequence | MSATFLGIPP FPEAAAGAVN DARLRHNLRR ATHTIRDKRA SVVDELHEDW QRLRAEGAAV KEHTLRHLDH YLEQLEESVH RAGGRVHWAS DAAEANEIVT RLVRDTGETD VVKVKSMATQ EIELNDALAA AGITAYETDL AELIVQLGED LPSHILVPAI HLGRAQIREI FLEQMAEWGV PAPRGLTDDP RALAEAARVH LRERFLRTRT AISGANFAVA DSGTLVVLES EGNGRMCLTL PRTLISVVGI EKIVPTWSDL EVFLQLLPRS STGERMNPYT STWTGVTPGD GPQEFHLVLL DNGRTDVLAD TVGRQALRCI RCSACLNTCP VYERTGGHSY GSVYPGPIGA ILTPQLRGMS SPVDEALPYA SSLCGACYEV CPVAIDIPEV LVHLREEVVE RSGHAGEKAL MAGAEAVLSS PRTLGAVQRA AGLGRRAVPR HLPGLAGAWT DTRDVPDVPA ESFRQWWDRR EGEGR
|
| |