Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3750 |
Symbol | |
ID | 9247619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4503100 |
End bp | 4504404 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | protein of unknown function DUF58 |
Protein accession | YP_003681654 |
Protein GI | 297562680 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.237776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGTCA CCGGCCGGGC GGTGCTGCTG GCGCTGGCGG CCACGGTCGC GGTGGCGCTG TCCGGTCTGA TCGGCGCCGC GGCCGCGGCC GCGGCCCTGG GCGTGCTGGC GGCGCTGCTC GCCCTGGACG TCGTGCTGGC GGCGAGCCCC AAGGCGGTGC TGCTGTCCCG TGAGGGCGAC ACGTCGCTGC GCCTGGGCGA CTCGGCGACG GTGTACGTGA CCGTCGCCAA CCCCACCCGG CGGGCACTGC GGGGGTCGGT GCGCGACGCC TGGCCGCCGA GCGCGCACGC CGCGCCGCGC AGCCAGCCGC TGCGGGTACC GGCCGGGGAG CGGCGCCGGG TGAGGACCGT CCTGACCCCG ACGCGGCGCG GCGACGCCCG GGCCGCGGGC GTGACCGTGC GCAGCCTGGG GCCGCTGGGG CTGGCGGGCA GGCAGCGGAC GCTGCCCGCG CCCTGGACCG TGCGCACGCT GCCGCCCTTC CACAGCAGGC GCCACCTGCC GGGCAAGCTG TCCCGGCTGC GCGAGCTGGA GGGGCAGCAC ACGGCGATGG TGCGCGGGCA GGGCAGCGAG TTCGACTCCC TGCGCGACTA CGTGCCCGGC GACGACGTGC GGTCGATCGA CTGGCGGGCC ACGGCCCGCG GCGACGGCGT GGTGGTGCGC ACGTGGCGGC CCGAGAGGGA CCGGCGCATC CTCATCGTGC TGGACACCGG GCGCACGTCG GCCGGGCGGG TGGGCGACAC CCCGCGCCTG GACCACGCCA TGGACGCCGC GCTGCTGCTG GCCGCCCTGG CGGGCAGGGC GGGCGACCGG GTGGACTTCC TGGCCTACGA CCGGCGCACG CGCGCGCAGG TGCGCTCGTC GGGCAAGGGC GGCCAGCAGG TGGGCCGGAT CGTGGAGGCC ATGGCCCCGC TGGAGGCGGA GCTGGTGGAG TCCGACCCGG CGGGCCTGGT GGGGACGGTC CTGGGCACGC AGGGGCGGGC CCGGCGGCTG GTGGTGCTGC TGACCGACCT GAACGCGGCG TCGCTGGAGG AGGGGCTGCT GCCGAGGCTG CCCGTGCTCA CCTCCCGGCA CCTGCTGCTG GTCGCCGCGG TCAACGATCC GGCGGTGGAG CTGATGGCCG CCGAGCGGGG CAGCGCGGAC GCGCTGTACC GGGCGGCGGC CGCGGAGCGG ACGCTGGGCG AGCGGCGCCG GGTGACCGCC GAGCTGCGCC GGATGGGCGT GGAGGTGGTC GACGCCGACC CCGAGCACAT CGCGCCCGCG TTGGCTGACG CCTACATCAA CCTCAAGGCT CAGGGCAGGC TGTAG
|
Protein sequence | MVVTGRAVLL ALAATVAVAL SGLIGAAAAA AALGVLAALL ALDVVLAASP KAVLLSREGD TSLRLGDSAT VYVTVANPTR RALRGSVRDA WPPSAHAAPR SQPLRVPAGE RRRVRTVLTP TRRGDARAAG VTVRSLGPLG LAGRQRTLPA PWTVRTLPPF HSRRHLPGKL SRLRELEGQH TAMVRGQGSE FDSLRDYVPG DDVRSIDWRA TARGDGVVVR TWRPERDRRI LIVLDTGRTS AGRVGDTPRL DHAMDAALLL AALAGRAGDR VDFLAYDRRT RAQVRSSGKG GQQVGRIVEA MAPLEAELVE SDPAGLVGTV LGTQGRARRL VVLLTDLNAA SLEEGLLPRL PVLTSRHLLL VAAVNDPAVE LMAAERGSAD ALYRAAAAER TLGERRRVTA ELRRMGVEVV DADPEHIAPA LADAYINLKA QGRL
|
| |