Gene Information |
Locus tag | Ndas_5328 |
Symbol | |
ID | 9249228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 492122 |
End bp | 493273 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | type II secretion system protein E |
Protein accession | YP_003683214 |
Protein GI | 297564241 |
COG category | |
COG ID | |
TIGRFAM ID | |
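The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 492122
END_BP = 493273


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the gene length is a whole number of codons and that the
    final codon is a stop codon, which does not encode a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # compare with "Gene Length"
print(expected_protein_length(length_bp))  # compare with "Protein Length"
```

Under these assumptions the two printed values should match the Gene Length and Protein Length fields of the record.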
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.194957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.192507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCCC TGGTGGAGGC CGTGCGCAAC CGCCTGGTGG CGTCCGGGGT GCCGGTGAGC CCGTCGAGCG TCGCTGCGGC GCTGCGCGCC GAGGGAGGTC TGCTGGGCGA CACCGAGGTG CTGTCCATGA CCCGCCGTCT GGCCGCCGAC CTGTCGGGGG CGGGGCCGCT GGAGGAGCTC ATGACGTCGG GGGTCACCGA CATCCTCGTC AACGGTCCCG ACGAGGTGTG GGTGGACGAC GGCGGAGGCC TGCGCCGGGC CGGGGTGCGC TTCGAGTCGG CGGACGCGGT GCGCCGACTG GCCCAGCGGC TGGCGGCGCA GGCGGGGAGA CGGTTGGACG CGGCCGTGCC CTACGTGGAC GCCCGGCTTC CCAGCGGCGC GCGGCTGCAC GCGGTGCTGC CGCCGGTGGC GCCCGCCGGG GCGTGCGTGT CGCTGCGGCT GCCTCCCCGG CGGGTGTTCA CCCTGGAGCG GTTGGCCCGG CGGGGGACGG TCACCCCGTG CGGGGCCGAG CTGCTGCGGT CCCTGGTGGC CTCGCGGGTG CCGTTCCTGG TCAGCGGGGG CACCGGCACG GGCAAGACCA CGCTGCTGTC GTGCCTGCTC TCCCTGGTGG ACCCGGGGGA GCGGATCGTC CTGGCCGAGG ACTCCCCCGA GCTGCGGCCC GAGCATCCGC ACGTGGTCCG CCTCCAGACC CGTCCGCCCA ACATCGAGGG CAGTGGCGAG GTGTCGCTGG AGACCCTGGT CCGGCAGGCG CTGCGGATGC GTCCGGACCG GCTGGTGGTC GGCGAGGCGC GCGGGCCGGA GATCGTGTCC TTGCTGGGCG CTCTGAACAC CGGGCACGAG GGCGGCGCGG GGACCCTGCA CGCCAACGGC GCGGGCGACG TGCCCGCCCG GGTGGAGGCG CTGGGGTGCG CGGCCGGACT GGACCGGGCC GCCGTGCACA GCCAGCTGGC CGCCACCAGG GTCATGGTCG TGCACCTGGT CCGGGACTCC GGCGGGCGGC GCCTGGCCGA GCTGCGGGTG CTCAGACGCG GTTCGGACGG GCTGGTGGAG GCGGTGCCGG CGGTGTCCTT CGGTCCCGAC GGCAGCCGCC GGGAACACGC CGGGGCGGAC GAGGTGGCCG CCCGGCTGGG CGGGGGGTGG CGCCGGTGGT GA
|
Protein sequence | MSPLVEAVRN RLVASGVPVS PSSVAAALRA EGGLLGDTEV LSMTRRLAAD LSGAGPLEEL MTSGVTDILV NGPDEVWVDD GGGLRRAGVR FESADAVRRL AQRLAAQAGR RLDAAVPYVD ARLPSGARLH AVLPPVAPAG ACVSLRLPPR RVFTLERLAR RGTVTPCGAE LLRSLVASRV PFLVSGGTGT GKTTLLSCLL SLVDPGERIV LAEDSPELRP EHPHVVRLQT RPPNIEGSGE VSLETLVRQA LRMRPDRLVV GEARGPEIVS LLGALNTGHE GGAGTLHANG AGDVPARVEA LGCAAGLDRA AVHSQLAATR VMVVHLVRDS GGRRLAELRV LRRGSDGLVE AVPAVSFGPD GSRREHAGAD EVAARLGGGW RRW