Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0167 |
Symbol | |
ID | 9243998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 209058 |
End bp | 210284 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | LPXTG-motif cell wall anchor domain protein |
Protein accession | YP_003678123 |
Protein GI | 297559149 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.926777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTATCT CCTCCCTCCC CCGCACCGTC GGCCGTACCG GCCTGGCGGC CACCGCAGCC GCGTTCCTGG CCTTCGGCCT GGCCGCCCCC GCGGCCGCGG ACCCCGCGGA GACCTACGAT GGTGCCGTCC GCGGGAAGTA CATCGGCAAC GCCGAGAACG GCGCCGACGT CCGCATGACC GGCGAAACCA TCGGCACCCG CCTCTTCAAC CTGGAGCTCG AAGACGGCAC GGTCCTGACG ACCTACTGCA TCGACTTCGA GACCCAGATC CGCGGCGGCG CCTTCTACAA CGAGGACGAC TGGGCCAACT ACCCCGGCAA GGGCGAGTTC GCCGCGCCCG CCAAGGTCCA CTGGATCCTG CAAAACGCCT ACCCGGCCCT GAGCGCCGAG GAGCTGGGCG CGGCCGCCGG TGTCGAGGGC CTGAGCCAGC GGGACGCCCT GGGCGCCACC CAGGCCGCGA TCTGGCACTT CAGCAACGGC GTGGACCTCG AGGACGAGGG CAACTCCCAG GCTGTCAGGA CGGTCTACAC CTACCTGATC GAGAACGCCG AGGAGCTGCC GCAGACCGCC GAGCCCGCCC CGGCCCTGAG CATCACGCCG GGCACGGCCT CCGGCACCGC CGGCGAGACC GTCGGTGAGT TCCTCATCCA GACCAACGCC AGCTCCATCC CGGTGGACCT CCAGGCCCCC GAGGGCGTCC AGCTGGTCGA CCTGGAGACC GGCGAGACCG TCACCGAGGT CGGCAACGGT GACACCGTCG GCTTCGCCGT CCCGGCCGAC GCCGAGGCGG GCCAGGCCAG CTTCTCCCTG GAGGCCAGCG CGACCGTCGC GACCGGCCGC CTGTTCAAGG GGGACGAGGA GTCCCAGCCG ACCCAGACCC TGATCACCGC CGAGGGCGGC GAGACCAGCG TGTCCGCGTC CGCCTCGGCC GACTGGACCG TGGGCGGCGG CGAGACCCCG CCGGAGTCCC CGGAGCCCAG CGAGCCGGAG TCCCCCGAGC CCAGCGAGCC CGAGAGCCCG GAGCCGACCC CGAGCGACAA GCCCTCCGAG CCCGCCGACA AGCCGTCCGA GCCCGCCGAC GACCAGAACG AGCCGACCCT GCCGGTGACC GGTGGCGCGC TCGCTGGCCT GGTCGCCGCC GGTGTGGCCG CCCTGGGCGC CGGTGGTGGC GCCATCTACC TGAGCCGCAA GCGCAAGGCC GCCAACAGCC AGGACCTGGA GGGCTAA
|
Protein sequence | MSISSLPRTV GRTGLAATAA AFLAFGLAAP AAADPAETYD GAVRGKYIGN AENGADVRMT GETIGTRLFN LELEDGTVLT TYCIDFETQI RGGAFYNEDD WANYPGKGEF AAPAKVHWIL QNAYPALSAE ELGAAAGVEG LSQRDALGAT QAAIWHFSNG VDLEDEGNSQ AVRTVYTYLI ENAEELPQTA EPAPALSITP GTASGTAGET VGEFLIQTNA SSIPVDLQAP EGVQLVDLET GETVTEVGNG DTVGFAVPAD AEAGQASFSL EASATVATGR LFKGDEESQP TQTLITAEGG ETSVSASASA DWTVGGGETP PESPEPSEPE SPEPSEPESP EPTPSDKPSE PADKPSEPAD DQNEPTLPVT GGALAGLVAA GVAALGAGGG AIYLSRKRKA ANSQDLEG
|
| |