Gene Information |
Locus tag | Ndas_0168 |
Symbol | |
ID | 9243999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 210696 |
End bp | 211865 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | LPXTG-motif cell wall anchor domain protein |
Protein accession | YP_003678124 |
Protein GI | 297559150 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
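The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 210696
end_bp = 211865
protein_length_aa = 389

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)
# prints: 1170 0 389
```

Under these assumptions the result agrees with the listed Gene Length (1170 bp) and Protein Length (389 aa).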
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGCA TCCTGCGTCG GTACGCCGCG CTGCTTTCCA CCTCGGCCCT GATCGGCACC CTGCTGGGCG CCCCGGCGGC CAGCGCCGAC GACTTCTCAC GTGTGGACCG CGACCCCGTC GAGGGCGCGC CGCTCGTGCT CGCCGACGGC CGGGAGGCCG ACACGGCGCT GTTCAGCCTG CGCGTGGCCG AACACGCCTC GGTCCGGGCC TACACGGTGA CCGCGGACGA GGAGGTCCAC CCGTACGCGG CCTACGTGGA GTCAGCGTGG TCGGACGTTC CGGAGTGGAC CGAAACCCCG CAGGAAACGG ATCCCGCCGA CCGGGCGGAC TGGATCGTCT CCCATTCCTA TCCGACGGTC GGTCTGGAGT CCCTGCTGGA GGGGAGCGAC CTGCCCCGGT TGGACGAGGC ACGGGCGATC GCGGGCACCC AGGCCGCCCT GTGGCACGTG CTGGAGGGGG TGCGGCCGGA CCCCGGCGCC AACGACCCGG GCGTCATGGC GCTGTACGAC CACCTGGTGG AGGGCAGCGC GTCCGCGCCG GACAACACCG TCTCCCGGTC CCTGGCGGTG TCGCCGTCCC ACGTGGAGGC GGTCGCCCCC GAGGAGCCGC TGGGGCCGTT GACCGTGCAC AGCTCCGGAG CGGAGCCGGT GTCGGTGTCG GTGCGCGGCG CCCCGGCGTC GTGGCTGGTC GACGCGGACG GGGAGCAGGT CACGCAGCTG GGCGACGGCG AGCGGGTCTT CCTGGACGTG GACCCGTCGG TCCCGGCGGG CGTGGCGACC CTGCACCTGC ACGGCCGGGA CTTCCCGCTG CCGCAGGGGC GCCTGTTCAC CGGCAGGGAC GGCGTGCGCA CCCGGTCCCT GGTGACCGCG GAGGCGGGTT CGGCGACGAG TTCGGCCACC GCGACGTTCA CGTGGCACCC GGAGCAGACC GAGGAGCCGG TGGCCGCGGC GAGCCCGGAG CCCGTGCCGG AGGAGGAACC CCCCGCCACC GAGGAGGCCG TCGAGGAGGA GGTTCCCGCC GAGGAGGTAT CTCCCACAGC GGAGGACCGA ATCCCCGAGG ACGATCTGGC CCTCACCGGA ACCTGGCTGT CCGGGCTGCT GGTGATCGCG GGGGCCCTGG TGGTATCCGG CCTGATCATC CTGGTTGTGG GACGTAAACG ACGCGACTGA
|
Protein sequence | MSGILRRYAA LLSTSALIGT LLGAPAASAD DFSRVDRDPV EGAPLVLADG READTALFSL RVAEHASVRA YTVTADEEVH PYAAYVESAW SDVPEWTETP QETDPADRAD WIVSHSYPTV GLESLLEGSD LPRLDEARAI AGTQAALWHV LEGVRPDPGA NDPGVMALYD HLVEGSASAP DNTVSRSLAV SPSHVEAVAP EEPLGPLTVH SSGAEPVSVS VRGAPASWLV DADGEQVTQL GDGERVFLDV DPSVPAGVAT LHLHGRDFPL PQGRLFTGRD GVRTRSLVTA EAGSATSSAT ATFTWHPEQT EEPVAAASPE PVPEEEPPAT EEAVEEEVPA EEVSPTAEDR IPEDDLALTG TWLSGLLVIA GALVVSGLII LVVGRKRRD
|
| |
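The relationship between the two sequence fields can be spot-checked by translating the ends of the gene sequence. The sketch below is a hand-rolled illustration, not the tool that produced this record. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the Strand field is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. Only the first and last three 10-base blocks of the sequence are used; the function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments for translation
# table 11 are the same as in the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last 30 bases of the gene sequence, copied from the record.
# Positions 1141-1170 start on a codon boundary because 1140 is divisible by 3.
first_30 = "ATGTCCGGCA TCCTGCGTCG GTACGCCGCG"
last_30 = "CTGGTTGTGG GACGTAAACG ACGCGACTGA"

print(translate(first_30))  # prints: MSGILRRYAA
print(translate(last_30))   # prints: LVVGRKRRD*
```

The first ten residues match the start of the listed protein sequence, and the last nine residues followed by a stop codon (TGA) match its end.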