Gene Information |
Locus tag | Ndas_1780 |
Symbol | |
ID | 9245630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2178625 |
End bp | 2180091 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | PepSY-associated TM helix domain protein |
Protein accession | YP_003679714 |
Protein GI | 297560740 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.40116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAGG AAGACAGCGC GCCCCCGCAG GACGGGACGC GCGCGGAGCC GGACCGGCGC GGCACCTGGG CGGCGCTGCG CCCCCTGGTG CTGCGCCTGC ACTTCTACGC GGGGGTCCTC GTCGCCCCCT TCATCCTGGT CGCCGCCGTG TCGGGGCTGC TGTACGTGTG GACGCCCCAG ATCGAGCAGG CGGTCTACGC CGAACAGCTG CGGGTGGAGC CCTCCGGCGA ACCGGTCCCG CTCCACACAC AGGTGCGCGT CGCCCAGGAG GAGCTGCCGG GAGCCGAACT CGACGCGGTG CGACCGGCCA CCGGCGCGGA GGACTCCACC CGGGTGCTGT TCGACGTGCC GGGGCTGGAG GCCAGCCACA GGACGACGGT GTTCGTCGAC CCCTACGGCG GCGAGGTGCT CGGGGTGATG GAGACCTACG GCACCAGCGG CGCCCTCCCG GCCCGCACGT GGGTCGACAC CCTGCACCGC AGCCTGCACC TGGGCGACGT CGGACGCCTG TACAGCGAGC TCGCGGCGAG CTGGATGTGG GTGGTCGCCC TGGGCGGTGT CGCCCTGTGG GTCGCGCGCA ACCGGCGTCC GCGCGGCGGT CTCCGCCGCC TGCTGCTGCC CGGCGCGGGC ACCTCCGGGG GACGGTCCAG GTCGGTCTCG CTGCACGGAG CCGCCGGTCT GTGGCTTCTG GTGGGCCTGC TGTTCCTGTC GGCGACGGGG ATGACGTGGT CGCAGTACGC GGGGGCCAAC ATCAGCGACC TGCGCGAGCG GCTGGACTGG AGCACCCCCG CGGTGTCCAC GGAGGCGCCG GTGGCCTCGC CCGGGGTGGA CGCGGGAGTG GGCGCCGTCC TGGCCAGTGC GCGCGAGGCG GGCCTGGACG GCCCCGTCGA GGTGGCCCTG CCCGAGGACC ACACCTCCCC CTACGTGGTG AGCCAGATCG ACCGCGGTTG GCCGACGCGG GTGGACTCGG CCGCCGTCGC CCCGGATACG GCCGAGGTCA CCGACGTGGT CCGCTTCGCC GACTACCCGG TCATGGCCAA GCTGAGCCGG TGGGGGATCG ACGCCCACAT GGGTGTGCTG TTCGGCGTGC CGAACCAGTT GGTGCTGTCC GCCCTGGCCT CGGGGCTGAT CGCCGTCATC GTGCTGGGCT ACCGCATGTG GTGGCAGCGG CGCCCCACAC GGTCCCGAGT CCTGGGCGTG GGACGCCCCT ACCCGCGCGG CTCCCTCACG GCCCTGTCCC CGCTGTCCAG GGTCGCGGCG GTGGCGGTCC TGGCCCTGGT CGGCTGGGCG GCGCCGCTGC TGGGCGCCTC GCTGCTGGTG TTCCTGGCCG TGGACGCCGT CCTCGGGTGG CGGGCCCGGT CACGTGCCTC AGGGGGCTCC GCGCCGGGCG GCCGGGCTCC GGGTGCCGAC GGGCAGGCCG GACCCGAGTC CGCGGGCGCG CCCGCGTCCG GGGCGCGGAA CCTGTGA
|
Protein sequence | MGQEDSAPPQ DGTRAEPDRR GTWAALRPLV LRLHFYAGVL VAPFILVAAV SGLLYVWTPQ IEQAVYAEQL RVEPSGEPVP LHTQVRVAQE ELPGAELDAV RPATGAEDST RVLFDVPGLE ASHRTTVFVD PYGGEVLGVM ETYGTSGALP ARTWVDTLHR SLHLGDVGRL YSELAASWMW VVALGGVALW VARNRRPRGG LRRLLLPGAG TSGGRSRSVS LHGAAGLWLL VGLLFLSATG MTWSQYAGAN ISDLRERLDW STPAVSTEAP VASPGVDAGV GAVLASAREA GLDGPVEVAL PEDHTSPYVV SQIDRGWPTR VDSAAVAPDT AEVTDVVRFA DYPVMAKLSR WGIDAHMGVL FGVPNQLVLS ALASGLIAVI VLGYRMWWQR RPTRSRVLGV GRPYPRGSLT ALSPLSRVAA VAVLALVGWA APLLGASLLV FLAVDAVLGW RARSRASGGS APGGRAPGAD GQAGPESAGA PASGARNL
|
| |
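The length fields in this record can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates, that the listed gene sequence is already the coding strand (it begins with ATG even though the strand is "-"), and that the final codon is a stop codon that is not counted in the protein length. The helper names (`gene_length`, `protein_length`, `clean_sequence`, `translate`) are hypothetical and not part of any database API. Only the first 30 bases of the gene sequence are used, to keep the example short.

```python
# Sketch: cross-check the record's length fields and translate a sequence prefix.
# All helper names here are hypothetical, chosen for illustration only.

BASES = "TCAG"
# Amino acids for NCBI translation table 11, with codons enumerated in T, C, A, G order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the last codon is a stop codon."""
    return gene_len_bp // 3 - 1


def clean_sequence(text):
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons until the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
prefix = clean_sequence("ATGGGACAGG AAGACAGCGC GCCCCCGCAG")

print(gene_length(2178625, 2180091))  # 1467
print(protein_length(1467))           # 488
print(translate(prefix))              # MGQEDSAPPQ
```

The three printed values agree with the Gene Length, Protein Length, and the first ten residues of the protein sequence given above.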