Gene Information |
Locus tag | Ndas_3886 |
Symbol | |
ID | 9247757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4657174 |
End bp | 4658382 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Abortive infection protein |
Protein accession | YP_003681789 |
Protein GI | 297562815 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
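The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon. The record does not state either, but both are consistent with its values. The function name `feature_lengths` is hypothetical and used only for illustration.

```python
def feature_lengths(start_bp, end_bp, has_stop_codon=True):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is counted in the gene length but encodes no residue.
    protein_length = codons - 1 if has_stop_codon else codons
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, prot_len = feature_lengths(4657174, 4658382)
print(gene_len, prot_len)  # 1209 402
```

The result agrees with the Gene Length (1209 bp) and Protein Length (402 aa) fields.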
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.539919 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAATG TGCCGCCACC GCCTGGTTCG TCCTGGCCCG CCGCGCCCGA GGGGCCGGAG GGGCGGACTG TCGGGGCCTA CACGCAGGAG CAGACCTGGG CGGATCCTCC CGGAGGGGGG AGGCTGACGC CGCCCGGGGG CTACGCCGTG GCCGCCACCG ACCCGGACGT GTGGGCCTGG CCGCCGCCGA GGCCGGTGCG CAGGAACCAC TTCGCCCGCG CGGAACTCCC CGCGGACGAG TCCTACCACA GGCTGGGCCG CACGTCCCGC TTCCGCTGGT GGTTCACGCC GCTCACCCTG GCCGTCCTCG CGGTGCTGCT CTTCTTCCTG TGGATAGCCG TCATCCTGTC GGTGACGATC GTGGCGATCA TCAACGGCAG CGGACTGGCC CCGGACTCGG TCACCCTGAG CGTGATCGCC GAGATGGCCT TCGGGCTCCT GTCGTCGGCG CTGTTCATCC CCATCGTCCT CTTCCTGGTC CGCGTCGTGC AGTGGCGCCG GACCGGCTCC CTGTTCTCCG TCGAGGGACG GCTGCGCTGG GGGTGGCTGG CCCGCTGCAC GGCGGTGGCG GTCGTCCCGG TCGCCCTGTG CGTCGTGGCC TTCCTGCCGC TGGCCGAACT CCTCCAGCCC GACCTGGTGC CCCGCGAGGC CTCCGGCGGA ACCGAGGTGT TCGCCGCGGC GATGACCGCC ATCGTGCTGC TGGTGCCGTT GCAGTCGGCG GCCGAGGAGC TGACCCTGCG CGGCATGCTC ATGCAGCTGG TGGGCGCGCT CGGCGCCCGT CCCGACGAGC GGCGCGGCCG GTCGGCGGTC TCGCGGGTCC TGCGCTCCCC GGGTCCGGCC ATCCTGGCCA GCGGAACCCT GTTCGCGGCG CTGTACCTGG CCACGCACCC GGGCGACCCC TGGACGACCG CGGCGCTCGC GGTGATGGGG CTGGGGATGT CCTGGCTGAC CTGGCGCACC GGCGGTCTGG AGGCGGCGAT CAGCCTGCAC GTGGTCAACA GCCTCGTGCA GTTCACGCTG TGCGTGTTCG AGGGCCGCAT GGAGCAGATC GGCACGGGGG TCATGGTCGG CTCCGGCCTG CCCCTGGGCG CGGGTACGCC GCTGGTGCTG GTGCTGACCC TGATCCAGGT GGGGCTGTAC GTGCTGGCGG TGGTGTGGCT GGCGGGCCGC CGCGGGGTGC GGCGCAGGAG CGCGGCCGCC GTCCGTTAG
|
Protein sequence | MGNVPPPPGS SWPAAPEGPE GRTVGAYTQE QTWADPPGGG RLTPPGGYAV AATDPDVWAW PPPRPVRRNH FARAELPADE SYHRLGRTSR FRWWFTPLTL AVLAVLLFFL WIAVILSVTI VAIINGSGLA PDSVTLSVIA EMAFGLLSSA LFIPIVLFLV RVVQWRRTGS LFSVEGRLRW GWLARCTAVA VVPVALCVVA FLPLAELLQP DLVPREASGG TEVFAAAMTA IVLLVPLQSA AEELTLRGML MQLVGALGAR PDERRGRSAV SRVLRSPGPA ILASGTLFAA LYLATHPGDP WTTAALAVMG LGMSWLTWRT GGLEAAISLH VVNSLVQFTL CVFEGRMEQI GTGVMVGSGL PLGAGTPLVL VLTLIQVGLY VLAVVWLAGR RGVRRRSAAA VR
|
| |
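The sequences above are displayed in space-separated blocks of 10 characters. The sketch below removes that grouping, computes a GC fraction, and translates a coding sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG even though the feature is on the minus strand). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which this sketch does not handle. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGGGAAATG TGCCGCCACC GCCTGGTTCG"
print(translate(prefix))  # MGNVPPPPGS
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.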