Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1800 |
Symbol | |
ID | 9245650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2200618 |
End bp | 2201778 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | putative transposase IS891/IS1136/IS1341 family |
Protein accession | YP_003679734 |
Protein GI | 297560760 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.31256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.500943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGACGG GGTTTCGGTA CCGGCTCGCG CTCACCGACG AGCAGGCCGA GTCGTGCCAG GTCTATGGCG ATATCTGTCG GGCGGTGTGG AACACCGGCC TTGACCAGCG CCGTGAGGCC GTGTCCCGTT GGCGGCGCGG CCAGCGCCTG CCCTACTGCG GCTACCACCT CCAGGCCCGT GAACTGGCCG AGGCCAAGAC CGAGGAGACC TGGCTTCAGG CCGCCCCCTC CCACATCCTC CAGCAGACCC TGCGCGACCT GGACCGGGCC TGCCGGGACC ACGGCACGTT CAAGGTGCGA TGGCGTGCCA AGGACCGGTG GAAGCCCTCC TTCCGCTTCC CCGACAGCGC ATGGATGAAG GTCGAACGCC TGGGCCGCAG GTGGGCACGG GTCGAGCTGC CCAAGCTGGG GTGGGTACGG TTCCGCTGGT CGCGCGCACC CCGGGGCACG GTCCGTTCGG CCACCGTCTC CCGCGACGGG GCCTACTGGT ACGTGTCGCT GTTGTGCGAG GACGGCCAGG CCACACCGGA GGCACATGAG CGTCCCGACA GCGCGGTAGG TGTGGACCGG GGTGTGGCGG TGGCGGTGGC CACCAGCGAT GGAGACCTGC TCGACCAGGT GTTCCAGACC CCGAAGGAGG CCGAGCGCGA ACGCCGTCTG CGCCGACGCC TGTGCCGCCA ACGCAAGGGG TCGGCGAACC GGGCCAGAAC CAGGGCCGCC CTGTCCGCGT TGACCGGGCG GGTGCGGGCT CGGCGCTCCG ATTTCGCCGC CCAGACCGCG CACACGCTGT GTGCCAAGAA CGCGGTCGTG GTGCTGGAAA AGCTCAACAC CACGAACATG ACGGCCTCCG CGAAAGGCAC TGTTGAGGTG CCCGGCGTCA GCGTGCGTCA GAAGGCGGGG TTGAATCGGG CGATTCTGGC CAAGGGCTGG CACGGCCTCA AGCTGGCCTG TCACAACGCG GCCCGGCGCA CCGGGACCCG GATCGTGGAG GTTGATCCCG CGTACACGTC CCAGACCTGT CACTCGTGCG GATACGTCGC GGCGGAGAAC CGAGAGAGCC AATCGGTCTT CTGCTGCGGC AGGTGCGGGC ACACAGCGCA TGCGGACGTG AACGCGGCCC AGAACATTCT CACGCGCGGA TGGACTAGCC CTTCGGGGTG A
|
Protein sequence | MLTGFRYRLA LTDEQAESCQ VYGDICRAVW NTGLDQRREA VSRWRRGQRL PYCGYHLQAR ELAEAKTEET WLQAAPSHIL QQTLRDLDRA CRDHGTFKVR WRAKDRWKPS FRFPDSAWMK VERLGRRWAR VELPKLGWVR FRWSRAPRGT VRSATVSRDG AYWYVSLLCE DGQATPEAHE RPDSAVGVDR GVAVAVATSD GDLLDQVFQT PKEAERERRL RRRLCRQRKG SANRARTRAA LSALTGRVRA RRSDFAAQTA HTLCAKNAVV VLEKLNTTNM TASAKGTVEV PGVSVRQKAG LNRAILAKGW HGLKLACHNA ARRTGTRIVE VDPAYTSQTC HSCGYVAAEN RESQSVFCCG RCGHTAHADV NAAQNILTRG WTSPSG
|
| |