Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2729 |
Symbol | |
ID | 9246580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3264638 |
End bp | 3266551 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Protein of unknown function DUF1998 |
Protein accession | YP_003680648 |
Protein GI | 297561674 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0213066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.652268 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAAGG AGACGCACCG GCGCAGGGTC GGCTCGGTCC GGCCCAGCCA CCTGATGTTC ACCAGCGGCG TGGGGTCCCT GGTGGACCTG CCCAACTTCG CGGTGCTGGT GCGCGGGCTG GACGAGTGGA ACCACACCCA CGCCTACGGC TGGGAGCCCA TCGTCGAGCC CCGGCTGCTC AAGGTGATCC AGGAGCAGCC GAGCCACCGC AACGTGAAGG AGCTGCGCCC GGCGCCGTGG ACCGAGGGGC TGGAGCGCGA CCCCGGCGGA CCCGCCGCCG GGGTGGGCGT GCCCACCACC CCGTTCCCGT CCTGGTTCCG GTGCACCTCC TGCGACGAGC TGGTGGCCCT GGACTCGGAC ATGCTCGCCT TCGAGAACAC CAACCCGCGC AGGCCGCACG AGGCCCGCTT CGTGCACAAC GTCGGCAAAC ACAAGAAGGG CAAGCCGCTG GCGGTCCCCG CCCGGTTCGT GCTGGCGTGC ACCGACGGGC ACCTGGACGA GTTCCCCTAC GTCCACTTCG TGCACCGGGG GGAAGCGTGC CCCAGGGCCG AGAAGCCGCA GCTGAAGATG GAGGACCGGG GCGGGAACGT CGGCGCGAAC GTGGAGCTGA CGTGCGTGGT GTGCGGCGCG CACCGCAACA TGCGCGACGC GGGCGGTGCG CGGGGTAAGG AGAACCTGCC GGCGTGCCGA GCACGCCACC CCCACCTGGG CGTGTTCGAC CCGGAGGGAT GCAGTCAAAA TCCCAAGACC CTGGTGCTGG GCGCGTCCAA CCAGTGGTTC TCGGAGCTGC TGTCGACGCT GGCGGTCCCC TCCGGCCAGG GCACGGGCGA ACTGGACTCC CTGGTGGAGC AGTACTGGGA CATGCTGGAG GAGACGCCCC AGAGCCAGTA CAAGATCATG CGGCAGTTCG CGCCGCCCAT GCGCGACCAC TTCGGCAAGT GGGACGACGA CACCGTGTAC GAGGCGGTGG AGCGGCGCCG CGCGGTCCTG GAGGGGAAGG CCGGGGACGG CGGGAAGGAC GCGCCCTCGG GCCGCCAGGC CCTGCGCACC GCCGAGTGGG AGGCGCTGTC CTCCCCCGAC GCCCACGAGC CCCGGCCCGA CTTCGCGCTG CGCCGCCTGG AGGGCGGCGT GCCCGAGGAG CTGCGGGGCG TGTTCGCCGA CGTGGTGCAG GCCGAACGGT TGCGCGAGGT GCGGGTGCTG ACCGGGTTCA CCCGCCTGGA CTCCCCCGAC CTGGACGACC CGATGATGGT GCAGACGGTG CGGCTGTCGC GCGACGAGGC GACCTGGCTC CCGGCCAGCG AGGTGCGCGG CGAGGGCGTC TTCCTGCGGG TCCCGGAGGA GCTGCTGGCC GCCTGGGAGA AGCGGGTGGC CGACAGCGAG GCCCTGGAGC TGCACCGGGA GGCCTACGGC ACCTTCCGGG AGAAGCGCTA CTCCGACCGG GTCGGCTCGG GGTTCGAGCG GATGCGCAAC TGGCCGGGCG CGCGCTACGT CGCCCTGCAC ACCCTGTCGC ACCTGCTGAT CCGGACCATC GCCCTGGACT GCGGGTACAA CGCGGCGAGC CTGTCCGAGC GCGTCTACGC CGGGACCGAG GAGGACCCGC GCGGGGGCAT CCTCATCTAC ACGGCGGTGC CCGACGCCGA GGGGACGCTG GGCGGTCTGG TCTCGCAGGC CGAGCCGGAG CGGCTCGTAC ACCTGGTGCG CAAGGCCCTG CACGGGGCCA TGCACTGCTC CTCGGATCCG CTGTGCGCCG AGCGCCTGCC GCAGGCGAAC GCGGACTTCC TGCACGGGGC GGCCTGCCAC GTGTGCCTGT TCGTGTCCGA GACGACCTGC GAGCACGGCA ACCGGTTCCT GGACCGCAGG TTCGTGGTGC CGATCGGGGA TCCGGAGCTG GCCCTCTACC CTGAGCTTCC GTGA
|
Protein sequence | MHKETHRRRV GSVRPSHLMF TSGVGSLVDL PNFAVLVRGL DEWNHTHAYG WEPIVEPRLL KVIQEQPSHR NVKELRPAPW TEGLERDPGG PAAGVGVPTT PFPSWFRCTS CDELVALDSD MLAFENTNPR RPHEARFVHN VGKHKKGKPL AVPARFVLAC TDGHLDEFPY VHFVHRGEAC PRAEKPQLKM EDRGGNVGAN VELTCVVCGA HRNMRDAGGA RGKENLPACR ARHPHLGVFD PEGCSQNPKT LVLGASNQWF SELLSTLAVP SGQGTGELDS LVEQYWDMLE ETPQSQYKIM RQFAPPMRDH FGKWDDDTVY EAVERRRAVL EGKAGDGGKD APSGRQALRT AEWEALSSPD AHEPRPDFAL RRLEGGVPEE LRGVFADVVQ AERLREVRVL TGFTRLDSPD LDDPMMVQTV RLSRDEATWL PASEVRGEGV FLRVPEELLA AWEKRVADSE ALELHREAYG TFREKRYSDR VGSGFERMRN WPGARYVALH TLSHLLIRTI ALDCGYNAAS LSERVYAGTE EDPRGGILIY TAVPDAEGTL GGLVSQAEPE RLVHLVRKAL HGAMHCSSDP LCAERLPQAN ADFLHGAACH VCLFVSETTC EHGNRFLDRR FVVPIGDPEL ALYPELP
|
| |