Gene Information |
Locus tag | Ndas_1411 |
Symbol | |
ID | 9245261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1730333 |
End bp | 1731520 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | protein of unknown function UPF0027 |
Protein accession | YP_003679349 |
Protein GI | 297560375 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
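The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state this) and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1730333
end_bp = 1731520
reported_gene_length_bp = 1188
reported_protein_length_aa = 395

# Assumption: coordinates are 1-based and inclusive on both ends,
# so the span covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced | No")
# and its final codon is a stop codon that adds no amino acid.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("gene length (bp):", gene_length_bp)
print("length divisible by 3:", gene_length_bp % 3 == 0)
print("expected protein length (aa):", expected_protein_aa)
print("matches reported values:",
      gene_length_bp == reported_gene_length_bp
      and expected_protein_aa == reported_protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.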
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0164259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTACA CCGAGGTTTC CGGGGAACGC GTCCCCATCC GCATGTGGGC CGCGCCCGAC GAGGTCGAGG CCGCGGCCAT GGAGCAGCTG CGCAACGTCA CCCGCGTGCC GTGGGTGCAC GGACTGGCCG TCATGCCCGA CGTGCACTAC GGCAAGGGCG CCACCGTCGG ATCGGTCATC GCCATGCGCG ACGCGGTCAG CCCCGCCGCC GTCGGCGTGG ACATCGGCTG CGGCATGACC GCCGTGCGCA CCGACCTGAC CGCCGAGCGC CTGCCCGACG ACCTCCGGCG GCTGCGCTCC GCCCTGGAGG CCGTGATCCC GGTCGGCTAC CACGCCCACG ACGAACCGGT CGACCCCGCC GCCGTCCCCA CCCTGCGCGA GGCCGACTGG TCCGGCTTCT GGGCGGGGTT CGACGCCCTG GCCGACGCCG TGCGCCCCCG CCGCGAGCGC GCCCTGCGCC AGATGGGCAC CCTCGGCGGC GGCAACCACT TCCTGGAGGT GTGCCTGGAC GACGGCGGCG CCGTGTGGCT GGTGCTGCAC TCCGGGTCGC GCAACATCGG CAAGGAGCTG GCCTCCCACC ACATCGAGCG GGCCCAGGCG CTGCCGCACA ACCAGGACCT GCCCGACCGC GACCTGGCGG TGTTCGTCAC CGGCACGCCC GAGATGGACG ACTACCGCCG GGACCTGTTC TGGGCCCAGG AGTACGCGCG GCGCAACCGC GACGTCATGA TGGGCCTGGC CTGCCGCACC CTGGCCGAGC AGGTCCCGGG CACCCGCTTC GAGCAGTGGA TCTCATGCCA CCACAACTAC GTGGCCGAGG AGACCTACGA CGGGGTGGAC GTGCTCGTCA CCCGCAAGGG CGCCATCCGC GCCGGTAAGG GCGACCTCGG GATCATCCCG GGATCCATGG CGACGGGCAC CTACATCGTG CGCGGACTGG GCAACCCGGC CTCGTTCAAC TCCGCCTCGC ACGGGGCGGG ACGGCGGATG AGCCGGAACA AGGCCCGTAG GACCTTCACC GAAGCCGACC TGGTCGAGCA GACCAGGGGA GTGGAGTGCC GCAAGGACCG GGGCGTCGTG GACGAGATCC CCGCCGCCTA CAAGGACCTG GAGTCGGTGA TCGAGGCCCA GGCCGACCTG GTCGAGGTGG TCGCCCACCT GCGCCAGGTG GTGTGCGTCA AGGGCTGA
|
Protein sequence | MPYTEVSGER VPIRMWAAPD EVEAAAMEQL RNVTRVPWVH GLAVMPDVHY GKGATVGSVI AMRDAVSPAA VGVDIGCGMT AVRTDLTAER LPDDLRRLRS ALEAVIPVGY HAHDEPVDPA AVPTLREADW SGFWAGFDAL ADAVRPRRER ALRQMGTLGG GNHFLEVCLD DGGAVWLVLH SGSRNIGKEL ASHHIERAQA LPHNQDLPDR DLAVFVTGTP EMDDYRRDLF WAQEYARRNR DVMMGLACRT LAEQVPGTRF EQWISCHHNY VAEETYDGVD VLVTRKGAIR AGKGDLGIIP GSMATGTYIV RGLGNPASFN SASHGAGRRM SRNKARRTFT EADLVEQTRG VECRKDRGVV DEIPAAYKDL ESVIEAQADL VEVVAHLRQV VCVKG
|
| |