Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4769 |
Symbol | |
ID | 9248652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5659722 |
End bp | 5660789 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | amino acid-binding ACT domain protein |
Protein accession | YP_003682659 |
Protein GI | 297563685 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTTT CCGAAGGACA GCACGGAACC GACGACCACC GCCACGGCTT CTTCGGCCGC GAGGCACTCG ACCTGGGCAC CCTGCTCCTC GCCGCCGGTG CGGCGCACCT GGTGGTGCTC TCCCTCGGGC ACAGCGACGC GGGCGTCCGC GTCCTGATCA CCGTGGGACT GCTGCTGCTC GCGGTCTCCG CGGTCCACCG GTGGCGCCGC CACAAGGCCG CGTCCGCTCC CCGGCCGCCC AGGGGCTCCG GAGCGGTGAA CGCGTCCGGG AGCGCCGGGC CGCCCACGGG TACCGGCCCG TACGCGAACG GCGGGCTGAC CGGCGGCGAC GGTGCGTCCC GTGGCACCGG GACATCCGAG GACACGGCGC TGCCCGGGAG CGCCGGGCTG CCCGGGAACG CCGGGTCGGC CGGCGGCGGA TCCTTCCGAA GCGCCCCCTC GACCGGGGAG GGAGCCCCGG CGTCGGCCGG TTCCCCGGCG GGTGGGGAAC CGGTGCGCGC CCCGTCCGAC GACCTGCTGT GGAGCGTCCG CGCGACGGTC GCCGACGTGC CCGGCGGCCT GGCCGCGCTC ACCGCGCGGT TCGCCGCCCT CGGGATCGAC ATCCGGCTCA TGCAGGTGCA CCCGGCGGGG CCGGACGCCG TGGACGAGTT CTTCGTCAGC GCTCCCGCGC ACGTGGGAGA GGGCGACCTG TACACCGCCG TACGGGAGGC GGGCGGACGC GAGGCCGCCG TGCGCCGCGC CGACGTCCAC GAGCTCAGCG ACACCACCAG CCGCACGCTC GCCCTGGTCA GCGCCCTGGT CACCGGGGCG ACCACGCTGG AGCGCTCGCT GCTCTCCCTG GCCTCGGCGC GGGCCGTGGA GCACACCGCC GAACCGCCCG CCGGAACGGT CCGCGAGGAC CTGTCCGGCA CGGTGATGAC CCTTCCGGCA CCCGACGGCG GCGTCCTGAC CGTCCGCCGG GAGGTCATCC CCTTCACCGC CGTGGAGTTC GCCCGGTGCC GGGCCCTGGC CCACGTCGCC TCGTCCCTGC ACGCGCGTTC GCACGGCCCG GGACCCGGGA GGCGCTGA
|
Protein sequence | MDVSEGQHGT DDHRHGFFGR EALDLGTLLL AAGAAHLVVL SLGHSDAGVR VLITVGLLLL AVSAVHRWRR HKAASAPRPP RGSGAVNASG SAGPPTGTGP YANGGLTGGD GASRGTGTSE DTALPGSAGL PGNAGSAGGG SFRSAPSTGE GAPASAGSPA GGEPVRAPSD DLLWSVRATV ADVPGGLAAL TARFAALGID IRLMQVHPAG PDAVDEFFVS APAHVGEGDL YTAVREAGGR EAAVRRADVH ELSDTTSRTL ALVSALVTGA TTLERSLLSL ASARAVEHTA EPPAGTVRED LSGTVMTLPA PDGGVLTVRR EVIPFTAVEF ARCRALAHVA SSLHARSHGP GPGRR
|
| |