Gene Information |
Locus tag | Ndas_5107 |
Symbol | |
ID | 9248999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 254078 |
End bp | 255295 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | aminotransferase class I and II |
Protein accession | YP_003682994 |
Protein GI | 297564021 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
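The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start/End coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions match the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 254078
END_BP = 255295


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming a trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1218 405
```

The result agrees with the listed Gene Length (1218 bp) and Protein Length (405 aa).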
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.294065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.505049 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACGCA TGACTGATCG ACCCCGTCTC TCTGCACGTA TCAGCGCGAT CTCCGAGTCC GCCACCCTGG CTGTGGACGC CAAGGCCAAG GCGATGAAGG CGGAGGGCCG TGCCGTCATC GGCTTCGGCG CGGGAGAGCC CGACTTCCCG ACCCCCGACT ACATCGTCGA GGCCGCCGTC GAGGCCGCCC GTGAGCCCCG GTTCCACCGC TACACGCCCG CCGGCGGCCT GCCCGAGCTC AAGAAGGCCA TCGCCGAGAA GACCCTGCGC GACTCCGGCT ACGAGGTCGA CCCCGCCCAG GTCCTGGTGA CCAACGGCGG CAAGCAGGCC ATCTACGAGG CCTTCGCCGC CATGCTGGAC CCGGGCGACG AGGTCATCGT CATCGCCCCG TACTGGACCA CCTACCCCGA GTCGATCAAG CTCGCGGGCG GCGTCCCGGT CTTCGTCGTC ACCGACGAGA GCACCGGCTA CCTGGCCAGT GTGGAGCAGC TGGAGGCGGC GCGCAGCGAG CGCACCAAGG TCCTGGTGTT CGTCTCCCCG TCCAACCCGA CCGGCGCCGT GTACCCGCGC GAGCAGGTCC GCGAGATCGG CCGCTGGGCG AACGAGCACG GCCTGTGGGT GCTGTCCGAC GAGATCTACG AGCACCTGGT CTACGGGGAC GCCGAGTTCT CCTCGCTGCC CGTCGAGGTG CCCGAGATCG CCGACCGCAC CGTCATCGTC AACGGCGTGG CCAAGACCTA CGCCATGACC GGCTGGCGCG TGGGGTGGAT CATCGGCCCC AAGGACGTGG TCAAGGCCGC GGGCAACCTC CAGTCGCACG CCACCTCCAA CGTCGCCAAC GTCTCGCAGG CCGCCGCCCT GGCCGCGGTC TCCGGCGACC TGGACGCCGT GGCGACGATG CGCGAGGCCT TCGACCGCCG CCGCAAGACG ATCGTGCGCA TGCTCAACGA GATCGACGGC GTGGTCTGCC CCGAGCCGCA GGGCGCGTTC TACGCCTACC CCTCGGTCAA GGGCGTGCTG GGCAAGGAGA TCCGGGGCAG GACGCCGCAG ACCTCCACCG AGCTGGCCGA GCTCATCCTG GAGCAGGCCG AGGTCGCCGT GGTGCCCGGT GAGGCCTTCG GCACCCCCGG CTACCTGCGC CTGTCCTACG CGCTCAGCGA CGAGGACCTC GCCGAGGGCG TGAGCCGCAT CCAGAAGCTG CTGGCCGAGG CCAAGTAG
|
Protein sequence | MGRMTDRPRL SARISAISES ATLAVDAKAK AMKAEGRAVI GFGAGEPDFP TPDYIVEAAV EAAREPRFHR YTPAGGLPEL KKAIAEKTLR DSGYEVDPAQ VLVTNGGKQA IYEAFAAMLD PGDEVIVIAP YWTTYPESIK LAGGVPVFVV TDESTGYLAS VEQLEAARSE RTKVLVFVSP SNPTGAVYPR EQVREIGRWA NEHGLWVLSD EIYEHLVYGD AEFSSLPVEV PEIADRTVIV NGVAKTYAMT GWRVGWIIGP KDVVKAAGNL QSHATSNVAN VSQAAALAAV SGDLDAVATM REAFDRRRKT IVRMLNEIDG VVCPEPQGAF YAYPSVKGVL GKEIRGRTPQ TSTELAELIL EQAEVAVVPG EAFGTPGYLR LSYALSDEDL AEGVSRIQKL LAEAK
|
| |
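The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be joined into a contiguous string and its GC fraction computed is shown below. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the gene-wide GC content listed in the record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence field, copied from the record.
example = "ATGGGACGCA TGACTGATCG"
seq = join_blocks(example)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full Gene sequence field through the same two functions would give the value to compare against the GC content field.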