Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3433 |
Symbol | |
ID | 9247300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4109925 |
End bp | 4110962 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | aminotransferase class I and II |
Protein accession | YP_003681344 |
Protein GI | 297562370 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0324026 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.394034 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGGAGT ACATGGACGC GGGCCACGAC CTGCGCCACC ACGGTGACGC CGAGGTCGGC GGCGGGCTGC TCGACCTGGC CGTCAACGTG CGGGGCCGGA CGCCGCCCGC CTGGCTGGGG CGGCTCCTCG CCGACTCGCT GACCGGCCTG GGCGCCTACC CCGACCCCTC CCGCGCGCGG AAGGCGGTCG CCCGGCGCCA CGGACGGGAG CCGGGCGAGG TCCTGCTCAC CGCCGGGGCG GCCGAGGCCT TCGTCCTCCT GGCGCGGGTC CTGAACCCCC GGCGGGCGGT CGTCGTGCAC CCCCAGTTCA CCGAGCCCGA GGCCGCCCTG CGCGCCGCCG GGCACGCCGT GGACCGGGTG CTGCTGGAAC CGGACTTCAC CCTCGACCCC GCACTGGTCC CCGAGGACGC CGACCTGGTC GTGGTGGGCA ACCCGACCAA CCCGACCTCC GTGCTCCACC CCGGACCGGT CCTGGCCGGG CTGGCCCGCC CCGGGCGCGT GCTGGTCGTC GACGAGGCCT TCGCCGACTG CGTTCCCGGC GAAACGGAGT CGCTGGCCTC CCGCGGGGAC CTGCCGGGCC TGGTGGTGGT GCGCAGCCTC ACCAAGACCT GGTCCCTCGC CGGGCTGCGC GCCGGCTACC TGCTCGCCGA ACCCGACCTG GTGGCCAGGT TCTCCGAGGC ACAGCCCCTG TGGTCGGTGT CCACGCCCGC CCTGGTCGCG GTGGAGGCGT GCTGCAGGCC GGAGGCCCTC GCCGAGGCCG ACGCCTGGGC GACGTCCCTG ACCGAGCACC GCGACGACCT CGCCGCGGGC CTGCGGAACC TGGGCCTGCG GGTGGTCCCG GGAGCCCGGG CCTCGTTCCT GCTGGTAGCG GACCCTGAGG CGGACCGGCT GCGGGCCCGC CTCAGGGAAG GGGGGATCGC CGTCCGGCGC GGTGACACCT TTCCCGGCCT CGGCCCGGAG TGGTTCCGGG TGGCGGTCCG CGAACCCGCC GTCCACCGGG TCCTGACGGA CGCGCTGGGG GAGTTGCTCG ACCGGTGA
|
Protein sequence | MGEYMDAGHD LRHHGDAEVG GGLLDLAVNV RGRTPPAWLG RLLADSLTGL GAYPDPSRAR KAVARRHGRE PGEVLLTAGA AEAFVLLARV LNPRRAVVVH PQFTEPEAAL RAAGHAVDRV LLEPDFTLDP ALVPEDADLV VVGNPTNPTS VLHPGPVLAG LARPGRVLVV DEAFADCVPG ETESLASRGD LPGLVVVRSL TKTWSLAGLR AGYLLAEPDL VARFSEAQPL WSVSTPALVA VEACCRPEAL AEADAWATSL TEHRDDLAAG LRNLGLRVVP GARASFLLVA DPEADRLRAR LREGGIAVRR GDTFPGLGPE WFRVAVREPA VHRVLTDALG ELLDR
|
| |