Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4171 |
Symbol | |
ID | 9248045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4982234 |
End bp | 4983364 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | phosphoserine aminotransferase |
Protein accession | YP_003682072 |
Protein GI | 297563098 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.309749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCGTGA CCGACGAGAT TCAGATCCCC GCGAACCTGC TGCCCTCCGA CGGCCGTTTC GGCAGCGGCC CGTCCAAAGT CCGCCCCGCC CAGATCGAGG CCCTGGCCGC CTCCGGCTCC CGCTACATGG GCACCTCCCA CCGGCAGAAG CCGGTCAAGT CCCTGGTCTC CCGGGTCCGC TCCGGGGTGA GCGAGCTGTT CTCCCTCCCC GACGGCTACG AGGTGGTCCT CGGCAACGGC GGCACCACCG CCTTCTGGGA CATCGCCGCG CACGGCCTGC TGCGCGAGAA GTCCCAGCAC CTCGCCTTCG GCGAGTTCTC CAGCAAGTTC GCGAAGGTCG CCAAGGGCGC GCCCTGGCTC CAGGAGCCCA CCGTCATCAG CACCGACCCG GGCAGCCACA GCGAGCCCGC GGCCGAGGCC GGCGTGGACG TGTACGCGCT GACCCACAAC GAGACGTCCA CCGGTGTGGC CGCGCCCATC AGGCGCGTGG CGGGCGCCGA CGAGGACGCG CTCGTCCTCG TGGACGCCAC CAGCGGCGCG GGCGGCCTGC CGGTCGACAT CGCCGAGACC GACGTCTACT ACTTCGCGCC GCAGAAGAGC TTCGCCGCCG ACGGCGGGCT GTGGCTGGCC GTCATGTCGC CCCGGGCCCT GGCCCGCGTG GAGGAGATCG CGGCGAGCGG CCGCTACGTG CCGGAGTTCT TCTCGCTGCC CACGGCGATC GACAACTCCC GCAAGGACCA GACCTACAAC ACCCCGGCCG TGGCCACGCT GCTGCTGCTC GCCGAGCAGC TGGAGTGGAT GAACGGCCAG GGCGGCCTGG AGTGGACCGT GGCGCGCACC GCCGAGTCCT CCTCGGTCCT CTACGACTGG GCGGAGAAGT CGCCGGTCGC GACGCCGTTC GTCACCGACC CGTCCAAGCG CTCTCAGGTG GTCGGCACCA TCGACTTCAG CGACGACGTG GACGCCGCCG CGGTGGCCCG GGTCCTGCGC GCCAACGGCG TGGTCGACAC CGAGCCCTAC CGCAAGCTGG GCCGCAACCA GCTGCGCGTG GCCATGTTCC CGGCGATCGA CCCGGACGAC GTGCGCGCGC TCACCGAGTG CGTGGACCAC GTGCTCACCG AGCTGTCCTG A
|
Protein sequence | MVVTDEIQIP ANLLPSDGRF GSGPSKVRPA QIEALAASGS RYMGTSHRQK PVKSLVSRVR SGVSELFSLP DGYEVVLGNG GTTAFWDIAA HGLLREKSQH LAFGEFSSKF AKVAKGAPWL QEPTVISTDP GSHSEPAAEA GVDVYALTHN ETSTGVAAPI RRVAGADEDA LVLVDATSGA GGLPVDIAET DVYYFAPQKS FAADGGLWLA VMSPRALARV EEIAASGRYV PEFFSLPTAI DNSRKDQTYN TPAVATLLLL AEQLEWMNGQ GGLEWTVART AESSSVLYDW AEKSPVATPF VTDPSKRSQV VGTIDFSDDV DAAAVARVLR ANGVVDTEPY RKLGRNQLRV AMFPAIDPDD VRALTECVDH VLTELS
|
| |