Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4642 |
Symbol | |
ID | 9248523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5514779 |
End bp | 5515858 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_003682534 |
Protein GI | 297563560 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.937142 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGAGT CGAAATCGCC ATATCTGCGG TCGGTGCTGG AGAGCATCCC GCCCTACAAG CCGGGCAGGA AGGTCGTCGG TCCCGACGGG CGTTCGGTCA AGCTGTCCTC CAACGAGAGC CCCTACGGGC CGCTCCCGTC GGTGCGTGAG GCCATCGCCG TGGCCGCGGC CGAACTCAAC CGCTACCCGG ACCCGGGCGC CGCCGAGCTC ACCTCCGCGC TCGCCCGCCG CCTCGACGTC CCCGAGGAGC ACCTCGCCCT GGGCGCCGGT TCTGTGGGCC TGCTCCAGCA GCTGCTCGAA GCCGTCGGAG AGCCCGGCGC CGAGGTCGTC TACGCCTGGC GCTCCTTCGA GGCCTACCCG CTGCTCGCCG AACTGGCGGG GGTCACCTCG GTGCGGGTCC CGCTCCGGGA CGAGACGCAC GACCTCGACG CGATCGCAGA CGCGGTCACC GAGGACACCC GCATGGTGCT CGTGTGCAAC CCCAACAACC CCACCGGCAC CACCGTGCGC GAGGAGGAGC TGGTCGCCTT CCTGGACCGG ATCCCCGAGA GCGTCCTGGT GGTCCTGGAC GAGGCCTACC GCGAGTACGT GCGCGACCCG CGGGTGCCCG ACGGCGTCTC CCTGTACCGC GACCGGCCCA ACGTCGCCGT GCTGCGCACC TTCTCCAAGG CCTACGGGCT CGCCGCCGTA CGCCTGGGCT TCCTCGTGGG GCACCCCCCG GTGACCGCCG CGGTCCGCAA GACCCTCGTC CCGTTCGCGG TGAACCACCT CGCCCAGGCC GCCGGGATCG CCTCCCTGGC CGCCGAGGGG GAGCTGCTGG AGCGCGTGGC CGCCACCGTC GAGGAGCGCG GGCGGGTGCG CGACGCGCTC ATCGCGTCCG GGTGGACGGT CCCGCCGACC GAGGCCAACT TCGTGTGGCT TCGGGTGGAC GAGGACACGC TCGACTTCGC CGAGGCGTGC GCGCGTGAGG GCGTCTCCGT GCGCCCGTTC GCGGGGGAGG GCGCCCGGGT GAGCCTGGGC ACCCCCGAGG AGAACGACGC GTTCCTGGCC GTGGCCACCT CCTACGGCAA GCGCCGTTAG
|
Protein sequence | MSESKSPYLR SVLESIPPYK PGRKVVGPDG RSVKLSSNES PYGPLPSVRE AIAVAAAELN RYPDPGAAEL TSALARRLDV PEEHLALGAG SVGLLQQLLE AVGEPGAEVV YAWRSFEAYP LLAELAGVTS VRVPLRDETH DLDAIADAVT EDTRMVLVCN PNNPTGTTVR EEELVAFLDR IPESVLVVLD EAYREYVRDP RVPDGVSLYR DRPNVAVLRT FSKAYGLAAV RLGFLVGHPP VTAAVRKTLV PFAVNHLAQA AGIASLAAEG ELLERVAATV EERGRVRDAL IASGWTVPPT EANFVWLRVD EDTLDFAEAC AREGVSVRPF AGEGARVSLG TPEENDAFLA VATSYGKRR
|
| |