Gene Ndas_4642 details

Gene Information

Locus tag: Ndas_4642
Symbol:
ID: 9248523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5514779
End bp: 5515858
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: histidinol-phosphate aminotransferase
Protein accession: YP_003682534
Protein GI: 297563560
COG category:
COG ID:
TIGRFAM ID:
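
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(5514779, 5515858)
print(length)                  # 1080
print(protein_length(length))  # 359
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.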


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.937142
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGAGT CGAAATCGCC ATATCTGCGG TCGGTGCTGG AGAGCATCCC GCCCTACAAG 
CCGGGCAGGA AGGTCGTCGG TCCCGACGGG CGTTCGGTCA AGCTGTCCTC CAACGAGAGC
CCCTACGGGC CGCTCCCGTC GGTGCGTGAG GCCATCGCCG TGGCCGCGGC CGAACTCAAC
CGCTACCCGG ACCCGGGCGC CGCCGAGCTC ACCTCCGCGC TCGCCCGCCG CCTCGACGTC
CCCGAGGAGC ACCTCGCCCT GGGCGCCGGT TCTGTGGGCC TGCTCCAGCA GCTGCTCGAA
GCCGTCGGAG AGCCCGGCGC CGAGGTCGTC TACGCCTGGC GCTCCTTCGA GGCCTACCCG
CTGCTCGCCG AACTGGCGGG GGTCACCTCG GTGCGGGTCC CGCTCCGGGA CGAGACGCAC
GACCTCGACG CGATCGCAGA CGCGGTCACC GAGGACACCC GCATGGTGCT CGTGTGCAAC
CCCAACAACC CCACCGGCAC CACCGTGCGC GAGGAGGAGC TGGTCGCCTT CCTGGACCGG
ATCCCCGAGA GCGTCCTGGT GGTCCTGGAC GAGGCCTACC GCGAGTACGT GCGCGACCCG
CGGGTGCCCG ACGGCGTCTC CCTGTACCGC GACCGGCCCA ACGTCGCCGT GCTGCGCACC
TTCTCCAAGG CCTACGGGCT CGCCGCCGTA CGCCTGGGCT TCCTCGTGGG GCACCCCCCG
GTGACCGCCG CGGTCCGCAA GACCCTCGTC CCGTTCGCGG TGAACCACCT CGCCCAGGCC
GCCGGGATCG CCTCCCTGGC CGCCGAGGGG GAGCTGCTGG AGCGCGTGGC CGCCACCGTC
GAGGAGCGCG GGCGGGTGCG CGACGCGCTC ATCGCGTCCG GGTGGACGGT CCCGCCGACC
GAGGCCAACT TCGTGTGGCT TCGGGTGGAC GAGGACACGC TCGACTTCGC CGAGGCGTGC
GCGCGTGAGG GCGTCTCCGT GCGCCCGTTC GCGGGGGAGG GCGCCCGGGT GAGCCTGGGC
ACCCCCGAGG AGAACGACGC GTTCCTGGCC GTGGCCACCT CCTACGGCAA GCGCCGTTAG
 
Protein sequence
MSESKSPYLR SVLESIPPYK PGRKVVGPDG RSVKLSSNES PYGPLPSVRE AIAVAAAELN 
RYPDPGAAEL TSALARRLDV PEEHLALGAG SVGLLQQLLE AVGEPGAEVV YAWRSFEAYP
LLAELAGVTS VRVPLRDETH DLDAIADAVT EDTRMVLVCN PNNPTGTTVR EEELVAFLDR
IPESVLVVLD EAYREYVRDP RVPDGVSLYR DRPNVAVLRT FSKAYGLAAV RLGFLVGHPP
VTAAVRKTLV PFAVNHLAQA AGIASLAAEG ELLERVAATV EERGRVRDAL IASGWTVPPT
EANFVWLRVD EDTLDFAEAC AREGVSVRPF AGEGARVSLG TPEENDAFLA VATSYGKRR
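
The relationship between the gene sequence, the translation table, and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own method. It assumes the 10-base grouping spaces are purely cosmetic, that translation table 11 assigns the same amino acids as the standard code, and that the first codon is read as M when it is one of the initiation codons NCBI lists for table 11. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for table 11; GTG is the one used by this gene.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGTCGGAGT CGAAATCGCC ATATCTGCGG"))  # MSESKSPYLR
```

Passing the full gene sequence to `translate` and `gc_content` allows the output to be compared with the Protein sequence block and the GC content field.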