Gene Ndas_3191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3191 
Symbol 
ID9247048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3815231 
End bp3816313 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content74% 
IMG OID 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_003681105 
Protein GI297562131 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.253345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCTTCA CCCTGAACGA CCTGCCCCTG CGCGACGACC TGCGCGGCCG CTCGCCCTAC 
GGCGCGCCGC AGCTGGACGT GCCCGTGGTG CTCAACACCA ACGAGAACCC GCACCCGCCC
TCGGCGCGCC TGGCCAAGGC GCTGGCCGAG GCGGTCGCCG ACACCGCCCT GGGCCTCAAC
CGCTACCCCG ACCGGGACGC GGTCCGCCTG CGCGAGGGCC TGGCCGCCTA TCTCGGCCAC
GGCCTGACCG CCGACCAGGT GTGGGCGGCC AACGGCTCCA ACGAGGTCCT CCAGCAGATC
CTCCAGGCCT TCGGCGGCCC CGGCCGGTCC GCCATGGGCT TCGAGCCGTC CTACTCCATG
CACCCGATCA TCTCCCGGGG CACCGGCACC GCCTGGGTGT CCGTGCCGCG CGGAGCCGAC
TTCCGCGTCG ACGTGGACGC CGCCCTGGCC GCCATCGCCG AGCACCAGCC CAGCGTCGTC
TTCCTCACCT CGCCCAACAA CCCCACCGGC ACCGCCCTGG ACCTGGCCGA CACGGAGCGC
GTCCTGGCCG CGGCCCCCGG CGTCGTGGTC GTGGACGAGG CCTACGCCGA GTTCCGCCGC
GAGGGCACGC CCAGCGCGCT GAGCCTGCTG TCCGACCACC CCAGGCTCGT CGTCTCGCGC
ACCATGTCCA AGGCCTTCGC CCTGGCCGGG GCGCGCGTGG GCTACCTGGC CGCCCACCCG
GCCGTGGTCG AGGCCCTCCA GCTGGTCCGC CTGCCCTACC ACCTGTCCGC CGTCACCCAG
ACGGTCGCGC TCACCGCGCT CGACCACGCC GACGAACTCC TCGCCGCCGT CGCCGACCTG
CGCGCCGAAC GCGACTCCCT GGTCTCCTGG CTGCGCGGGC ACGGCTTCTC GGTCGCCGAG
TCCGACGCCA ACTTCGTCCT GTTCGGCGAG TTCGAGGACC GCAGCCGCGT CTGGCAGGAC
ATGCTCGACC AGCAGGTCCT CATCCGCGAG ACCGGCCCGC CCGGGTGGCT GCGCGTCAGC
GTCGGAACCC CGCAGGAGAT GGCCGCCTTC CGCCGGGCCC TGCTCAGCGC CACCGGACGC
TGA
 
Protein sequence
MSFTLNDLPL RDDLRGRSPY GAPQLDVPVV LNTNENPHPP SARLAKALAE AVADTALGLN 
RYPDRDAVRL REGLAAYLGH GLTADQVWAA NGSNEVLQQI LQAFGGPGRS AMGFEPSYSM
HPIISRGTGT AWVSVPRGAD FRVDVDAALA AIAEHQPSVV FLTSPNNPTG TALDLADTER
VLAAAPGVVV VDEAYAEFRR EGTPSALSLL SDHPRLVVSR TMSKAFALAG ARVGYLAAHP
AVVEALQLVR LPYHLSAVTQ TVALTALDHA DELLAAVADL RAERDSLVSW LRGHGFSVAE
SDANFVLFGE FEDRSRVWQD MLDQQVLIRE TGPPGWLRVS VGTPQEMAAF RRALLSATGR