Gene Ndas_3001 details


Gene Information

Locus tag: Ndas_3001
Symbol:
ID: 9246854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3586619
End bp: 3588223
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: histidine ammonia-lyase
Protein accession: YP_003680917
Protein GI: 297561943
COG category:
COG ID:
TIGRFAM ID:
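
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the numbers are consistent with that convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3586619, 3588223)
print(length)                     # 1605
print(protein_length_aa(length))  # 534
```

Both results agree with the Gene Length and Protein Length fields of the record.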


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.463238
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.112166
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACCCT CGTGTCAGCC CTTCACCAGG GGTCACGCGG GGGGTCAAAT GGGAGGCATG 
TCCACAGCAG TTCCTTCCGT CGTCGTCGGC GATGCGCCGC TCACCCCGGC CCAGATCCTC
GACGTGGCCC GCCACGGCGC CCGTGTCACC CTGTCCGAGC AGGCCCGAAA GGCCCTCCAC
CACGGCCGCG AACGGGTCGA GTCGCTCGCC CGCGGCGAGG TCCCCGCCTA CGGGGTCAGC
ACCGGCTTCG GCGCGCTCGC CACCCGCCAC ATCGCCCCCG ACCTGCGGGC CCGCCTCCAG
CGTTCGCTCA TCCGCTCGCA CGCCGCCGGG ACCGGACCCG AGGTGGAGGA CGAGGTCGTG
CGCGCCCTCA TGCTGCTGCG CCTGCGCACC CTGGCCTCGG GCAACACCGG CGTGGAGGTC
GCCACCGCCG AGACCCTCGC CGCGCTGCTC AACGCCCGCA TCACCCCGGT CGTGCACGAG
TACGGCAGCC TGGGCTGCTC GGGGGACCTG GCGCCCCTGT CGCACGTGGC GCTGGCCCTG
ATGGGCGAGG GCCGGGTCCG CGACGCCGCC GGGCGGGACC TGCCCGCCTA CACCGCCCTG
CACGAGGCCG GGATCCGCCC GGTCGAACTG GGCGCCAAGG AGGGCCTGGC GCTGATCAAC
GGCACCGACG GCATGCTCGG CATGCTCGTG CTGGCCTGCA TGGACCTGGA GCGCCTGCTC
AAGGCCGCCG ACATCACCGC CGCCATGAGC GTCGAGGCGC TGCTGGGCAC CGACCGCGTC
TTCGCCGAGG AGCTCCAGCG CCTGCGCCCC CACCCCGGCC AGGCCGCCTC CGCGGCCAAC
CTGCGCGCCC TGCTCGACTC CTCGCCCATC GTCGCCTCCC ACCGCGGCCC CGACTGCAAC
CGGGTCCAGG ACGCCTACTC GCTGCGCTGC GCCCCGCAGG TGGCCGGCGC CGCCCGCGAC
ACCCTCGCCC ACGCGCTGCT GGTGGCCGGA CGCGAACTCG ACAGCGTCAT CGACAACCCC
GTGGTCCTGG ACGACGGGCG GGTGGAGTCC AACGGCAACT TCCACGGCGC GCCCGTGGCC
TACGTGCTCG ACTTCCTGGC CATCGCCGTC GCCGACACCG CCTCCATCGC CGAGCGGCGC
ACCGACCGCA TGCTCGACGT GTCCCGCTCC CACGGCCTGC CCGCCTTCCT GGCCGACGAC
CCCGGCGTGG ACTCCGGCCA CATGATCGCC CAGTACACGC AGGCCGCCAT CGTCTCCGAG
CTCAAGCGCC TGGCCGTGCC CGCCAGCGTC GACTCCATCC CCAGCTCGGC CATGCAGGAG
GACCACGTGT CCATGGGCTG GTCGGCCGCC CGCAAGCTGC GCCGCGCCGT GGACGGGCTG
ACCAGCGTGC TGGCGGTGGA GCTGCTCACC GCCGCCCGCG CCCTGGACCT GCGCTCGCCG
CTGGAGCCCG GCCCCGCCAC CGGCGCCGTG CTGCGCACCG TACGGGAGAA GGTCTCCGGC
CCCGGCCCCG ACCGCCACCT GGCCCCCGAG ATCGCCGCCG TCGCCGCCCT GATCACCGAC
GGCTCCGTGG TCGCAGCCGC CGAGTCCGTC GTCCCCCTGG CCTGA
 
Protein sequence
MEPSCQPFTR GHAGGQMGGM STAVPSVVVG DAPLTPAQIL DVARHGARVT LSEQARKALH 
HGRERVESLA RGEVPAYGVS TGFGALATRH IAPDLRARLQ RSLIRSHAAG TGPEVEDEVV
RALMLLRLRT LASGNTGVEV ATAETLAALL NARITPVVHE YGSLGCSGDL APLSHVALAL
MGEGRVRDAA GRDLPAYTAL HEAGIRPVEL GAKEGLALIN GTDGMLGMLV LACMDLERLL
KAADITAAMS VEALLGTDRV FAEELQRLRP HPGQAASAAN LRALLDSSPI VASHRGPDCN
RVQDAYSLRC APQVAGAARD TLAHALLVAG RELDSVIDNP VVLDDGRVES NGNFHGAPVA
YVLDFLAIAV ADTASIAERR TDRMLDVSRS HGLPAFLADD PGVDSGHMIA QYTQAAIVSE
LKRLAVPASV DSIPSSAMQE DHVSMGWSAA RKLRRAVDGL TSVLAVELLT AARALDLRSP
LEPGPATGAV LRTVREKVSG PGPDRHLAPE IAAVAALITD GSVVAAAESV VPLA
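
The sequences above are printed in space-separated groups of ten, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence relates to the protein sequence and to the GC content field. It uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical and not part of any database API.

```python
from itertools import product

# NCBI-style codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; the last line starts in frame
# because the 1560 bases before it are a multiple of 3.
first_line = "ATGGAACCCT CGTGTCAGCC CTTCACCAGG GGTCACGCGG GGGGTCAAAT GGGAGGCATG"
last_line = "GGCTCCGTGG TCGCAGCCGC CGAGTCCGTC GTCCCCCTGG CCTGA"

print(translate(first_line))  # MEPSCQPFTRGHAGGQMGGM
print(translate(last_line))   # GSVVAAAESVVPLA
print(round(gc_content(first_line), 1))
```

The two translated fragments match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives the value to compare with the GC content field of the record.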