Gene Ndas_3811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3811 
Symbol 
ID9247682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4573524 
End bp4574900 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content74% 
IMG OID 
ProductBeta-Ala-His dipeptidase 
Protein accessionYP_003681714 
Protein GI297562740 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000599771 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGACGTAC GCGCCCACAT CGAGGCACAC CGGGACGAGT TCGTCTCCTC CCTCAAGGAG 
TGGCTGGCGA TCCCCTCCAT CTCCGCCGAC CCCGCGCACC ACCCCGACGT GGTCCGCTCC
GCCCGGTGGC TCGCCGACCA CCTCACCGCG ACCGGCTTCC CCACCGTCGA GGTGTGGCAG
ACCCCCGGCC TGCCCGCCGT GTTCGCCGAG TGGCCCGCCG CCGACCCCGA CGCCCCCACG
GTGCTCGTCT ACGGACACCA CGACGTCCAG CCCGTCGACC CGGTCCAGGA GTGGGAGACC
GACCCGTTCG TGCCCACCGA GCGCGGCACC TCCCTGTTCG CGCGCGGGGC CTCCGACGAC
AAGGGGCAGG TGCTCTTCCA CGCCCTCGGC GTGCGCGCCG CCCTGGCCGC ATCCGGCGCC
GACGCCCCGC CCGTCACGGT CAAGCTGCTC GTGGAGGGCG AGGAGGAGTC GGGCTCGGTC
CACTTCGCCG ACCTGATGCG CGCCAACCGC GACCGCCTGG CCTGCGACGT CGTCGTCATC
TCCGACACCA CCATGTGGGC GGCCGACACC CCGTCCATGT GCGTGGGCAT GCGCGGCGTC
ACCGACGTGG AGATCAGCCT GTACGGCCCC GAGCGCGACC TGCACAGCGG CTCCTTCGGC
GGCGCCGTGC CCAACCCGCT CAAGGCCATG AGCGACCTGC TGTCCGGCCT GCACGACGAG
GACGGCCGGG TGGCGGTCCC CGGCTTCTAC GACGGGGTGG TCGAGGCCAG CCGGGAGGAG
CGCGAACTCA TCGCCCGGCT GCCCTTCGAC GAGCGCGAGT GGCTGGCCAC CGCCGCCTCC
ACCGCCACCT GGGGCGAGAA GGGCTACAGC ACGCTGGAGC GGATCTGGCT GCGCCCGACC
GCCGAGATCA ACGGCATGTG GGGCGGCCAC ACCGGCTCGG GCGGCAAGAC CATCGTCCCT
CGCTCCGCGC ACGCCAAGGT CAGCTTCCGC CTGGTGCCCG GCCAGGACCC GCTGCACGTG
CAGGACCGCG TCCGCGCCCA CGTCGAGGCG GCCGTCCCCG AGGGTCTGCG CGCCGAGACG
GAGTTCGGCG GGCCGGGCGT GCGCGCCTGC GCCTCCGACC TGTCCTCCAC CGCGCTGAAG
GCGGCCCGCT CGGCCATGGA GCGCGCCTTC GGCACCCAGG TCCTGTTCAC CCGCGAGGGC
GGCAGCGGCC CCGAGGCCGA CATCGCCGAC ATCCTCGGGG CGCCGCTGGT CTTCCTCGCC
GTCGGCCTGG ACGAGGACCG CATCCACGCC CCCAACGAGA AGGTGGAGAT CCCCCTGCTG
CTCAAGGGGG CCGAGAGCGC CGCCTACCTG TGGGAGGAGC TCGGCGGCCT CGGCTGA
 
Protein sequence
MDVRAHIEAH RDEFVSSLKE WLAIPSISAD PAHHPDVVRS ARWLADHLTA TGFPTVEVWQ 
TPGLPAVFAE WPAADPDAPT VLVYGHHDVQ PVDPVQEWET DPFVPTERGT SLFARGASDD
KGQVLFHALG VRAALAASGA DAPPVTVKLL VEGEEESGSV HFADLMRANR DRLACDVVVI
SDTTMWAADT PSMCVGMRGV TDVEISLYGP ERDLHSGSFG GAVPNPLKAM SDLLSGLHDE
DGRVAVPGFY DGVVEASREE RELIARLPFD EREWLATAAS TATWGEKGYS TLERIWLRPT
AEINGMWGGH TGSGGKTIVP RSAHAKVSFR LVPGQDPLHV QDRVRAHVEA AVPEGLRAET
EFGGPGVRAC ASDLSSTALK AARSAMERAF GTQVLFTREG GSGPEADIAD ILGAPLVFLA
VGLDEDRIHA PNEKVEIPLL LKGAESAAYL WEELGGLG