Gene Ndas_2001 details

Gene Information

Locus tag: Ndas_2001
Symbol:
ID: 9245851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2422049
End bp: 2423233
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003679933
Protein GI: 297560959
COG category:
COG ID:
TIGRFAM ID:
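
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends
    with a single stop codon (both are assumptions, not stated
    in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2422049, 2423233)
print(gene_len, protein_len)  # 1185 394
```

Under these assumptions the result agrees with the reported gene length of 1185 bp and protein length of 394 aa.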


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.781234
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACCC AGCAGCACTG GACCACCACA CCCCTCACCG CGGACCTCCT GCGCGGCGCC 
CTCGACCTGG AGCGCACCGA GCGCGGCGTG CTGCCGCACC GGCTGCCCGC GCGGGCCCGC
GCCCAGTGCG GGGACCCGCA GCTGCTCATG GCCGAGTCCC AGCCCTCCGG GGTACGGCTG
GCCCTGCGGA CCCGGGCCAC CGCCCTCGAA CTCGACACGC TGCCCACCAA GCGCGCCTAC
ACGGGGGCCC CGCCCCGCCC GGACGGCGTG TACGACCTGC TCGTGGACGG ACGCCTCGTC
GACCAGGCCA CTGCGACCGG CGGCGACACC CTGACCGTCG ACATGGCCAC CGGCTCGGTC
GAGCACCGGC CCGGACCCGT GGCCACCGTG CGCTTCACCG ACTTGCCCGA CCGCGTCAAG
GACGTCGAGA TCTGGCTGCC GCACAACGAG ACCACCGAAC TCGTCGCCCT GCGCACCGAC
GCGGCCGTCG AACCCGCACC CGACCGGGGA CGCAGGGTGT GGCTGCACCA CGGCAGCTCC
ATCAGCCACG GCTCCAACGC CGCCAGCCCC ACCACCGTCT GGCCCGCGCT GGCCGCCTCC
CTCGGCGGTG TGGAACTGGT CAACCTCGGC CTGGGCGGCA GCGCCCTGCT CGACCCGTTC
ACGGCCCGCG CCATGCGCGA CGCCCCCGCC GACCTGATCA GCCTCAAGAT CGGCATCAAC
CTGGTCAACG CCGACCTGAT GCGCCTGCGC GCCTTCGGTC CCGCGGTGCA CGGCTTCCTG
GACACCCTCC GGGAGGGGCA CCCCAGCACA CCGCTGCTGG TCGTCTCGCC GATCCTGTGC
CCCATCCACG AGGACACACC CGGCCCGTGC GCCTTCGACC CGGACTCCCT GCGCGCCGGG
GAACTGCGGT TCCGGGCCAC CGGCGACCCC GGCGAGCGGG CCGCCGGAAA GCTGACGCTC
ACCGTCATCC GCGAGGAGCT GTCGCGTGTC GTGCGCCAAC GCGCGGCCGA GGACCCGAAC
CTGCACTACC TCGACGGCCG CGAACTCTAC GGTGAGGCGG ACAACGCCGA ACTGCCGCTG
CCCGACGACC TGCACCCGGA CGCCGCCACA CACCGGCGCA TCGGTGAGCG CTTCGCCGAA
CGCGCCTTCG GCGCGGGCGG CCCGTTCCAC GGCCACAGAC CCTGA
 
Protein sequence
MNTQQHWTTT PLTADLLRGA LDLERTERGV LPHRLPARAR AQCGDPQLLM AESQPSGVRL 
ALRTRATALE LDTLPTKRAY TGAPPRPDGV YDLLVDGRLV DQATATGGDT LTVDMATGSV
EHRPGPVATV RFTDLPDRVK DVEIWLPHNE TTELVALRTD AAVEPAPDRG RRVWLHHGSS
ISHGSNAASP TTVWPALAAS LGGVELVNLG LGGSALLDPF TARAMRDAPA DLISLKIGIN
LVNADLMRLR AFGPAVHGFL DTLREGHPST PLLVVSPILC PIHEDTPGPC AFDPDSLRAG
ELRFRATGDP GERAAGKLTL TVIREELSRV VRQRAAEDPN LHYLDGRELY GEADNAELPL
PDDLHPDAAT HRRIGERFAE RAFGAGGPFH GHRP
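
The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch, not the database's own method: it assumes the standard codon assignments (which translation table 11 shares with the standard code for elongation codons), ignores alternative start codons, and uses hypothetical helper names (`clean`, `gc_content`, `translate`). Only the first 30 and last 45 bases of the gene sequence are used here as examples.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of bases that are G or C."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons and drop any trailing stop codon(s)."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


first_30 = "ATGAACACCC AGCAGCACTG GACCACCACA"
last_45 = "CGCGCCTTCG GCGCGGGCGG CCCGTTCCAC GGCCACAGAC CCTGA"
print(translate(first_30))  # MNTQQHWTTT
print(translate(last_45))   # RAFGAGGPFHGHRP
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the listed protein sequence and the reported GC content of 74%.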