Gene Ndas_0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0638 
Symbol 
ID9244480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp784924 
End bp786264 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content73% 
IMG OID 
Productprotein of unknown function DUF323 
Protein accessionYP_003678590 
Protein GI297559616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.749121 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCACG AACAGGAGCG CGCCAAGGAG CGCATCGCCG CCGAACTCGA CCGGGCGAGG 
GACCGCAGCA ACGGGCTGAC CCTCCACGCC CTCGACGAGG GGGAGCTGCT CGCCCAGCAC
TCGCCCCTGA TGTCGCCGCT GGTCTGGGAC CTGGCGCACG TGGGCAACTA CGAGGAGCAG
TGGCTGCTGC GCGCCGCGGG GGGCCGTGAG GCGCTGCGCC CCGACATCGA CACCCTCTAC
GACGCCTTCG AGAACCCGCG CGCCGAACGG GTCAGCCTCC CGCTGCTGCG GCCCGAGGAG
GCCCGCGACT ACAACGCGCG CGTGCGCAGG GAGGTGCTCG ACGCGCTGGA GTCCGCCGAC
CTCACCCGGG TGGAGACCGG CCCGGACGGC GAACGCTCCC TGCTGGACGC CGGGTTCGTC
TTCCACATGG TCATCCAGCA CGAGCACCAG CACGGCGAGA CCATGCTCGC CACCCACCAG
CTCCGCAAGG GCGAGCCCGT CCTGCTGGAG GAGGCCGCGC CCGTCACCAC GCTGCGCCCG
CCGGTCCGGG ACGAGGTGTT CGTGCCCGAG GGGCCGTTCA CCATGGGCAC CGACGACGAC
CCCTGGGCCT ACGACAACGA GCGCCCCGCG CGCACCGTCG ACCTGGGCCC CTACTGGATC
GACACCGCCC TCGTGACCAA CGCCGCCTAC CAGGAGTTCA TGGACGACGG CGGCTACCAG
ACCCGCCGCT GGTGGACCCG CGACGGCTGG GAGTGGAAGG AGAAGCGGGG AGCGGTCTCC
CCTGCCTTCT GGACCCGGGA GGGGACCGGG TGGTCGCGCC GCCGGTTCGG CCGCCAGGAG
ATGGTGCCCC CCGACGAGCC CGTGCAGCAT GTGTGCTTCC ACGAGGCCCG GGCCTACGCC
GCCTGGGCGG GCAAGCGCCT GCCGAGCGAG CCCGAGTGGG AGAAGGCCGC GCGCTTCGAC
CCGGTCAGCG GCCGGTCCCG GCGCTACCCG TGGGGCGACA CCGATCCCGG ACCCGGCCAC
GCCAACCTGG GCCAGCGCAG GCTGGGCCCC TCACCCGCCG GGCGCCACCC CGACGGCGCC
TCGCCGCTGG GCGTCCAGCA GCTGGTCGGC GACGTGTGGG AGTGGACCTC CACCACCTTC
ACCGGCTACC CGGGGTTCCG CGCCTTCCCG TACGAGGACT ACTCGGAGGT GTTCTTCGAC
GACGGGTACA AGGTGCTGCG GGGCGGCTCC TGGGCCACCC ACCCCACCGC GGCCCGCGCC
ACGTTCCGCA ACTGGGACCA CCCGATCCGC CGGCAGATCT TCAGCGGTTT CCGCTGCGCG
CGCGACGCGG AACCCGCCTG A
 
Protein sequence
MNHEQERAKE RIAAELDRAR DRSNGLTLHA LDEGELLAQH SPLMSPLVWD LAHVGNYEEQ 
WLLRAAGGRE ALRPDIDTLY DAFENPRAER VSLPLLRPEE ARDYNARVRR EVLDALESAD
LTRVETGPDG ERSLLDAGFV FHMVIQHEHQ HGETMLATHQ LRKGEPVLLE EAAPVTTLRP
PVRDEVFVPE GPFTMGTDDD PWAYDNERPA RTVDLGPYWI DTALVTNAAY QEFMDDGGYQ
TRRWWTRDGW EWKEKRGAVS PAFWTREGTG WSRRRFGRQE MVPPDEPVQH VCFHEARAYA
AWAGKRLPSE PEWEKAARFD PVSGRSRRYP WGDTDPGPGH ANLGQRRLGP SPAGRHPDGA
SPLGVQQLVG DVWEWTSTTF TGYPGFRAFP YEDYSEVFFD DGYKVLRGGS WATHPTAARA
TFRNWDHPIR RQIFSGFRCA RDAEPA