Gene Ndas_3506 details

Gene Information

Locus tag: Ndas_3506
Symbol:
ID: 9247375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4208888
End bp: 4210531
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: protein of unknown function DUF187
Protein accession: YP_003681413
Protein GI: 297562439
COG category:
COG ID:
TIGRFAM ID:
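
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the reported gene length is consistent with it) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4208888
END_BP = 4210531


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the result is 1644 bp and 547 residues, matching the Gene Length and Protein Length fields.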


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.339986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCGAAAC CGCACCGGAA CCGTCCCGCT CATCCGCCCG CCCGGGGGGC GGGCCGGGTC 
TCCCGAGCCG TCGCCCTGGC CGCAGCGCTC ACCGCGTCGC TGCTCCACGC GCCGCACGCG
GCGGCCGCCG CTCCGGCCGT CCCGGCGACG AACACCTCCG CCGAGCCGTG CGCGGTCGAT
CCGCAGGCGC CGCCCAAGCG GCAGATGCGC GCCGAGTGGA TCTCCTCGGT GGTCAACATC
GACTGGCCCA GCGAGCAGGG CCTCTCACCC GAGCGGCAGA AGGCCGAGCT GATCGACCTG
TACGACCGGG CGAAGGCCGA CGGGCTCAAC GCCGTGTTCG TGCAGATCCG GCCGACCGCC
GACGCGTTCT GGCCCTCACC GCACGAGCCC TGGTCGGAGT GGCTCACCGG CACGCAGGGC
ACCGACCCCG GGTACGACCC GCTGGCGTTC GCGGTCGAGG AGGCGCACGC CCGCAACCTG
GAGTTCCACG GCTGGTTCAA CCCCTACCGG GTGGCCATGC ACGACGACCC CTCGCGCCTG
GTGGCCGACC ACCCGGCGCG GGTCAACCCG GACTGGGTGT TCGCCTACGG CGGCAAGCTC
TACTACGACC CGGGCATCCC CGAGGTGCGC GCGTTCGTCG TCGAGGCGAT GATGCACGCG
GTCGAGAACT ACGACCTGGA CGGCGTCCAC TTCGACGACT ACTTCTACCC CTACCCGGTG
GCGGGCGAGA CGGTGCCCGA CCAGGACACG TTCGCCGAGT ACGGCGGAGA GTTCGGCGAC
ATCGGGGACT GGCGGCGCGA CAACGTCAAC CGCATGGTCC AGGAGATGGA CGAGGCGGTG
CACACCGCCA AGCCGCACGT GAAGTTCGGC ATCAGCCCGT TCGGGATCTG GCGCAACGAC
ACCAGCGACC CGAACGGCTC CGACACCGGC GGGTTCGAGT CCTACAGCCA GATCTACGCC
GACAGCCGCA GGTGGGTGCG CGAGGGCTGG GTGGACTACA TCAACCCGCA GGTCTACTGG
GAGATCGGCC TGCCCGTGGC CGACTACGCG GTGCTCGTCC CCTGGTGGGA GCAGGTCACC
GAGGGCACCG ACGTACACCT CTACATCGGC CAGGCCGCCT ACAAGGTCGG CAACGCGGGC
GCCTGGTCCG ACCCGGACGA ACTCTCCCGG CACCTGGACC TGAACCGCGA GTACCCGGGC
GTGGACGGGG ACGTCTACTT CAGCGCGAAC TCGCTGCGCA CCAACGCCAG GGACGCCATG
GACGTCGTGG TCGAACAGCA CTACGCCAAC CCCGCCCTGA TCCCGGTCAA GGAGGACCTG
GGCGGGGCCG CTCCCGCACC GCCGGTGGTC ACCGCCGCCG CCCGGGCCGA CGGCGGCACC
GAGCTGACCA TCCGCCCCGG ACGCGGCGGC AGGCCCGCCT ACTACGCGGT CTACGAGCTG
GAGGGCGCAC CCGGACGGCA GGAGGTCCCG TGCGAAGTGC AGGACGCGCG CGCACTGATC
GGCACGGTGC GCGCGGCCGA GGACGGTGGC GAGACGGTCT TCACCGCGCC CGGCGGCGGT
GACGTGACCT ACTACGTGAC CGCCCTGGAC CGGCTGCACC ACGAGAGCAC GACGAGCAAC
CCGAGGCATG TGCCCCGGGG CTGA
 
Protein sequence
MAKPHRNRPA HPPARGAGRV SRAVALAAAL TASLLHAPHA AAAAPAVPAT NTSAEPCAVD 
PQAPPKRQMR AEWISSVVNI DWPSEQGLSP ERQKAELIDL YDRAKADGLN AVFVQIRPTA
DAFWPSPHEP WSEWLTGTQG TDPGYDPLAF AVEEAHARNL EFHGWFNPYR VAMHDDPSRL
VADHPARVNP DWVFAYGGKL YYDPGIPEVR AFVVEAMMHA VENYDLDGVH FDDYFYPYPV
AGETVPDQDT FAEYGGEFGD IGDWRRDNVN RMVQEMDEAV HTAKPHVKFG ISPFGIWRND
TSDPNGSDTG GFESYSQIYA DSRRWVREGW VDYINPQVYW EIGLPVADYA VLVPWWEQVT
EGTDVHLYIG QAAYKVGNAG AWSDPDELSR HLDLNREYPG VDGDVYFSAN SLRTNARDAM
DVVVEQHYAN PALIPVKEDL GGAAPAPPVV TAAARADGGT ELTIRPGRGG RPAYYAVYEL
EGAPGRQEVP CEVQDARALI GTVRAAEDGG ETVFTAPGGG DVTYYVTALD RLHHESTTSN
PRHVPRG
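
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a few lines of Python. This is a minimal sketch, not the method the database used: it builds the codon table from the standard NCBI layout, and it assumes that under translation table 11 an alternative start codon such as TTG is read as methionine when it initiates the CDS. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for table 11; each is read as M at the CDS start.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence in this record.
    print(translate("TTGGCGAAAC CGCACCGGAA C"))
```

Translating the first 21 bases gives MAKPHRN, the first seven residues of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein and the GC content field.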