Gene Ndas_4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4033 
Symbol 
ID9247905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4826376 
End bp4828325 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681936 
Protein GI297562962 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCAGA CACAACCACA CAACAACGGA CGGGGCGCGG CCCCGCGCGT GGACGCGGCG 
AGCGTCGAGG CGGTCGCGGG CGCGCCCGAG TACGACTCCA CCTCACGCCT GAGCTGCGCG
GCCTGGACCG ACCCGGCCTT TCGCGCCGAC GTCCTGCGCG AACGCGTGGA GAACCCCCAC
CGCACCCTGG CCCGGGAACC CGGGCTGGAC GCGGCCCTGG TCACCGAGGA GTGCGTCCGG
GCGAGGAACG TCCGCGCCAC CGCGGGGGCG CTGTTCCTCG GCGCCGCCCT CCTGGTCTGC
CCCTTCAACC TCCCGGCGGG GCTCCTCACC CTGGCCGTGG TCCTCGCCCT GGCGATGACG
GCGCGGTACC GCGCCACGCA CGACGGCGCC CGGCCCCGTG TGGGCTGCGT GTCCCTGGTC
CTCGGCGCGG TCGCCCTCTA CATGCTCTAC ACCTTCGCCT CCCCGCTCCT GGTGCCGCTG
TTCGGCGCAG GGGCGGACGC CATGGGCGAC ATGAACGGTA CGGGAGGCAT GGAGGACGAC
CCCTGGGGGT CCACCGACCC CTCGGAGGAC CTCCCCGCCG GCCCCGGATT CGGCTCCCTG
ATCCCCTTCT GGCTGGGTCT GCTCGTCCTG CTCGCCGCCG CCGTGGCCAT CGGCGGCTGG
GTCCGGTTCA GGGCCAACAC CGCGTTCCGC CGCATCCGGA GCGCGGCGGC CTCCCCCCGT
ACGGGCAGCG GTGTCGGCTC CGTCCCGGTG GCCTTCTACA ACGACTTCAC GCCCTTCGTC
GGCACCGGGG CGCACCACAG CTCCTGGCCG GTGATGCTCA AGCTCCTGCC CCGGGACGGC
GGGGGGACGC AGCCGCCGGA CGCGGCGGAG GACCGGGAGA CCGGCGGGGA CCGGAAGGCG
GCCAACGGCC ACGGGACCGG GGCCGGGAAC GGCGCGAGCG CCCCGGCCTC CCCCGCGGAC
GGCGCGTACG AGCGGCCCGA CGAGGGCCGT GAGGTCCCTC CCGCGGCCGA CGCGACCCTG
GTCAAGGACC TCTACGCCAG GCTGCGCGAG GAGCTGCCCA AGCTCACCGG GACCGAGGGC
GTGCGGAGCA GCACCCGGCG CGAGGTCGAG GTGGCCGACT GCGTGTTCCT GCCGGGTCTG
CGCCAGGACG ACGTGGTGGC CCTCGCCCCG CGGCTGATCG ACCGGAACAG GTGGGCGCTG
CGCGAGGAGT GGGTCGACGG TTTCGTCGAC GCCTTCCACG AACGCGCCCG GCACTTCCTG
GAGATCGGGA TCAGCATGTG GGAGAGCCAG GTCGTGGTGA CGGTCTTCGT GCGCCTGTCC
ACCCAGGGCG GTCTGCTGCA CGTCGAGGGC GAGACCATGG TCATGCCTCC CCTGTCCCCC
GACTACCGGG TTCCCCAGGA ACCGCTGTCC GTCGGCGCCG ACGCCGGGGA GGTCGCCGTG
CTCGTCGGGG ACTCCCTGTT CGGCGTCCTG GAGGACCTGC GGACCAACCC GGCCGAGGCC
CTGGCCTGGG TCGCCTCCCG GTCGGCCACC AGGAGGAACA ACAGGGTGCA CACCTGGGCC
CGGAGCAACA AGGAGTTCTT CGACTACTCC CCGCGGATGG GCGTCCGGGA GAGGGCGGCC
ACGCCCCGGG TCCGGCAGCT CTTCCAGTCG CACGACATCC GCCGCGTCAC CCGGGCCATC
CCGGAGAAGG TGCTGGTCTG CGTCCGCGAC GTGCTTCGGG AGGCGGGCTA CGACACCGAG
CAGGTCGCCC GCATCATCCA GAACATCAGC AACTACGGCG CCACCTTCCA CGGAGGACGG
CACAGCTTCT CGGGCAGCAC CTTCGGATCG GGCGACATCC ACCACCACAC CCCGCCGAAC
GCGGACGAGG GGCCCGGCGG TGGCGTTCCC GGTGGACCTG ACGGCGCCCC GGGCCCGGTC
CCGCAGAACA CGAGGACGGA GGCGAGGTGA
 
Protein sequence
MTQTQPHNNG RGAAPRVDAA SVEAVAGAPE YDSTSRLSCA AWTDPAFRAD VLRERVENPH 
RTLAREPGLD AALVTEECVR ARNVRATAGA LFLGAALLVC PFNLPAGLLT LAVVLALAMT
ARYRATHDGA RPRVGCVSLV LGAVALYMLY TFASPLLVPL FGAGADAMGD MNGTGGMEDD
PWGSTDPSED LPAGPGFGSL IPFWLGLLVL LAAAVAIGGW VRFRANTAFR RIRSAAASPR
TGSGVGSVPV AFYNDFTPFV GTGAHHSSWP VMLKLLPRDG GGTQPPDAAE DRETGGDRKA
ANGHGTGAGN GASAPASPAD GAYERPDEGR EVPPAADATL VKDLYARLRE ELPKLTGTEG
VRSSTRREVE VADCVFLPGL RQDDVVALAP RLIDRNRWAL REEWVDGFVD AFHERARHFL
EIGISMWESQ VVVTVFVRLS TQGGLLHVEG ETMVMPPLSP DYRVPQEPLS VGADAGEVAV
LVGDSLFGVL EDLRTNPAEA LAWVASRSAT RRNNRVHTWA RSNKEFFDYS PRMGVRERAA
TPRVRQLFQS HDIRRVTRAI PEKVLVCVRD VLREAGYDTE QVARIIQNIS NYGATFHGGR
HSFSGSTFGS GDIHHHTPPN ADEGPGGGVP GGPDGAPGPV PQNTRTEAR