Gene Ndas_4955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4955 
Symbol 
ID9248843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp98132 
End bp99304 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content74% 
IMG OID 
ProductDNA-directed DNA polymerase 
Protein accessionYP_003682843 
Protein GI297563870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.405656 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.601805 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTCT TCGACGACCT CATCGGCCAG CGGTCCACGG TGGGCCAGCT CCAGCGGGCC 
GTCGCCGGGG CCGCCGACCT CGTCTCCGGC GGACGGGGCA CGGGGATGAC CCACGCCTGG
CTGTTCACCG GGCCGCCCGG TTCGGGCCGC TCGGAGGCCG CCCGGGCCTT CGCGGCCGCC
CTCCAGTGCC GGGACGGCGG CTGCGGGCAC TGCGCCTCCT GTCACCAGAC CCTCGCCGGC
ACCCACCCCG ACGTGCTGTA CGTGCGGCCC AGCGGACTGA GCTTCGGTGT GGCCGCCACC
CGCGACCTCG TGTTGCGCGC AGGCTCCAAA CCCTCCGGCG GCCGGTTCCG GATCGTGCTG
TTCGAGGACG CCGACCGCGC CACCGAGGCC GCCTCCAACG CCCTGCTCAA GGCGGTGGAG
GAGCCCTCGC CGCGCACCGT GTGGCTGCTG TGCACCCCCA CCCCCGACGA CCTGCTCGTC
ACCATCCGTT CGCGCTGCCG CATGGTCACC CTGGCCACGC CCGGCACCCG GGAGCTGGTG
GACGCCCTCG TCCAGCGCGA CGGGATCGAC GCCGAGACCG CCCGCGCCGC GGCGGTCGCC
GCCTCCGGGC GGATCGACCG GGCCCGGCAG CTGGCCACCG ACCCCGAGGC GCGCAGACGC
CGGGAGGAGG TGCTGTCCAT CCCCGCCCGG CTCGACGGGC TCGGCGCGTG CGTCACCTCC
GCCGCCCGGC TCTACGAGAT CGCCGAGGAG GAGTCCAAGG CCCTCACCAC GGCGCTCGAC
GAGAAGGAGA GGGAGGAGCT GCGGGCCGCG TTCGGCGAGG GCTCCACGGG CAAGGGCGTG
GCCAAGGCGA TGCGCGGCTC CGCGGGGGCG ATGAAGGACC TGGAGGAGCG GCAGAAGCGC
CGTGCCACCC GCATCAAGCG CGACTCCTAC GACCGCGCCC TGCTCGACCT GGTCGCGTTC
TACCGGGACG CGCTCACCCT CCAGCTGGGC GCGAGGGTGG AGCTGTCCAC GGCCGAGCGC
TCCGGCGACC TGGAACGCGT CGCCCGCTCC AGCACACCGG AGTCCACGCT GCGCAGGATC
GACGCCATCA TGGAGTGCCG TGAGCGCATC GCCGCGAACG TGCACCCCCA GATCGCCATG
GAGGCCATGA CCTCCGCCCT CCTGGCCGGA TAG
 
Protein sequence
MTVFDDLIGQ RSTVGQLQRA VAGAADLVSG GRGTGMTHAW LFTGPPGSGR SEAARAFAAA 
LQCRDGGCGH CASCHQTLAG THPDVLYVRP SGLSFGVAAT RDLVLRAGSK PSGGRFRIVL
FEDADRATEA ASNALLKAVE EPSPRTVWLL CTPTPDDLLV TIRSRCRMVT LATPGTRELV
DALVQRDGID AETARAAAVA ASGRIDRARQ LATDPEARRR REEVLSIPAR LDGLGACVTS
AARLYEIAEE ESKALTTALD EKEREELRAA FGEGSTGKGV AKAMRGSAGA MKDLEERQKR
RATRIKRDSY DRALLDLVAF YRDALTLQLG ARVELSTAER SGDLERVARS STPESTLRRI
DAIMECRERI AANVHPQIAM EAMTSALLAG