Gene Ndas_2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2041 
Symbol 
ID9245891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2462355 
End bp2463401 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content76% 
IMG OID 
Productoxidoreductase domain protein 
Protein accessionYP_003679973 
Protein GI297560999 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.284403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTGG GAGTGGTGGG ACTGGGGATG GGCCTGCACC TGGCGGTGTG GGCCGCCAGA 
CTGGGCATGG ACGTGGTGGC GGCCTGCGAC CGCGACCCCT CCCTGCACGC CGCGGCCCGC
GAGCGGCTGC CAGGCGCGAC CCTCACCGAC CGCTGGCGGG ACCTGCTCGA CCAGGACTTG
GACGGGGTCA TTCTGGCCAA CGACTTCGAC GCCCACGCGC CCCTGGCCCT GGCCTTCCTC
GACCGCGGCG TCCACGTGCT CTCCGAGACC GCCGCGTGCG CCGACGAGGC CGAGGCGAGG
GCGCTGGTCG CGGCGGCCGA CCGCTCGTCG GCGACCTACT CCCTGGCCGA GAACTACACC
CTCCACCCGC ACGTGCTGCT CGTCCGCGAG GCCGTCCGGG CGGGCGAACT GGGCCGGATC
AGCCTCATCG AGGCCGACTA CCTGCACGGC ATGTCCCCCG AGGGCGTCGC CGGGCTGACC
GGCGACCCCG CCCACTGGCG CGGGCGCATC GCCCCCACCG CCTACTGCAC CCACTCCCTG
TCACCGATCC TGGCGATCAC CGGTGCGCAC CCGGTGGAGG TCAGCGCGTT CACCGTGGAC
GAGGCCGCGC CGCGCCAGGC CAGCACCATG GTGGTGCGCC TGTCCACGGG CGCCCTGGCC
GTCACCCGCA ACGGCTTCCT CCAGGGCGAA CCCGACAGCC ACTGGAGCTG GGTGTCGGTG
CGCGGCACCC GCGGGCTGGC CGAGTCGGTG CGGGCGCGGG GAGAGCGCGC CTGGTCGGTG
CGCGTGCGCC ACGAGGGGTG GACCCGCCCC GACGGCGACG CCCACGAGGA GGAACGCGTC
CCGCCCGCGC TGTCGCTGGA CGGCGAGCCC GTGGAACGCG GGGCCGAGGG CACGGTGCGC
CTGCTGCGGG GCTTTCGCGA CACCGTCGAG CACGGCGCCG AGCCGCTGGT GCCGGTGCGC
GCGGCCGTGG CGGCCTCCCT GGTCGGGGTG GCCGGGGCCG AGTCGCTGGC CCGGGGGTCG
TGTCCGGTCC CGGTTCCGCC GCTGTGA
 
Protein sequence
MRVGVVGLGM GLHLAVWAAR LGMDVVAACD RDPSLHAAAR ERLPGATLTD RWRDLLDQDL 
DGVILANDFD AHAPLALAFL DRGVHVLSET AACADEAEAR ALVAAADRSS ATYSLAENYT
LHPHVLLVRE AVRAGELGRI SLIEADYLHG MSPEGVAGLT GDPAHWRGRI APTAYCTHSL
SPILAITGAH PVEVSAFTVD EAAPRQASTM VVRLSTGALA VTRNGFLQGE PDSHWSWVSV
RGTRGLAESV RARGERAWSV RVRHEGWTRP DGDAHEEERV PPALSLDGEP VERGAEGTVR
LLRGFRDTVE HGAEPLVPVR AAVAASLVGV AGAESLARGS CPVPVPPL