Gene Ndas_2850 details


Gene Information

Locus tag: Ndas_2850
Symbol:
ID: 9246701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3402150
End bp: 3403415
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: putative oxygenase subunit protein
Protein accession: YP_003680767
Protein GI: 297561793
COG category:
COG ID:
TIGRFAM ID:
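
The coordinates and lengths in the record above are mutually consistent, which a short check makes explicit. This is a minimal sketch that assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values copied from the record for Ndas_2850.
START_BP = 3402150
END_BP = 3403415

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values listed for Ndas_2850 this reproduces the reported 1266 bp and 421 aa.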


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAGGA TCCTCGTCGT CGGAGCCGGA CAGTCCGGAC TCCAGCTGGC CGTCGGCCTG 
CTGGGCGAGG GGTACGAGGT GACCCTGGCC ACCGAGGCCG GGCCTGAGGA GGTCCGCGCC
GGGCGGGTCA CGTCCACGCA GTGCCTCTTC GGCCCCGCCC TGCGCCGCGA GCGCCGCCAC
GGCCTCGCCT TCTGGGACGA CCGCGCGCCC GCCGTCCGCG GCGTCGGGTT CCGCGTGGCC
GACCGCGCCG GGGCGGGCGC CCCCGCCCTG TCCTGGGTCG GCGCGCTGGA CGAGCACGCC
CAGTCGGTCG ACCAGCGGCT CAAGATGTCG GCCTGGCTCG ACCTGTTCGT CCGGCGCGGC
GGCGCCCTCC TGCGCGGGCG CGTCCTGGCC GAGGACCTGG AACGGCTCGC CGCCGACCAC
GAACTCACCG TCGTCGCCTC CGGGCGCGGC GCGCTCTCCG AGGTCTTCCC CCGCGACACC
CGGCGCTCGC ACTTCCGGTC CCCGCAGCGC TCCCTGGCGC TGGCCTACGT GACCGGCGCC
GGGCCCCACC CCGACGGGCC CGTGCTCAGC CGCACCGTCG TCCCCGGCGC GGGGGAGGTC
ACCACCCTGC CCACCTACTC CCTGGCCGGG GTCTGCGAGG CCGTCATGGT CGAGGCGGTG
CCCGGCGGCC CCCTGGACCG GCCGCTGCCC CCCGGCGCCT CCGGCGAGGA GGTCCTCGCG
GGCCTGCTGG ACGTGCTGTA CCGCGAGGCG CCCTGGGAGT ACGAGCGCCT GGCCCACGCC
CGCCTCGCCG ACCCGGGGGC GGCCCTGCGC GGCGGCTACG CGCCGGTGGT GCGCGAACCC
GTCGCCCGCC TGGCGAACGG AACCCCCGTC CTGGGCATGG CGGACTCCGT GGTCGCCAAC
GACCCCGTCA CCGCGCAGGG GGCCAACATG GCCTCCTTCG GCGCCGAGGT CTACCGCCGC
GCCGTCGTCG ACCACGGACG TCGGCCCTTC GACGAGGCCT TCATGCGCTC GGCGTTCGCC
GCCTACTGGC GCCTGGCCAG CCAGGTGACC GCGTGGAGCA GGGTCCTGCT CACCGCTCCG
CCGCACCTGG AGGAGCTGTA CCGCCTCGCC GCGCGCCACC AGGAGACGGC CGACCGCTTC
GCCAACTGCT TCAGCGACCC CGGCGACCTG ATCGGGTGGT TCCTGCACCC CGAACGGGCA
CTGGCCTACG TGGACGGCGT CCGGCGCGCC GAACCCGCAC GGTCATCCCA TCTCCCTATC
TCCTGA
 
Protein sequence
MRRILVVGAG QSGLQLAVGL LGEGYEVTLA TEAGPEEVRA GRVTSTQCLF GPALRRERRH 
GLAFWDDRAP AVRGVGFRVA DRAGAGAPAL SWVGALDEHA QSVDQRLKMS AWLDLFVRRG
GALLRGRVLA EDLERLAADH ELTVVASGRG ALSEVFPRDT RRSHFRSPQR SLALAYVTGA
GPHPDGPVLS RTVVPGAGEV TTLPTYSLAG VCEAVMVEAV PGGPLDRPLP PGASGEEVLA
GLLDVLYREA PWEYERLAHA RLADPGAALR GGYAPVVREP VARLANGTPV LGMADSVVAN
DPVTAQGANM ASFGAEVYRR AVVDHGRRPF DEAFMRSAFA AYWRLASQVT AWSRVLLTAP
PHLEELYRLA ARHQETADRF ANCFSDPGDL IGWFLHPERA LAYVDGVRRA EPARSSHLPI
S
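
The relationship between the gene sequence and the protein sequence above can be illustrated by translating codons directly. The sketch below is hedged: it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, and that is not handled here). It strips the spaces that group the sequence into blocks of ten and stops at the first stop codon. The `translate` helper is a hypothetical name for this example, not a database or library function.

```python
# Codon lookup in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon until the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if residue == "*":  # stop codon ends the protein
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGCGCAGGA TCCTCGTCGT CGGAGCCGGA"))
```

Applied to the opening 30 bases this yields the first ten residues of the listed protein, and the final six codons (ending in TGA) yield its last five residues.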