Gene Ndas_4094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4094 
Symbol 
ID9247968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4891606 
End bp4892847 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content77% 
IMG OID 
ProductFAD dependent oxidoreductase 
Protein accessionYP_003681996 
Protein GI297563022 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.232539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.561641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGAACG ACGGGGACTC CTCCCAGCGT GCCGGTGGCG GAGACGGCGG CTCGCCCCCG 
GGGCGGGCGG TGATCGTCGG CGCGGGCATC GCCGGGCTCG CCACCGCGCT ACGGCTCCAC
CGGGCCGGTT GGGAGACGCT CGTGGTCGAG CGCGCTCCGG CCCGCCGCAG AGGCGGCTAC
ATGGTGAACC TGGTCGGGTG GGGATACGAC GCCGCCGAGC GCCTCGGCCT CGTCCCCGCG
CTCTCGGCCA GCGACATCGG CCTGTTCTCC ACGGTCCTGG TGAGGGCCGA CGGTCGGCGC
AAGTTCTCCG TCCCTCCCGA GATCGCCAGG GCCGCCCTCG GCGACCGGGC GCTGACCGTG
TTCCGGGAGA ACCTGGAATC GGCGCTGTAC GAGGCCGTGC GCGGCAGCGC GGTCCTGCGA
TTCGGCACCA CCGCCGTGGA CGTGGCCCAG GACGCCGACG GCGTCCGGGT CGGGCTCAGC
GACGGCACCA CCGAGCGCGC CGACCTGCTC GTCGGCGCCG ACGGGCTGCG CTCCGGCGTC
CGCGCCGCCG TGTTCGGCCC CGAGGCGGAC TTCCGCGTCG ACCTGGACCA CGTCGTGGGT
GCGCTCCCGC TCGACCGGCT CCCCGAGGAC GTGCCCGAGG GGACCGGCAC CACGTTCATC
GGGCCGGGGC GCACGGCGGC GGTGGTCAAC CTCGGTCCGG GCCGGTCCTC GGCCTTCTTC
GCCTACCGCT GCGCGGATCC GGACGCCGAA CTGGCCCGGG GACCGGTCCG GGCGCTCACC
TCGGCGTTCG GCGACCTCGG CGGGGGCGTG CCCGACGTGC TCCGCCAGTT GCGCGCCGAC
CCGTCCGGCG CCTACTTCGA CTCCGTCAGC CAGGTCCGCG CGGACCGGTG GAGCCGCGGC
CGGGTCGTGC TGCTGGGCGA CGCGGCCTGG TGCCCCTCCC TGTTCGCCGG GTACGGCGCG
GCCCTCGCGC TCAGCGGCGC CGACCGGCTC GGCGACGCCC TGGAGCGGCA CGGCGGCGAC
GTCACCGGGG CCCTGGCGCG GTGGGAGGCG GGCCTGCGCC CCGAGACCCG CAGGCGGCAG
GCGCTCGCCC GGCGGGGGAC GCGCCAGTAC GCCCCCTCCA GCCGCGCGCA CGTGTGGATG
AACGACCTGG CGATCCGGGC CGTCCTGCTT CCGGGCGTCC GCGGCCTCGT CCAGCGCCGC
ATCCGGCGCG CCGGTGAGCG GCACGCCGCG GCGGACGGGT GA
 
Protein sequence
MTNDGDSSQR AGGGDGGSPP GRAVIVGAGI AGLATALRLH RAGWETLVVE RAPARRRGGY 
MVNLVGWGYD AAERLGLVPA LSASDIGLFS TVLVRADGRR KFSVPPEIAR AALGDRALTV
FRENLESALY EAVRGSAVLR FGTTAVDVAQ DADGVRVGLS DGTTERADLL VGADGLRSGV
RAAVFGPEAD FRVDLDHVVG ALPLDRLPED VPEGTGTTFI GPGRTAAVVN LGPGRSSAFF
AYRCADPDAE LARGPVRALT SAFGDLGGGV PDVLRQLRAD PSGAYFDSVS QVRADRWSRG
RVVLLGDAAW CPSLFAGYGA ALALSGADRL GDALERHGGD VTGALARWEA GLRPETRRRQ
ALARRGTRQY APSSRAHVWM NDLAIRAVLL PGVRGLVQRR IRRAGERHAA ADG