Gene Ndas_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1664 
Symbol 
ID9245514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2033015 
End bp2034292 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content73% 
IMG OID 
ProductFAD dependent oxidoreductase 
Protein accessionYP_003679599 
Protein GI297560625 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.896547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTGG AAGAGATCGA GGTCGTCGTC GTCGGAGGGG GGCAGGCCGG GATCGCGATG 
AGCGAGCACC TCAGCGACCG CAGGGTGCCC CACGTCGTCC TGGAGAGGAG CCGCGTCGCC
GAGCGCTGGC GCTCGGAGCG GTGGGACTCG CTGGTCACGA ACGGGCCCGT CTGGCACGAC
CGGTTCCCCG GCCTGGAGTT CACCGGCCTG GACCCCGAGG ACTTCGCCTC CAAGGACGGC
GTCGCGGACT ACCTGGCGAC CTACGCCGAG AAGATCGGCG CGCCGATCCG GTGCGGGGTC
GAGGTGACCT CGGTGCGACG CAACACCGGA CGCCGGGGGT TCCGGGTCCG GACCTCGGAC
GGTGACATCG ACGCCCGCTA CGTCGTGGCC GCCACCGGCC CGTTCCAGCG GCCGGTCGTC
CCCCCGCTCA TCCCCGAGGA CTCGGGTTTG GTCCAGCTCC ACTCCAGCGC CTACCGCAAC
CCCGCGCAAC TGCCCGAGGG CGGCGTGCTC GTGGTCGGGT CGGGCTCCTC GGGGGTCCAG
ATCGCCGACG AACTCCGCGC CAGCGGCCGA CGGGTGTACC TGTCCGTCGG TCCGCACGAC
CGCCCGCCGC GCCGGTACCG GGGGCACGAC TTCGTCTGGT GGCTGGGCGT CCTCGGCAAG
TGGGAGGCCT CGGCCCCCGC GCGGGGCGCC GAGCACGTCA CGATCGCGGT CAGCGGCGCG
CACGGCGGCC ACACCGTGGA CTTCCGGGCC CTGGCTCAGC GCGGCATCAC GCTGGTCGGC
CGGACGGAGT CCTTCGACGC CGGAACCGTC CGCTTCGCGC CCGACCTGCG GGACAACATC
GCCCGCGGCG ACGCCAACTA CCTGTCGCTG CTGGACGAGG CCGACGCCTA CGTCGAACGC
AACGGACTCG ACCTGCCCGA GGAGCCCGGG GCCCGCGTCC TGGGCGCCTA CCCGGAGTCG
GTGACCGACC CCCTCCGCGA ACTCGACCTC GCCGCCGCCG GGGTCCGGAC CGTCCTGTGG
GCGACCGGCT TCACCGCCGA CTACGGCTGG CTGCGGGTGG ACGCCTTCGA CGAGAACGGC
AGGCCCCGGC ACCAGCGCGG GGTCTCCTCC GAGCCCGGCG TCTACTTCAT GGGGCTGCCG
TGGCAGTCCC GCCGCGGTTC GAGCTTCATC TGGGGGGCCT GGCACGACGC CAAGTACGTG
GCCGACCAGA TCTGCATCCA GCGCGGCTAC ATGGAACACC ACGACGCGCA CCCGCACCCG
TCCCACACCC AGGGGTGA
 
Protein sequence
MSLEEIEVVV VGGGQAGIAM SEHLSDRRVP HVVLERSRVA ERWRSERWDS LVTNGPVWHD 
RFPGLEFTGL DPEDFASKDG VADYLATYAE KIGAPIRCGV EVTSVRRNTG RRGFRVRTSD
GDIDARYVVA ATGPFQRPVV PPLIPEDSGL VQLHSSAYRN PAQLPEGGVL VVGSGSSGVQ
IADELRASGR RVYLSVGPHD RPPRRYRGHD FVWWLGVLGK WEASAPARGA EHVTIAVSGA
HGGHTVDFRA LAQRGITLVG RTESFDAGTV RFAPDLRDNI ARGDANYLSL LDEADAYVER
NGLDLPEEPG ARVLGAYPES VTDPLRELDL AAAGVRTVLW ATGFTADYGW LRVDAFDENG
RPRHQRGVSS EPGVYFMGLP WQSRRGSSFI WGAWHDAKYV ADQICIQRGY MEHHDAHPHP
SHTQG