Gene Ndas_5089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5089 
Symbol 
ID9248978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp232206 
End bp234056 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content71% 
IMG OID 
Productpyruvate flavodoxin/ferredoxin oxidoreductase domain protein 
Protein accessionYP_003682976 
Protein GI297564003 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.73542 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAAAC AGATCGAACA GCTCGACCGC GTCATCATCC GCTTCGCCGG GGACTCCGGC 
GACGGCATGC AGCTGACCGG TGACCGCTTC ACGCAGGAGA CGGCGTCGTT CGGCAACGAC
CTGTCCACCC TGCCGAACTT CCCCGCCGAG ATCCGCGCTC CCGCCGGAAC CCTGCCCGGG
GTGTCCAGCT TCCAGCTGCA CTTCGCCGAC CACGACATCA TGACCCCGGG GGACGCCCCG
GACGTGCTGG TCGCGATGAA CCCCGCGGCC CTCAAGGCCA ACCTGGGCGA CCTGCCCCGG
GGCGCCACCG TGATCGTCAA CACCGACGAG TTCACCAAGC GCAGCCTGGC CAAGGTCGGG
TACACGGCCG ACCCGCTGAC GGACGGGACG CTGGACGCGT TCAAGGTGAG CGCGGTGCCG
CTGACGTCCA TGACGGTGGA GGCGCTGTCC GGCGCCGACA TCTCCAAGAA GGACGCGCAG
CGGGCCAAGA ACATGTTCGC CCTGGGCCTG CTGTCGTGGA TGTACAACCG CCCGACGGAG
GGGACGACCT CGTTCCTGAA GTCGAAGTTC GCCGCCAAGC CCGACATCCT GGCCGCGAAC
CTGACCGCGT TCCAGGCGGG GTGGAACTTC GGCGAGACCA CCGAGGACTT CGCGGTCTCC
TACGAGATCA AGCCCGCGCG GCTCCCGGCG GGCACCTACC GCAACATCAC CGGCAACCTG
GCCACCGCCT ACGGGCTGAT CGCCGGGTCC GAGCGGTCGG GGCTGCCCCT GTTCCTGGGC
TCGTACCCGA TCACCCCGGC CTCGGACATC CTGCACGAGC TGTCCCGGCA CAAGCGGTTC
GGTGTGCGCA CGTTCCAGGC CGAGGACGAG ATCTCCGGTG TGGGCGCCGC CCTGGGCGCG
GCCTTCGGCG GCTCCCTGGG CGTGACCACC ACCTCGGGTC CGGGCATGGT GCTCAAGCAG
GAGACGGTGG GGTTGGCGGT GATGACCGAG CTGCCGCTGG TCATCGTGGA CGTGCAGCGG
GCCGGTCCGA GCACCGGCAT GCCCACCAAG ACCGAGCAGA CCGACCTGCT CATGGCCCTG
TACGGGCGCA ACGGCGAGTC GCCGGTGCCC GTGGTGGCGC CCGCCTCCCC CGCGGACTGC
TTCGACGCCG CGCTGGAGGC CGTGCGGATC GCGGTGCGCT ACCGCACGCC GGTGGTGGTG
CTCTCCGACG GGTACCTGGC CAACGGTTCG GAGCCGTGGC GGCTGCCGGA GGTCTCGGAG
CTGCCGCGGA TCGACCCGGC CTTCGCGACC GGGCCCAACG GGCCGGGCGG GACCTTCCTG
CCCTACCTGC GCGACGAGGA GACGCTGGCC CGCCCGTGGG CGGTGCCGGG CACGGCGGGG
CTGGAGCACC GGATCGGCGG TATCGAGAAG CACGCGCAGA GCGGCGACAT CTCGTACACG
CCCGCCAACC ACGACCTGAT GGTGCGCACG CGCCAGGCCA AGATCGACGC GATCGCCCGC
GACATCCCCG AGCTGGCGGT GGACGACCCC GGCGGGGAGG CGGACGTGCT GGTCCTGGGC
TGGGGCGGCA CGTACGGGTC GATCACCGCG GCGGTGCGCC GCGTGCGCCG CGCGGGCGGC
CGGGTGGCGC AGGCGCACCT GCGCCACCTC AACCCGTTCC CGGCCAACCT CGGAACGGTC
CTCCACCGGT ACGAGCGGGT GGTGGTCCCC GAGATCAACC TGGGCCAGCT GTCCCTGCTG
CTGCGCGGCA GGTACCTGGT CGACGTCATC GGCTACAACA AGGTCCGCGG CCTGCCCTTC
AAGGCCGAGG AGCTCGCGGG CGTGCTTCAG GAGGTCATCG ACCGTGACTG A
 
Protein sequence
MAKQIEQLDR VIIRFAGDSG DGMQLTGDRF TQETASFGND LSTLPNFPAE IRAPAGTLPG 
VSSFQLHFAD HDIMTPGDAP DVLVAMNPAA LKANLGDLPR GATVIVNTDE FTKRSLAKVG
YTADPLTDGT LDAFKVSAVP LTSMTVEALS GADISKKDAQ RAKNMFALGL LSWMYNRPTE
GTTSFLKSKF AAKPDILAAN LTAFQAGWNF GETTEDFAVS YEIKPARLPA GTYRNITGNL
ATAYGLIAGS ERSGLPLFLG SYPITPASDI LHELSRHKRF GVRTFQAEDE ISGVGAALGA
AFGGSLGVTT TSGPGMVLKQ ETVGLAVMTE LPLVIVDVQR AGPSTGMPTK TEQTDLLMAL
YGRNGESPVP VVAPASPADC FDAALEAVRI AVRYRTPVVV LSDGYLANGS EPWRLPEVSE
LPRIDPAFAT GPNGPGGTFL PYLRDEETLA RPWAVPGTAG LEHRIGGIEK HAQSGDISYT
PANHDLMVRT RQAKIDAIAR DIPELAVDDP GGEADVLVLG WGGTYGSITA AVRRVRRAGG
RVAQAHLRHL NPFPANLGTV LHRYERVVVP EINLGQLSLL LRGRYLVDVI GYNKVRGLPF
KAEELAGVLQ EVIDRD