Gene Noca_4289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4289 
Symbol 
ID4596804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4530345 
End bp4531850 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content72% 
IMG OID639778896 
ProductNADH dehydrogenase 
Protein accessionYP_925473 
Protein GI119718508 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.179373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGCA CCGAGGCCCG GCCGGTCGAA GCCGCGGCGG CGCAGCACCG GCACCGGGTG 
GTCGTCATCG GATCCGGGTT CGGAGGGCTG TTCGGGACCA AGGCACTGCG CCGGGTCGAC
GTGGACGTGA CGATGGTCGC GAAGACCACG CACCACCTGT TCCAGCCGCT GCTCTACCAG
GTCGCGACCG GGATCCTCAG CCAGGGCGAG ATCGCGCCGC CCACCCGCGA GGTCCTCAGC
AGTCAGCGGA ACGTCACCGT GCTGCTGGGC GAGGTCAGCG GGATCGACCT CGCCGCGCGG
ACCGTCACCT CCCAGGTCCT GGGCCGTCCG ACGGTGACGC CGTACGACTC CCTGATCGTG
GCCGCGGGCG CCGGCCAGTC CTACTTCGGC AACGACCAGT TCGCCGAGTA CGCGCCCGGG
ATGAAGAGCA TCGACGACGC GCTCGAGCTG CGCGGCCGGA TCTTCGGCGC CTTCGAGCTG
GCCGAGCTCG GGGCCGCGCG CGGCGACCAC ATCGACCACC TGCTCACGTT CGTGGTGGTC
GGCGCCGGCC CGACGGGCGT GGAGATGGCC GGGCAGATCG CCGAGCTCGC GCACCGCACC
CTGCGCAAGG ACTTCCACCA CATCAACACC CGCACCGCCC GGGTGATCCT CGTCGACGCC
GCCCCGCAGG TGCTGCCGCC GTTCGGGGCG AAGCTCGGGG CGAAGACCAA GACCGAGCTG
GAGAAGCTCG GCGTCGAGGT GGTGCTCGGC GCGATGGTGA CCGACGTCGA CGAGCGCGGC
ATCGAGATGA AGTTCAAGGA CGGCCGGGTC GAGCGGGTCG ACACCGTCAC CAAGATCTGG
GCCGCGGGGG TCCAGGCCAG CCCGCTGGGC CGCACCCTCT CCGAGCAGAC CGGCGCGCCC
CTCGACCGGG CCGGCCGGAT CGCCGTCAAC CCCGACCTGA CCCTGCCCGG GCACCCCGAG
GTGTTCGTGG TCGGCGACAT GATCGCCCTG GACAACCTCC CCGGCGTCGC GCAGGTCGCG
ATCCAGGGAG CGAGGTACGC CGCCGAGGAG ATCGAGCGGC GGCTGCGGTC CAAGCCCTCG
CAGGGGCCGT TCAAGTACTT CGACAAGGGT TCGATGGCGA TCATCAGCCG GTTCCGCGCG
GTCGCGATGA TCGGCCGGGT CCGGGTCACC GGGGTGCTCG CCTGGCTGAT GTGGCTGGGC
CTGCACCTGG TGTACATCAC CGGCTTCAAG AGCCGGGTCA CGGCGCTGCT GCACTGGGCG
GTCTCGTTCG TCGGCCGCGG CCGGGCCGAG CGGACGACCA CCGAGCAGCA GATCTTCGCG
CGCAGCGCGC TCGGCCGGCT CGAGCACGGC GCCGCCGACC TGGTCTCCGA CCCCGGGGCG
TACGACGCCA CCCGGGAGCT GCTCGAGACC ACGCGCCGGG CCGAGCTCGA GGCGCAGGCC
CTCGAGGAGG CGCGGCTCAC CGATGCCGGC GAACGGGGCG TGAGGACCGG CGACCGCGCC
GGCTGA
 
Protein sequence
MAGTEARPVE AAAAQHRHRV VVIGSGFGGL FGTKALRRVD VDVTMVAKTT HHLFQPLLYQ 
VATGILSQGE IAPPTREVLS SQRNVTVLLG EVSGIDLAAR TVTSQVLGRP TVTPYDSLIV
AAGAGQSYFG NDQFAEYAPG MKSIDDALEL RGRIFGAFEL AELGAARGDH IDHLLTFVVV
GAGPTGVEMA GQIAELAHRT LRKDFHHINT RTARVILVDA APQVLPPFGA KLGAKTKTEL
EKLGVEVVLG AMVTDVDERG IEMKFKDGRV ERVDTVTKIW AAGVQASPLG RTLSEQTGAP
LDRAGRIAVN PDLTLPGHPE VFVVGDMIAL DNLPGVAQVA IQGARYAAEE IERRLRSKPS
QGPFKYFDKG SMAIISRFRA VAMIGRVRVT GVLAWLMWLG LHLVYITGFK SRVTALLHWA
VSFVGRGRAE RTTTEQQIFA RSALGRLEHG AADLVSDPGA YDATRELLET TRRAELEAQA
LEEARLTDAG ERGVRTGDRA G