Gene Noca_0523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0523 
Symbol 
ID4596443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp552835 
End bp554175 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID639775137 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_921752 
Protein GI119714787 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGACC AGGATCTCTA CTCCGGCTCC AGCGAGACGA CCGAGGGCCG GGTCTTCACC 
GTCACCGGGC AGGACTGGGA CTCGATCGCC GAGGGCCTGG CCGAGGACGA GGCGCAGGAG
CGCATCGTCG TCAACATGGG CCCCCAGCAC CCTTCGACCC ACGGGGTGCT CCGGCTGATC
CTCGAGCTCG AGGGCGAGAC GGTGACCGAG GCGCGGGCCG GCATCGGCTA CCTGCACACC
GGCATCGAGA AGAACATGGA GTACCGCACC TGGACGCAGG GCGTGACGTT CTGCACCCGG
ATGGACTACC TCAGCCCGTT CTTCAACGAG ATGACCTACG TGCTCGGGAT CGAGCGGCTC
CTCGACATCG AGGACCGGGT GCCCGAGAAG GCCCAGGTCA TGCGGGTCCT GCTCATGGAG
CTCAACCGGA TCTCCTCCCA CCTGGTCGCC ATCGCGACCG GTGGCATGGA GCTCGGTGCG
CTGACCGTGA TGACGATCGG CTTCCGCGAG CGCGAGCTGG TGCTCGACCT GTTCGAGCTG
ATCACCGGCC TGCGGATGAA CCACGCGTTC ATCCGTCCCG GTGGCGTCGC CCAGGACATG
CCGCCGGGCG CGCTCGACGA GATCCGCGGC TTCGTGGCGC TGATGAAGAA GCGGTTGCCG
GAGTACGCCG ACCTCTGCAA CGCGAACCCG ATCTTCAAGG GGCGCCTCGA GGGCATCGGC
CACCTCGACC TCGCCGGCTG CCTGGCGCTC GGCCTCACCG GCCCGGTGCT GCGCAGCACC
GGCTACCCGT GGGACCTGCG CAAGACCCAG CCGTACTGCG GCTACGAGAC CTACGACTTC
GACGTCCAGA CGTGGGACAC CTCCGACTCC TACGGCCGGT TCCGCATCCG CTTGAACGAG
ATGTGGGAGT CGCTGCGGAT CATCGAGCAG GCCGCCGACC GGCTGGCCGG TCTCGACGGC
GCCCCGGTGA TGATCGAGGA CAAGAAGATC GGCTGGCCCA GCCAGCTTGC GATCGGCAGC
GACGGCATGG GCAACAGCCT CGACCACATC CGCCACATCA TGGGTGAGTC GATGGAGGCG
CTGATCCACC ACTTCAAGCT GGTCACCGAG GGCTTCCGGG TGCCGCCCGG CCAGGCCTAC
GTGCCGGTGG AGTCCCCGCG TGGCGAGCTC GGCGCCCACG TCGTGTCCGA CGGCGGCACC
CGCCCGTTCC GCGCGCACTT CCGCGACCCG TCGTTCACCA ACCTGCAGGC GACCAGCGTG
ATGGCCGAGG GCGGCATGGT CGCCGACGTC ATCGTCGCGA TCGCGTCCAT CGATCCGGTC
ATGGGAGGCG TCGACCGATG A
 
Protein sequence
MADQDLYSGS SETTEGRVFT VTGQDWDSIA EGLAEDEAQE RIVVNMGPQH PSTHGVLRLI 
LELEGETVTE ARAGIGYLHT GIEKNMEYRT WTQGVTFCTR MDYLSPFFNE MTYVLGIERL
LDIEDRVPEK AQVMRVLLME LNRISSHLVA IATGGMELGA LTVMTIGFRE RELVLDLFEL
ITGLRMNHAF IRPGGVAQDM PPGALDEIRG FVALMKKRLP EYADLCNANP IFKGRLEGIG
HLDLAGCLAL GLTGPVLRST GYPWDLRKTQ PYCGYETYDF DVQTWDTSDS YGRFRIRLNE
MWESLRIIEQ AADRLAGLDG APVMIEDKKI GWPSQLAIGS DGMGNSLDHI RHIMGESMEA
LIHHFKLVTE GFRVPPGQAY VPVESPRGEL GAHVVSDGGT RPFRAHFRDP SFTNLQATSV
MAEGGMVADV IVAIASIDPV MGGVDR