Gene Ndas_5058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5058 
Symbol 
ID9248947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp198017 
End bp199336 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content71% 
IMG OID 
ProductNADH-quinone oxidoreductase, F subunit 
Protein accessionYP_003682945 
Protein GI297563972 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCC TGACCCCGAT CCTGTCGCGC GACTGGGACC GCCCCGACTC CTTCACCCTG 
GAGGGCTACC GGGCCACCGG CGGCTACGGC GCGCTGCGCA AGGCGCTGGC CATGGAGCCC
GACGCCATCG TGGACCTGGT CAAGAAGTCC GGCCTGCGCG GTCGCGGCGG CGCCGGGTTC
CCCACCGGGA TGAAGTGGGG CTTCCTGCCC GCGGACAACC CCAACCCGCG CTACCTCGTC
GTCAACGCCG ACGAGTCCGA GCCGGGCACC TGCAAGGACA TCCCGCTCAT GCTCGCCAAT
CCGCACGTGC TGGTGGAGGG GGTGGCGATC GCCGCGTACG CGATCAAGTC CAGCCAGGCC
TTCATCTACG TGCGCGGCGA GGTCCTGCAC GTCATCCGGC GCCTGCGCCG GGCCGTCGCC
GAGGCCTACG AGGCCGGGCT GCTCGGCAGG GACGTCCTGG GCACCGGCTT CGACCTCGAC
GTCGTCGTCC ACGCCGGGGC GGGCGCCTAC ATCTGCGGTG AGGAGACCGC GCTGCTGGAC
TCCCTGGAGG GCTACCGGGG CCAGCCCCGG CTCAAGCCGC CCTTCCCCGC GGTCGCCGGG
CTCTACGCCT CGCCGACCGT GGTCAACAAC GTCGAGTCCA TCGCCAGCGT GCCGAGCATC
GTGGCCAACG GCGCCGAATG GTTCACGTCC ATGGGCACCG AGAAGTCCGC CGGGTTCGGC
TTCTTCTCCC TGTCCGGGCA CGTGGCCAAC CCCGGCCAGT ACGAGGCCCC GCTGGGCGTG
ACCCTGCGCG AGCTGCTCGA CATGTCGGGC GGGATGCGGC CCGGGCACCG GCTCAAGTTC
TGGACGCCCG GCGGCTCCTC CACGCCGATC TTCACCGAGG AGCACCTGGA CACCCCGCTC
GACTTCGAGT CGGTGGGCGC CGCCGGGTCC ATGCTCGGCA CCCGCGCCCT CCAGATCTTC
GACGAGACGA CCTGCGTGGT GAAGGCCGTC GGCCGCTGGA TCGCCTTCTA CGCCCACGAG
TCCTGCGGCA AGTGCACGCC CTGCCGCGAG GGCAACTTCT GGATGGTCCA GGTCCTGGAC
CGGCTGGAGA ACGGGCAGGG CACCGAGGCC GACCTCGACA AGCTCCTGGA CATCTGCGAC
AACCTGCTCG GCCGCTCCTT CTGCGCCCTC GGCGACGGCG CGACCAGCCC GGTGACCTCC
TCGATCAAGC ACTTCCGCCA GGAGTACATC GACCACGTGG AGCGGGGCGG CTGCCCCTTC
GACCACTCCC GGGCCACCCT CTGGGGCGAC CGGCCGACCA CGACGGGAGG ACAGCAGTGA
 
Protein sequence
MTTLTPILSR DWDRPDSFTL EGYRATGGYG ALRKALAMEP DAIVDLVKKS GLRGRGGAGF 
PTGMKWGFLP ADNPNPRYLV VNADESEPGT CKDIPLMLAN PHVLVEGVAI AAYAIKSSQA
FIYVRGEVLH VIRRLRRAVA EAYEAGLLGR DVLGTGFDLD VVVHAGAGAY ICGEETALLD
SLEGYRGQPR LKPPFPAVAG LYASPTVVNN VESIASVPSI VANGAEWFTS MGTEKSAGFG
FFSLSGHVAN PGQYEAPLGV TLRELLDMSG GMRPGHRLKF WTPGGSSTPI FTEEHLDTPL
DFESVGAAGS MLGTRALQIF DETTCVVKAV GRWIAFYAHE SCGKCTPCRE GNFWMVQVLD
RLENGQGTEA DLDKLLDICD NLLGRSFCAL GDGATSPVTS SIKHFRQEYI DHVERGGCPF
DHSRATLWGD RPTTTGGQQ