Gene Ndas_4894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4894 
Symbol 
ID9248781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp23926 
End bp26886 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content72% 
IMG OID 
ProductNADH/Ubiquinone/plastoquinone (complex I) 
Protein accessionYP_003682783 
Protein GI297563810 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.155841 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAACAG TGCTGATCGC GCATTTCATC GCAGCTGCGA TCGCGCCGTT ACTGGTGCGC 
CAGTGGGGCC GTAACGCCTT CCTCGCGCTC GCCGTCGTCC CGGGTGTGTC CACCCTCTGG
GCCCTCTTCC AGATCCCCGC GATCATGCGC GGCGAGACCC TCGCCGTCTC CATCCCCTGG
GCACCGGAGT TCTCCCTCCG GCTCGGCCTG TACATGGACG CCCTGGGCCT GGTGATGACG
CTCATCGCCG CGGGCGTGGG CGCGCTGATC CTGATCTACT GCGCCCGCTA CTTCGACGAC
TCCGAACCCG GTCTGGCCCG CTTCGCCGGG GTGTTCGTCG CCTTCGCCGG GGCCATGCTG
GGCCTGGTGC TGGCCGACGA CCTCATCCAG CTCTTCGTCT ACTGGGAGCT GACCACGGTC
TTCTCCTACC TGCTCATCGG GCACAGCACG GAGCTGAAGG AGAGCCGCCG GGCGGCGATG
ACGGCCCTGA CCGTCACCAC CTTCGGCGGA CTGGCGATGC TCGTGGGCAT GATCATGCTC
GGCGAGACCG CCGGGACCTA CGTGATCTCC GAGATCGTCG CCTCGCCGCC GGCCGGCGCC
CTGGTCAACG TCGCGCTCGT GCTCATCATG GTCGGCGCGA TGTCCAAGTC GGCCCTGCTG
CCGTTCAGCC TGTGGCTCCC CGCCGCCATG CGCGCCCCCA CGCCGGTCTC CGGCTACCTG
CACGCCGCGG CGATGGTCAA GGCGGGCGTC TACCTGGTGG CCCGGCTCAC CCCCGCCTTC
CACGACGTCG CCGTGTGGAA GTACACGGCC CTGTTCTTCG GCACGCTCAC CATGGTCCTC
GCGGGCTGGA AGGCGCTGCG CCAGTACGAC CTCAAGCTGG TCCTGGCCTA CGGCACCATC
AGCCAGCTGG GCTTCCTCAT CACCCTGCTG GGCGCCGGAA CCCAGGCCGG CGCCCTGGCC
GGGATCTCGA TGCTCATCGC GCACTCGCTG TTCAAGGCGC CGCTGTTCCT CGTGGTCGGC
GTCATCGACC ACAGCACCGG GACCCGCGAC CTGCGCGAGC TGTCCGGCCT GCGCTCCTCC
ATGCCCGTGA CCTTCTGGAC CTCGGTCGTG GCTCTGGCCT CCATGGCGGG TCTGCCGCCC
ACCCTCGGGT TCGCCGCCAA GGAGCTGGCC TTCGGCGCCT TCGAGCACGG CGGAGGCGCC
GACATCGCCG TCCTGGCGGG CATCGTGATC GGCTCCACCT TCACCGTCGC CTACAGCCTG
CGCTTCCTGT GGGGCGCCTT CTGGGACAAG GAGAACGTCC CGGCCACCCC GCTGCACGCC
CCCGGCCCGC TCCTCCAGGC GCCCGCCGCG GTCCTGGCCG GGCTCGGCCT GCTCGGCGGC
CTGCTCTCCA CCCTCGTGGA CCCCTTCCTG GCCGTGTACG CCGACACCGT CCCCGTCGCC
GAGGGCGACT ACGCCAAGCA CCTGGCCCTG TGGCACGGCC TGGGACTGCC CCTGCTGCTG
TCGGCGGTGT GCGTCGCGGG CGGCCTGGCG CTGTTCCGGT TCCGCCACGA GATCGTGTGG
GCGGGAACGC GGACCGCGCT GCCCGACGCC GACCGGGTCT ACCGCCGGAT GCTGGCCGCG
CTGGAGAACC TGGCCCTCCA GGTCACCGGC GGCACCCAGC GCGGTTCGCT GCCGGTCTAC
CTCGGCACCA TCCTGGTCAC GCTGGTCGTC GCCGCCGGGT GGTGGATGGT CAGCGGCCGG
ATCTGGGAGG GGGACTACCC CGCGGTGCGC CTGTGGGACA GCCCGGTCCA GATCATCCCG
GTCCTGATCA TCGGCGCGTC CGCCCTGCTG ACGCTGTTCG TGCGCCGCCG CCTGTTCGCG
GTGATCCTCG TGGGCACCGG CGGCTACGGC GTCGCCGCCC TGTTCTACCT GATGGGCGCC
CCCGACATCG CGCTCACCCA GTTCCTCGTG GAGACCGTCA GCCTGGTGGT CTTCGTGCTG
GTGCTGCGCC GCTTCCCGGC CCGCTTCTCG GCGCCCGCGC TGCGCGGGCG CCGGGTGTGG
AACCTGGCCC TGGGCGCGGC CACCGGAGTC CTGGTGGCCG CGATGACCTG GTTCGCCCTC
GCGGGCCGCC AGGAGCCCTC CATCTCGGCG GGCTACCCGG CCGCGGCCGA GGAGGCGGGC
GGCTACAACA TCATCTCGGT CCTGCTGGTC GACGTCCGGG CCTGGGACAC CATGGGCGAG
ATCTCGGTGC TCGCCGCCGC GGCCGCCGGC GTGGCCAGCC TGATGTTCGT GCGCCGCCGC
GCCCAGCCGC GCAAGACGCC GGGCGGGGTG ATGGCCCTGT CGGTCACCGA ACCGCCGCCG
CCCGCCGACG GCCAGCGCGG ACTCGACCTC GGCGGCCTGC GGTTCGCCCC CCGCAGGATG
CACGTCGAGC CCCGGTGGGC GCGCACCTGG CTGCCGGGCG CCGACGCCCT GCCCACCGAG
CGCCGGTCGG TCATCTTCGA GGTCGTCTCG CGCTTCCTGT TCCCCGTGAT CATGGTGATC
TCGGTGTACC TGCTGCTCAC CGGCCACACC GCCATCGGCG GGGGCTTCGC GGGCGGCATC
GTCGCGGGCC TGGCGTTCAT CGTCCGCTAC CTGGCCGGAG GCCGGTTCGA GCTGTACGCC
ACCGCGTGGG TGCAGCCGGG CGCGCTGATC GGCGCGGGCC TGGCCGTGGC CACCGGCACC
GCGCTGGGCG GGGCCGTCTT CGGCACCGAC GTCCTCGCGG GAGGCGACAC CTACATCGAC
TTCTGGATCC TGGGCGAGGC CCACGTCACG GTGTCCATGC TCTTCGACAT CGGCGTGTAC
CTGCTGGTGA TCGGGCTCAT CCTCGACATC CTGCGCAGCC TCGGCGCCCG CATCGACGAG
CAGATCGAGC GGGACGCCGC GCACTCGGAG GCCCAGGCCA GGACCGGGGG ACCCGAACGC
CCCGAGGAGG TCATCTCGTG A
 
Protein sequence
MTTVLIAHFI AAAIAPLLVR QWGRNAFLAL AVVPGVSTLW ALFQIPAIMR GETLAVSIPW 
APEFSLRLGL YMDALGLVMT LIAAGVGALI LIYCARYFDD SEPGLARFAG VFVAFAGAML
GLVLADDLIQ LFVYWELTTV FSYLLIGHST ELKESRRAAM TALTVTTFGG LAMLVGMIML
GETAGTYVIS EIVASPPAGA LVNVALVLIM VGAMSKSALL PFSLWLPAAM RAPTPVSGYL
HAAAMVKAGV YLVARLTPAF HDVAVWKYTA LFFGTLTMVL AGWKALRQYD LKLVLAYGTI
SQLGFLITLL GAGTQAGALA GISMLIAHSL FKAPLFLVVG VIDHSTGTRD LRELSGLRSS
MPVTFWTSVV ALASMAGLPP TLGFAAKELA FGAFEHGGGA DIAVLAGIVI GSTFTVAYSL
RFLWGAFWDK ENVPATPLHA PGPLLQAPAA VLAGLGLLGG LLSTLVDPFL AVYADTVPVA
EGDYAKHLAL WHGLGLPLLL SAVCVAGGLA LFRFRHEIVW AGTRTALPDA DRVYRRMLAA
LENLALQVTG GTQRGSLPVY LGTILVTLVV AAGWWMVSGR IWEGDYPAVR LWDSPVQIIP
VLIIGASALL TLFVRRRLFA VILVGTGGYG VAALFYLMGA PDIALTQFLV ETVSLVVFVL
VLRRFPARFS APALRGRRVW NLALGAATGV LVAAMTWFAL AGRQEPSISA GYPAAAEEAG
GYNIISVLLV DVRAWDTMGE ISVLAAAAAG VASLMFVRRR AQPRKTPGGV MALSVTEPPP
PADGQRGLDL GGLRFAPRRM HVEPRWARTW LPGADALPTE RRSVIFEVVS RFLFPVIMVI
SVYLLLTGHT AIGGGFAGGI VAGLAFIVRY LAGGRFELYA TAWVQPGALI GAGLAVATGT
ALGGAVFGTD VLAGGDTYID FWILGEAHVT VSMLFDIGVY LLVIGLILDI LRSLGARIDE
QIERDAAHSE AQARTGGPER PEEVIS