Gene Ndas_3149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3149 
Symbol 
ID9247005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3767144 
End bp3768820 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content65% 
IMG OID 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_003681064 
Protein GI297562090 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCA CCACGTCAAG CCCCACTCTC GCGGGCGGTC AGGCACCGTC GCGCAAGGGG 
TCGATGATCG TCAAATGGAT GACGTCCACC GACCACAAGG TCATTGGGTA CATGTACCTG
ATCACCTCCT TCGTGTTCTT CCTCTTCGGT GGTCTGCTCG CGGTGCTCAT GCGCATCGAG
CTGTTCTTCC CGGGCATGCA GGCCATGTCC AACGAGCAGT TCAACCAGCT GTTCACCATG
CACGGCACGA TCATGCTGCT GATGTTCGCG ACCCCGCTGT TCGTCGGGTT CTCGAACGTG
ATCATGCCGT TGCAGATCGG TTCGCCCGAC GTGGCGTTCC CGAGGATGAA CCTGTTCAGC
TACTACCTGT TCCTGTTCGG CAGCCTCATC GCCATCAGCG GGTTCCTGAC CCCGGGCGGC
GCGGCCAGCT TCGGCTGGTT CGCCTACACG CCGCTGTCGG ACGCGGTGCG TTCGCCGGGT
CTGGGCGGTG ACCTGTGGAT CCTGGGCCTG GTGGTCTCGG GTCTGGGCAC CATCCTGGGC
GCGGTCAACT TCATCACGAC CGGCCTGTGC ATGCGCGCGC CCGGCATGAC GATGTTCCGC
ATGCCGATCT TCACCTGGAA CACCCTGCTC ACCAGCGTGC TGGTGCTCAT CGCGTTCCCG
GTCCTGACCG CGGCCCTGAT CGCGCTGGGC GCGGACCGCA TCGTCGGCAC CCAGGTGTTC
AACGCCGAGC ACGGCGGGGC CATCCTGTGG CAGCACCTGT TCTGGTTCTT CGGCCACCCC
GAGGTGTACA TCATCGCGCT GCCGTTCTTC GGCATCGTGA CCGAGATCAT CCCGGTGTTC
AGCCGCAAGC CGATCTTCGG CTACAAGAGC CTGGTCGCGG CGACCATCGC CATCACCGGC
CTGTCGGTCA CCGTGTGGGC CCACCACATG TTCCCGACCG GCGCGGTCCT GCTGCCGTTC
TTCTCGTTCA TGAGCTTCCT CATCGCGGTC CCGACCGGCG TGAAGTTCTT CAACTGGATC
GGCACCATGT GGCGGGGCCA GATCACCTTC GAGACGCCGA TGCTGTTCGT CATCGGCTTC
CTGGTGACCT TCCTGTTCGG TGGTCTGACC GGTGTGCTGC TGGCCTCCCC GCCGATCGAC
TTCCACGTCA CCGACTCCTA CTTCGTGGTG GCCCACTTCC ACTACGTGGT GTTCGGCACC
GTGGTGTTCG CGATGTTCGC GGGCTTCTAC TTCTGGTGGC CCAAGTTCAC CGGCAAGATG
CTCAACGAGA AGCTGGGCAA GTTCCACTTC TGGCTGCTGT TCCTGGGCTT CCACGGCACG
TTCCTGGTGC AGCACTGGCT GGGCGCCGCC GGCTTCCCGC GCCGCTACGC CGACTACCTG
CCCAGTGACG GCTTCACCGA GCTCAACCAG ATCTCCTCGG TCTCCTCGTT CGTGCTGGCG
GCCTCGACGC TGATCTTCTT CTGGAACATG TACATCACCT CCAAGAAGGC GCCGCTGGTC
ACCGTGGACG ACCCGTGGGG TTACGGCTGC TCGCTGGAGT GGGCCACGTC CTGCCCGCCG
CCGCGGCACA ACTTCACGTC GCTGCCGCGG ATCCGCTCCG AGCGTCCCGC GTTCGACCTG
AACCACCCGC ACGCCGCCGC CCCGGGCGCC GTCCCGGTGG GCGCCACGAA GGAGTAG
 
Protein sequence
MSTTTSSPTL AGGQAPSRKG SMIVKWMTST DHKVIGYMYL ITSFVFFLFG GLLAVLMRIE 
LFFPGMQAMS NEQFNQLFTM HGTIMLLMFA TPLFVGFSNV IMPLQIGSPD VAFPRMNLFS
YYLFLFGSLI AISGFLTPGG AASFGWFAYT PLSDAVRSPG LGGDLWILGL VVSGLGTILG
AVNFITTGLC MRAPGMTMFR MPIFTWNTLL TSVLVLIAFP VLTAALIALG ADRIVGTQVF
NAEHGGAILW QHLFWFFGHP EVYIIALPFF GIVTEIIPVF SRKPIFGYKS LVAATIAITG
LSVTVWAHHM FPTGAVLLPF FSFMSFLIAV PTGVKFFNWI GTMWRGQITF ETPMLFVIGF
LVTFLFGGLT GVLLASPPID FHVTDSYFVV AHFHYVVFGT VVFAMFAGFY FWWPKFTGKM
LNEKLGKFHF WLLFLGFHGT FLVQHWLGAA GFPRRYADYL PSDGFTELNQ ISSVSSFVLA
ASTLIFFWNM YITSKKAPLV TVDDPWGYGC SLEWATSCPP PRHNFTSLPR IRSERPAFDL
NHPHAAAPGA VPVGATKE