Gene Ndas_3150 details

Gene Information

Locus tag: Ndas_3150
Symbol:
ID: 9247006
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3768817
End bp: 3769839
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: cytochrome c oxidase, subunit II
Protein accession: YP_003681065
Protein GI: 297562091
COG category:
COG ID:
TIGRFAM ID:
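
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed figures) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3768817, 3769839)
    print(length)                  # 1023
    print(protein_length(length))  # 340
```

Both values agree with the Gene Length and Protein Length fields of this record.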


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTCCGA CCCGCCAAGA GAATCGGCGC CGCAACCTGC GCCGATGGGC ACCACGCGGC 
GCCGCGCTCG CCGTGCTGGG GCTGGCCGCG ACCGGCTGCG CGTCGAACGA TCTCACTCGT
TTGGGCATGC CGGAGCCGAT CACCAACCAG GCCGAGCGTG TTCTCTCGCT CTGGCAGGGC
TCCTGGGTGG CGGCTTTCGC GGTCGGCATT CTCGTGTGGG GGCTGATCGT CTGGTCGGTC
ATCTTCCACC GCAAGCGCTC TGAGCAGTTG CCGCCGCAGG TGCGGTACAA CATGCCCATC
GAAGCGCTCT ACACCGTGCT GCCGATCGTC ATCATCTCGG TGCTGTTCTT CTTCACCGCC
CGGGACCAGG CGATCCTGCT CGACACCGAC GAGCCGGCGG ACGTCAACAT CGAGGTCGTG
GCCTTCCAAT GGGCCTGGCA GTTCAACTAC CTCGACGACA AGAAGGAGAA CGGCGGGGAG
GTGCTCTTCT CCGAGACGGG TATCCCCAAC CCGGACGGCA CCGCCGACCC CTCCACCCAG
ACGACCCTGG TGCTGCCCGA GGGCGCGACC GTCCACTTCG ACCTGCACTC GCCGGACGTC
ATCCACTCGT TCTGGATCCC CGAGTTCGGT TTCAAGATGG ACGTCATCCC CGGTCGGGAC
AACGCCTTCC AGGCCGACAT CAACGAGGGC ACCGCGGGCG AGTACGTCGG CCGCTGCGCC
GAGCTGTGCG GTGTGGACCA CGCCCGCATG CTCTTCAACG TCCAGGTCCT GCCCCAGGAC
GAGTACGACG CCTGGGCCGC CGAGCAGCAG CAGGCCGCCG AGGAGGCCGA GCTGGAGGCC
GCCGACACCG GCGACACGCC GGACTCCGGT GAGGGCTCCG GTGCCGAGGG TGCGGGCGCC
GAGGGCGCCG GCACCGACGA GGAGGGCACC GGCAGCGGCG GTTCCGAGGC CGAGGGCTCC
GGTACCGACG AGCAGGACAC CGGTTCCGGC GCGTCCGACG CCGAGGAGAA TGAGCAGTCA
TGA
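
A GC content figure such as the one listed for this gene is the fraction of G and C bases in the sequence. The following is a minimal sketch, assuming that spaces, line breaks and any non-ACGT symbols should be ignored; the function name gc_content is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C among the A, C, G, T characters of a sequence.

    Whitespace and any other symbols are skipped (an assumption of this sketch).
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence above, pasted with its spacing intact.
    fragment = "GTGAGTCCGA CCCGCCAAGA GAATCGGCGC CGCAACCTGC GCCGATGGGC ACCACGCGGC"
    print(f"{gc_content(fragment):.0%}")
```

Passing the full 1023 bp sequence instead of the fragment gives the value for the whole gene.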
 
Protein sequence
MSPTRQENRR RNLRRWAPRG AALAVLGLAA TGCASNDLTR LGMPEPITNQ AERVLSLWQG 
SWVAAFAVGI LVWGLIVWSV IFHRKRSEQL PPQVRYNMPI EALYTVLPIV IISVLFFFTA
RDQAILLDTD EPADVNIEVV AFQWAWQFNY LDDKKENGGE VLFSETGIPN PDGTADPSTQ
TTLVLPEGAT VHFDLHSPDV IHSFWIPEFG FKMDVIPGRD NAFQADINEG TAGEYVGRCA
ELCGVDHARM LFNVQVLPQD EYDAWAAEQQ QAAEEAELEA ADTGDTPDSG EGSGAEGAGA
EGAGTDEEGT GSGGSEAEGS GTDEQDTGSG ASDAEENEQS
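
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard amino acid assignments, which translation table 11 shares, and treats ATG, GTG and TTG in the first position as initiators that are read as methionine. That start set is a simplifying assumption of this sketch (table 11 permits further start codons), and the names translate and START_CODONS are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiator set for this sketch; read as Met when in first position.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.upper().split())  # drop spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = "GTGAGTCCGA CCCGCCAAGA GAATCGGCGC CGCAACCTGC GCCGATGGGC ACCACGCGGC"
    print(translate(first_line))  # MSPTRQENRRRNLRRWAPRG
```

The GTG at the start of this gene is therefore read as M, matching the first residue of the protein sequence, and the final TGA codon is the stop.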