Gene Ndas_1931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1931 
Symbol 
ID9245781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2352452 
End bp2354134 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content65% 
IMG OID 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_003679864 
Protein GI297560890 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.768884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.452055 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACCAG CGAAAACCGA GGCAGAGGAG TCCTCCTCCG TGAAGGCCCC CAAGGGGTCG 
ATCATCGTGA GCTGGCTGAC CTCGACCGAC CACAAGGTCA TCGGGTACAT GTACATCATC
ACCGCCTTCG CCTTCTTCGT CTTCGGCGGA ATCCTGGCGG TGCTCATCCG CGCCGAACTG
TTCTTTCCGG GCATGCAGAT CATGTCCAAC GAGGAGTACA ACCAGCTGTT CACCATGCAC
GGCACGATCA TGCTGCTGCT CTTCGCGACC CCGCTGTTCG TCGGGTTCGG CAACGTGATC
ATGCCGTTGC AGATCGGCGC GCCCGACGTG GCGTTCCCGA GGATGAACCT GTTCGGCTAC
TACCTGTTCC TGTTCGGCGG GCTGATCGTC TCGGCCGGGT TCCTGACCCC GGGCGGCGCG
GCCAGCTTCG GCTGGTTCGC CTACACGCCC CTGTCGAACG AGGTCCACTC ACCGGGGGTG
GGGGGCAACC TGTGGATCAT GGGGTTGGTG GTGTCGGGTC TGGGCACGAT CCTGGGCGCG
GTCAACTTCA TCACCACCGC CCTGTGCATG CGCGCGCCGG GCATGACGAT GTTCCGCATG
CCGATCTTCA CCTGGAACAT CATCCTGACC AGCGTGCTGG TGCTCATCGC GTTCCCCGTG
CTGACGGCGG CCCTGATCGC GCTGGGCGCG GACCGCATCG TCGGCACGCA GGTGTTCAAC
GCCGAGCACG GCGGGGCCAT CCTGTGGCAG CACCTGTTCT GGTTCTTCGG CCACCCCGAG
GTGTACATCA TCGCGCTGCC GTTCTTCGGC ATCGTGACCG AGATCCTGCC GGTGTTCAGC
CGCAAGCCGA TCTTCGGCTA CAAGGGGCTG GTGGCGGCGA CCATCGCCAT CACGGGCCTG
TCGGTGACGG TGTGGGCGCA CCACATGTTC CCGACGGGTG CGGTGCTTCT GCCGTTCTTC
TCGTTCATGA GCTTCCTCAT CGCGGTGCCG ACCGGGGTGA AGTTCTTCAA CTGGATCGGT
ACGATGTGGC GGGGCCAGAT CAGCTTCGAG ACGCCGATGC TGTTCTCGAT CGGGTTCCTA
GTGACCTTCC TGTTCGGCGG TCTGACCGGT GTGCTGCTGG CCTCCCCGCC GATCGACTTC
CACGTCACCG ACTCCTACTT CGTGGTGGCC CACTTCCACT ACGTGGTGTT CGGCACGGTG
GTGTTCGCGA TGTTCGCGGG CTTCTACTTC TGGTGGCCCA AGTTCACCGG GACGATGCTC
AACGAGAAGT TGGGCAAGTT CCACTTCTGG CTGCTGTTCC TGGGCTTCCA CGGCACGTTC
CTGGTGCAGC ACTGGCTGGG CGCCGCCGGC TTCCCGCGCC GCTACGCCGA CTACCTGCCC
GGTGACGGCT TCACCGAGCT CAACCAGATC TCCTCGGTCT CCTCGTTCGT GCTGGCGGCC
TCGACGCTGA TCTTCTTCTG GAACGTCTTC GTGACCGCGC GCAACGCTCC CCAGGTGGGG
ATGGACGACC CGTGGGGCTA CGGCTGCTCG CTGGAGTGGG CCACGTCCTG CCCGCCGCCG
CGGCACAACT TCACGTCGCT GCCGCGGATC CGTTCCGAGC GTCCCGCGTT CGACCTGAAC
CACCCGCATG TGGCGTCGCG GGCCCTGGAC TCCGGGCGCG AGGAGAGCGC CCCGAGGAGC
TGA
 
Protein sequence
MAPAKTEAEE SSSVKAPKGS IIVSWLTSTD HKVIGYMYII TAFAFFVFGG ILAVLIRAEL 
FFPGMQIMSN EEYNQLFTMH GTIMLLLFAT PLFVGFGNVI MPLQIGAPDV AFPRMNLFGY
YLFLFGGLIV SAGFLTPGGA ASFGWFAYTP LSNEVHSPGV GGNLWIMGLV VSGLGTILGA
VNFITTALCM RAPGMTMFRM PIFTWNIILT SVLVLIAFPV LTAALIALGA DRIVGTQVFN
AEHGGAILWQ HLFWFFGHPE VYIIALPFFG IVTEILPVFS RKPIFGYKGL VAATIAITGL
SVTVWAHHMF PTGAVLLPFF SFMSFLIAVP TGVKFFNWIG TMWRGQISFE TPMLFSIGFL
VTFLFGGLTG VLLASPPIDF HVTDSYFVVA HFHYVVFGTV VFAMFAGFYF WWPKFTGTML
NEKLGKFHFW LLFLGFHGTF LVQHWLGAAG FPRRYADYLP GDGFTELNQI SSVSSFVLAA
STLIFFWNVF VTARNAPQVG MDDPWGYGCS LEWATSCPPP RHNFTSLPRI RSERPAFDLN
HPHVASRALD SGREESAPRS