Gene Dshi_3892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3892 
Symbol 
ID5714421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp116359 
End bp117396 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content66% 
IMG OID641276805 
Productcytochrome c oxidase subunit II 
Protein accessionYP_001542101 
Protein GI159046430 
COG category[C] Energy production and conversion 
COG ID[COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2 
TIGRFAM ID[TIGR02866] cytochrome c oxidase, subunit II 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.740774 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.178097 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAG ATCTCGACAT TCTTGGCGGG CGCAGTGTGC GGGACATCGT GGGCGAGGCC 
CCGGATCTCG ACACGTCAGC CGAAGATGCT GCGCTCGAAA GCTTGGGGGA TCGCAGCGCC
CTCGACATCT GGAATTCGCA ATCCGCGCTG GAACCAAGCG GGCTGGGTGC CGCCGCCGCC
TATGACCTGA CAATCGGCAT GGTCGTGGGC CTCGGGGCGG TGTTCGTGGT CGTGATGGCG
ATCGCCTGGT TCGCATGGCA CAGCAAACGC CCAGCAGGCC ATTGGTGGGT GTGGACCGGC
GGCGTGATCA CGCCGCTCAT AGCAATCTCG ACCGTGATGG TGGCGTCGAC CGCTGCCCTT
GTGGCGACGA CGCGGCCCGC ACCCGACGCA CTGGTGATCG AGGTGACCGG CTATCAGTTC
TGGTGGGATG TGGTCTACGA TCCGGACGGG ACACCGTTGC GGGACGCCAA TGAATTGATC
CTGCCCGAGG GTCGCCCGGT CACCCTGCGT CTGAACTCCA ACGATGTGAT CCATTCCTTC
TGGGTGCCCT CGATTTCGGG CAAGATGGAC ATGATCCCCG GACGCACCAA CACTCTGACG
ATAACCGCGA CCGAAACCGG CCAGTTCCGC GGCCAATGCG CCGAGTTCTG CGGGTTGTCC
CACCCGAAAA TGGCATTCGA GGTAACGGTC CTGCCCCCCG AGGCCTTCGA CAAGTGGCTT
GCCACCACGC GCGGCGCGGC GCGCGACGTG GCCCGACCCG CGCAAGCCGA GGGACGCGAG
GTTTTCCTGA GCGCCGGCTG TGCCGCCTGT CACGAAATCC GCGGGGTCGC AGAAGGTGGG
CGGCTGGGCC CCGACCTGAC CCGTCTGGGC GCGCGCGCCA GCCTCGGCGC GGGCATGTGG
CGCATGAACC AGGGCAACGT CGCAGGCTGG ATCGCCGATG TGCAGGACAT GAAGCCCGGC
GCGCAAATGC CCTCCTACAA CCACCTCAGC GGTCCGGATC TGCGCAACCT GTCCGCTTAC
CTCGTGAGCC TGCAATGA
 
Protein sequence
MDEDLDILGG RSVRDIVGEA PDLDTSAEDA ALESLGDRSA LDIWNSQSAL EPSGLGAAAA 
YDLTIGMVVG LGAVFVVVMA IAWFAWHSKR PAGHWWVWTG GVITPLIAIS TVMVASTAAL
VATTRPAPDA LVIEVTGYQF WWDVVYDPDG TPLRDANELI LPEGRPVTLR LNSNDVIHSF
WVPSISGKMD MIPGRTNTLT ITATETGQFR GQCAEFCGLS HPKMAFEVTV LPPEAFDKWL
ATTRGAARDV ARPAQAEGRE VFLSAGCAAC HEIRGVAEGG RLGPDLTRLG ARASLGAGMW
RMNQGNVAGW IADVQDMKPG AQMPSYNHLS GPDLRNLSAY LVSLQ