Gene Rcas_3557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3557 
Symbol 
ID5541058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4641640 
End bp4643565 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content58% 
IMG OID640895676 
Productcytochrome c oxidase subunit I type 
Protein accessionYP_001433624 
Protein GI156743495 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0739643 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCG CCGAACGTAC CGCCGGAACG GTCAGCCCGC CCAGCGTCAG TGTGCGCCCC 
GGATGGGCCG GGTGGCTTAG CACGACCGAT CATAAGCGCA TCGGGATTAT GTATCTGGTC
AGCGCGTTCG TCTTTTTCCT GATCGGCGGA ATTGAGGCGC TGCTGATGCG CATTCAGCTT
GGCGTGCCTG ATAATACGTT CCTGACGCCC GATGTGTACA ACCAGATGTT TACCATGCAC
GGCACAACCA TGATTTTCCT TGGTCTGATG CCGCTGAATG TCGGGCTAGG CAACTATATC
GTGCCCCTCA TGATCGGTGC GCGCGATATG GCGTTCCCGC GCCTCAATGC GCTCAGCATC
TGGCTCTTCA TCTTCGGCGG ATTGATGCTG TATGTCAGTT TCTTTGTCGG CGGCGCGCCG
AACGTCGGCT GGTTCGCCTA TGCGCCCCTG ACACAGAAGC AGTTTGCACC GACCGCCGGC
GTCGATTACT GGATTATCGG CATCGGTCTG ACCGGTGTGG CGTCGATTGC CGGCGCGCTG
AACTTTATTG TAACGATTCT CAATATGCGC GCGCCAGGAA TGACGCTCAA CCGGATGCCG
CTCTTTGTCT GGATGCAGTT GGTCGTCGCC TTTATTCTGA TCTTTGCCTT CCCGGTGCTG
ACGGTGGCGA CAATTCAGTT GCTGTTCGAC CGGCATTTCG GCACGCGTTT CTTCCTTCCA
AATCTTGGCG GCGATGCAGT GCTCTGGCAG CATCTGTTCT GGTTCTTTGG GCATCCTGAG
GTCTACATTC TCATTCTCCC GACGATGGGG ATTATTTCGG AAGTGCTGCC AACCTTCTCG
CGTAAGCCGA TCTTCGGTTA TGCGTTTGTG GCGTATTCCG GTGTCGCCAT CGGTTTCCTC
GGCTTCCTGG TTTGGGCGCA CCATATGTTC GCCGTCGGTC TTGGTCCGCT GGCGAATGCG
TTCTTCGGCG CCGCAAGCTT CCTGATCGCC GTGCCAACCG GTGTGAAGAT TTTCAACTGG
CTGGCGACAA TGTGGCAGGG GTCGCTCAAC CTGACGACCT CGATGCTCTA TGCAATCGGG
TTCATTAGTA TGTTCATTAT CGGCGGCATT AGCGGCATCA CCCTCGCCTC ACCGCCGATT
AATGTTCAAC AGACCGACTC GTACTACGTT GTTGCCCATA TGCACTACGT GTTGTTCGGC
GGCGCGATCC AGGGCATCTT TGCTGGCGTC TTTTACTGGT TCCCAAAGAT GACCGGTCGG
ATGCTCAATG AACGACTTGG CAAATGGCAG TTCTGGTTGA TGCTGATCAG TTTCAACCTG
ACGTTCTTCC CGATGCACCT GAGCGGGAAT GAGGGCATGC CGCGCCGTAT CTACACTTAC
GAGGCGGAGA TGGGGTGGAA CCTCTGGAAC CTGATCTCGA CCATTGGCGC GTTGCTACTG
GCGGTTGCGT TCCTGCTCTT CCTCTGGAAT GTGCTCACGA GTATCCGGCG TGGTCCAATT
GCGCCTGCCG ATCCGTGGGA CGGGGCGACG CTCGAGTGGG CGGTCAGTTC GCCGCCGCCG
GTCTATAACT TCGCCGTGAC GCCACGGGTG CGCAGCCGCC GCCCGCTGTG GGACAAGAAG
TATCCGCACC TGCACGAGGC GAATGGGTCG CATGGGGCGC CTGCCGGTCG ACTGGCAGGA
CCCCTTGATG ACGGCTTTGA GCCTGAAACG TCTGCGGCGA TGGAGCCGAT CCATCTGCCC
TCACCCACCT ATGCGCCACT CATTGTGTCG GTTGGCATCA TGATCCTGGG CTTCGGCATC
ATTTATCTGG GTGATTTCGG TCTCATTGCA GCATCGGCAA TGCTTGTCGG CTTGCTCATC
ATGGCGACAG GCATTCTCAG TTGGGTGCGC ATCTCGCACG TTGACTCGCC GTATCAGGCG
CACTAG
 
Protein sequence
MAIAERTAGT VSPPSVSVRP GWAGWLSTTD HKRIGIMYLV SAFVFFLIGG IEALLMRIQL 
GVPDNTFLTP DVYNQMFTMH GTTMIFLGLM PLNVGLGNYI VPLMIGARDM AFPRLNALSI
WLFIFGGLML YVSFFVGGAP NVGWFAYAPL TQKQFAPTAG VDYWIIGIGL TGVASIAGAL
NFIVTILNMR APGMTLNRMP LFVWMQLVVA FILIFAFPVL TVATIQLLFD RHFGTRFFLP
NLGGDAVLWQ HLFWFFGHPE VYILILPTMG IISEVLPTFS RKPIFGYAFV AYSGVAIGFL
GFLVWAHHMF AVGLGPLANA FFGAASFLIA VPTGVKIFNW LATMWQGSLN LTTSMLYAIG
FISMFIIGGI SGITLASPPI NVQQTDSYYV VAHMHYVLFG GAIQGIFAGV FYWFPKMTGR
MLNERLGKWQ FWLMLISFNL TFFPMHLSGN EGMPRRIYTY EAEMGWNLWN LISTIGALLL
AVAFLLFLWN VLTSIRRGPI APADPWDGAT LEWAVSSPPP VYNFAVTPRV RSRRPLWDKK
YPHLHEANGS HGAPAGRLAG PLDDGFEPET SAAMEPIHLP SPTYAPLIVS VGIMILGFGI
IYLGDFGLIA ASAMLVGLLI MATGILSWVR ISHVDSPYQA H