Gene TM1040_2291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2291 
Symbol 
ID4078475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2409675 
End bp2411333 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content59% 
IMG OID638007613 
Productcytochrome-c oxidase 
Protein accessionYP_614285 
Protein GI99082131 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.40554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACG CAGCCATTCA AGGTCACGGC CACGAAGACG AGCGGAGCTT TTTCACCCGC 
TGGTTTATGA GCACGAACCA CAAGGATATC GGTATCCTTT ACCTGATCGT TTCGGCGCTC
ACCGGCTTCA TTTCCGTCGC ATTCACCGTC TACATGCGAC TCGAACTTAT GGATCCCGGT
GTGCAGTACA TGTGTCTGGA AGGCTTTGCC GCAGACCCCT GTACGCCGAA TGGCCATCTC
TGGAACGTGC TGATTACAGG CCACGGCGTC TTGATGATGT TCTTCGTCGT CATCCCGGCC
CTGTTCGGCG GGTTTGGCAA CTATTTCATG CCGCTGCAGA TCGGCGCGCC GGATATGGCG
TTCCCGCGGA TGAACAACCT CAGCTTCTGG ATGTATGTTG CAGGCACCGC GCTGGCGGTC
TCCTCGGTCT ATGCGCCGGG CGGCAACAAC CAGCTGGGCG CTGGCGTGGG TTGGGTTCTC
TATCCGCCGC TCTCCGTCAA GGAAGGCGGG ATCGCGATGG ATCTGGCGAT TTTCGCCGTG
CACGTCTCCG GTGCCTCCTC GATCCTGGGC GCGATCAACA TGATCACCAC CTTCCTGAAC
ATGCGTGCCC CCGGCATGAC CCTGTTCAAG GTGCCGCTCT TCAGCTGGTC GATCTTCGTC
ACCTCCTGGC TGATCCTTCT GTCCCTGCCG GTTCTGGCAG GCGCAATCAC CATGCTGCTG
ATGGATCGTA ACTTCGGCTT CACCTTCTTT GATGCCGCAG GCGGCGGGGA CCCGGTTCTC
TACCAGCACA TCCTGTGGTT CTTCGGCCAC CCCGAAGTGT ACATCGTGAT CCTGCCGGGC
TTTGGCATCA TCTCCCACGT GATCGCGACC TTCTCGCGCA AGCCTGTGTT TGGCTACCTG
CCGATGGTCT GGGCGATCAT CGCGATTGGT GTTCTGGGCT TCGTCGTTTG GGCGCACCAC
ATGTACACCG TCGGCATGAG CCTCAACCAG CAGGCCTACT TCATGCTGGC CACCATGGTG
ATCGCCGTGC CCACAGGGGT GAAGGTCTTC TCGTGGATCG CAACCATGTG GGGCGGCTCC
ATCGAGTTCA AAGCCCCCAT GGTCTTTGCT TTCGGCTTCC TGTTCCTGTT CACCGTTGGC
GGTGTGACCG GCGTGGTGCT GTCGCAGGCG GCCGTGGACC GGGCCTATCA TGACACCTAT
TACGTGGTGG CACACTTCCA CTACGTGATG AGCCTTGGTG CGGTGTTTGC GATCTTCTCC
GGCATCTACT TCTACTTTGG TAAGATGACC GGCCGTCAGT ACTCCGAACT GGGCGCACAG
ATTCACTTCT GGATGTTCTT CATCGGTGCA AACCTGACGT TCTTCCCGCA GCACTTCCTG
GGCCGTCAGG GCATGCCGCG TCGCTACATC GACTATCCCG AAGGCTTTGC ATACTGGAAC
AAGATCTCGT CCTATGGCGC GTTCCTGTCC TTTGCCTCCT TCATCTTCTT CTTCGGTGTG
GTGATCTATT CACTGCTGCG TGGCGCGCGT GTGACCCAGA ACAACTACTG GAACGAATAC
GCCGACACGC TGGAGTGGAC CCTGCCCTCT CCGCCGCCGG AGCACACCTT TGAAATCCTG
CCCAAGCAGG AAGACTGGGA CAAAAGCCAC AGCCACTAA
 
Protein sequence
MADAAIQGHG HEDERSFFTR WFMSTNHKDI GILYLIVSAL TGFISVAFTV YMRLELMDPG 
VQYMCLEGFA ADPCTPNGHL WNVLITGHGV LMMFFVVIPA LFGGFGNYFM PLQIGAPDMA
FPRMNNLSFW MYVAGTALAV SSVYAPGGNN QLGAGVGWVL YPPLSVKEGG IAMDLAIFAV
HVSGASSILG AINMITTFLN MRAPGMTLFK VPLFSWSIFV TSWLILLSLP VLAGAITMLL
MDRNFGFTFF DAAGGGDPVL YQHILWFFGH PEVYIVILPG FGIISHVIAT FSRKPVFGYL
PMVWAIIAIG VLGFVVWAHH MYTVGMSLNQ QAYFMLATMV IAVPTGVKVF SWIATMWGGS
IEFKAPMVFA FGFLFLFTVG GVTGVVLSQA AVDRAYHDTY YVVAHFHYVM SLGAVFAIFS
GIYFYFGKMT GRQYSELGAQ IHFWMFFIGA NLTFFPQHFL GRQGMPRRYI DYPEGFAYWN
KISSYGAFLS FASFIFFFGV VIYSLLRGAR VTQNNYWNEY ADTLEWTLPS PPPEHTFEIL
PKQEDWDKSH SH