Gene TM1040_2545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2545 
Symbol 
ID4076676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2685879 
End bp2687483 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content58% 
IMG OID638007869 
Productcbb3-type cytochrome c oxidase subunit I 
Protein accessionYP_614539 
Protein GI99082385 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3278] Cbb3-type cytochrome oxidase, subunit 1 
TIGRFAM ID[TIGR00780] cytochrome c oxidase, cbb3-type, subunit I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.199988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.933965 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAATT ATCTCAAGCT GATAGTGCTT GGCGTATTGA CGGTCTTCTT CATGATCGCC 
GCCGGCTACG CCAGGGATCT GGCGTATCAG GTACATGCGG TGATCTTGAT GCTCGTCGCA
GGTGGCCTGT TCTTGTGGAC ACTGCGCGAG ACCGACGAGC CGGTTCCGGC TCCCGTCACC
GGGGAATATC TCGATGGCGT GGTGCGCGCA GGTGTCATTG CAACCGCATT CTGGGGTGTC
GCGGGGTTCC TTGTGGGGAC CTTCATTGCG TTCCAGCTGG CTTTTCCGGT TCTGAATTTT
GATTGGTCCG AAGGGTTTGC GAATTTTGGC CGCCTGCGTC CGCTGCACAC ATCTGCAGTG
ATTTTTGCGT TTGGCGGTAA CGCGTTGATC GCGACGTCCT TTTACATCGT GCAGCGCACC
TCGGCGGCAC GACTCTGGGG CGGCAACCTC GCGTGGTTTG TCTTCTGGGG CTATCAGCTG
TTTATCGTGC TTGCGGCAAC CGGCTATCTC TTGGGCGGAA CGCAGTCCAA GGAATACGCT
GAGCCCGAGT GGTACATCGA CCTGTGGCTG ACCGTTGTCT GGGTGGCCTA TCTGGCGGTG
TTCCTCGGTA CCATCATTCG CCGCAAAGAG CGCCACATCT ATGTGGCCAA CTGGTTTTTC
CTCGCCTTCA TCGTGACCGT CGCGATGCTG CACCTCGTCA ACAACCTGAC CATTCCTGTT
TCGATCTGGG GCTCCAAGTC CGTGATTGTC TGGCCTGGTG TACAGGATGC CATGGTGCAG
TGGTGGTATG GCCACAACGC GGTGGGCTTC TTCCTGACCG CAGGCTTCCT CGGCATGATG
TATTATTTCA TTCCCAAGCA GGCAGATCGT CCGGTTTACA GCTATAAACT GTCGATCATC
CACTTTTGGG CGCTGATCTT CATCTACATC TGGGCCGGCC CACACCACCT GCATTATACC
GCGCTGCCGG ACTGGGCCTC GACGCTGGGC ATGGTGTTCT CGATCGTGCT GTGGATGCCC
TCCTGGGGTG GCATGATCAA CGGTCTGATG ACGCTCTCGG GCGCTTGGGA CAAGCTGCGC
ACCGATCCGG TCCTGCGGAT GCTGGTGATC TCGGTTGGCT TCTACGGCAT GTCCACATTT
GAGGGTCCGA TGATGTCGAT CCGTGCGGTG AACTCGCTGA GCCACTACAC CGACTGGACC
ATTGGTCACG TGCACTCTGG TGCGCTGGGC TGGAATGGTA TGATCACCTT CGGTGCGCTC
TACTACCTGA CCCCGGTTCT GTGGAAGCGT GAGCGGCTCT ACTCCCTGAG CCTGGTCAGC
TGGCACTTCT GGCTCGCGAC CATCGGCATC GTTCTCTATG CGGCGTCCAT GTGGGTGACC
GGTATCATGG AAGGCCTGAT GTGGCGTGAA GTGGATGCGA ACGGCTTCCT CGTGTGGTCC
TTTGCCGACA CCGTTGCTGC GAAGCTTCCG ATGTATGTGA TGCGTGGTTT GGGCGGGGTT
CTGTTCCTCA CCGGGTCGCT GGTCATGTGC TATAACCTCT GGATGACCGT TCGTCGGGCG
CCGGCCAAAG AGGCCAGCCT GTCCGTCGCG GTCCCGGCTG AATAA
 
Protein sequence
MSNYLKLIVL GVLTVFFMIA AGYARDLAYQ VHAVILMLVA GGLFLWTLRE TDEPVPAPVT 
GEYLDGVVRA GVIATAFWGV AGFLVGTFIA FQLAFPVLNF DWSEGFANFG RLRPLHTSAV
IFAFGGNALI ATSFYIVQRT SAARLWGGNL AWFVFWGYQL FIVLAATGYL LGGTQSKEYA
EPEWYIDLWL TVVWVAYLAV FLGTIIRRKE RHIYVANWFF LAFIVTVAML HLVNNLTIPV
SIWGSKSVIV WPGVQDAMVQ WWYGHNAVGF FLTAGFLGMM YYFIPKQADR PVYSYKLSII
HFWALIFIYI WAGPHHLHYT ALPDWASTLG MVFSIVLWMP SWGGMINGLM TLSGAWDKLR
TDPVLRMLVI SVGFYGMSTF EGPMMSIRAV NSLSHYTDWT IGHVHSGALG WNGMITFGAL
YYLTPVLWKR ERLYSLSLVS WHFWLATIGI VLYAASMWVT GIMEGLMWRE VDANGFLVWS
FADTVAAKLP MYVMRGLGGV LFLTGSLVMC YNLWMTVRRA PAKEASLSVA VPAE