Gene Shewmr4_2009 details


Gene Information

Locus tag: Shewmr4_2009
Symbol:
ID: 4252582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2393809
End bp: 2394777
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 47%
IMG OID: 638118622
Product: cytochrome c oxidase, cbb3-type, subunit III
Protein accession: YP_734139
Protein GI: 113970346
COG category: [C] Energy production and conversion
COG ID: [COG2010] Cytochrome c, mono- and diheme variants
TIGRFAM ID: [TIGR00782] cytochrome c oxidase, cbb3-type, subunit III
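
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2393809
END_BP = 2394777


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 969
print(protein_length(length_bp))  # 322
```

Under these assumptions both results agree with the Gene Length (969 bp) and Protein Length (322 aa) fields listed for this gene.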


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0902386
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000272898
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTAGCT TCTGGAGTAT TTGGATTAGC GTACTCTCGT TAATTGTGAT CGCAGGATGT 
TTTCTGTTGC TGCGTGTGTG TTCAAAGAAC ACCACGGATG TCAAAGAAGG CGAATCTATG
GGTCACAGTT TCGATGGTAT CGAAGAACTC AATAACCCAC TGCCAAAATG GTGGAGTTAT
ATGTTCTATA TCACTATCGT GTTTGGCCTG GTTTACTTAG CCCTCTTCCC AGGTTTAGGT
AACTACAAAG GTCTACTGAA CTGGACCAGC TCTAACCAGA GCATTGGTAC AGAGAAAGGT
ATTAAAGCCG ATTCTGCAGC AGCGGTTGAG CTGGCTGCAA AAGAAGGCAT GTACGTTCAG
TATGATCAAG AAGTTAAACA TGCTAACGAA AAATATGGCC CAATCTTCGC GGCTTACTTG
GCTACACCAC TCGAAGAGTT AGTGAAAAAC CAAGAAGCGT TGAAAGTGGG CGGCCGTTTG
TTCCTACAAA ACTGCGCTCA GTGCCATGGC TCTGACGCAC GTGGTAGCAA AGGCTTCCCT
AACCTGACCG ACAGTGATTG GTTATATGGT GGTGATTTAG CCACTATCAA GACCACTATC
ATGAACGGTC GTCATGGCAT GATGCCACCA AAAGGTGGTT TGCCAATCGA CGATAGCGAA
ATTGCAGGTC TAGCCGAGTA CGTAGTTAAG TTATCTGGCC GTGAGCACGA CGAGAAACTC
GCCGCTCAAG GTCAAGGTTC ATTCATGAAG GGTTGTTTTG CGTGTCACGG TATGGACGCA
AAAGGCAACA AACTCATGGG CGCACCTAAC TTAACTGACG ATGTTTGGTT ATATGGCGGT
AGCCGCGGCA TGATCGAAGA AACGATCAAG CACGGCCGCG CAGGCGTAAT GCCAGCATGG
AAAGACGTTC TCGGTGAAGA GAAAGTCCAC GTGATCGCAG CTTATGTTTA TAGCTTGTCA
AACAAGTAA
 
Protein sequence
MSSFWSIWIS VLSLIVIAGC FLLLRVCSKN TTDVKEGESM GHSFDGIEEL NNPLPKWWSY 
MFYITIVFGL VYLALFPGLG NYKGLLNWTS SNQSIGTEKG IKADSAAAVE LAAKEGMYVQ
YDQEVKHANE KYGPIFAAYL ATPLEELVKN QEALKVGGRL FLQNCAQCHG SDARGSKGFP
NLTDSDWLYG GDLATIKTTI MNGRHGMMPP KGGLPIDDSE IAGLAEYVVK LSGREHDEKL
AAQGQGSFMK GCFACHGMDA KGNKLMGAPN LTDDVWLYGG SRGMIEETIK HGRAGVMPAW
KDVLGEEKVH VIAAYVYSLS NK
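
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the tool used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, dropping a single trailing stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 18 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAGTAGCT TCTGGAGT"))  # MSSFWS
print(translate("AACAAGTAA"))            # NK
```

The two fragments reproduce the first six residues (MSSFWS) and the last two residues (NK) of the protein sequence. Passing the full gene sequence to `translate` would be the natural way to check the remaining residues.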