Gene Sbal223_2217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2217 
Symbol 
ID7086344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp2631975 
End bp2632943 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content46% 
IMG OID643461115 
Productcytochrome c oxidase, cbb3-type, subunit III 
Protein accessionYP_002358139 
Protein GI217973388 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID[TIGR00782] cytochrome c oxidase, cbb3-type, subunit III 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000127298 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.441817 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGCT TCTGGAGTGT TTGGATTACC GTACTCTCGT TAGTCGTGAT CGCGGGTTGT 
TTTCTATTAC TGCGTGTGTG TTCTAAGAAC ACCACGGGCG TAAAAGAAGG CGAATCCATG
GGTCACAGTT TCGACGGTAT TGAAGAACTC AATAACCCAC TGCCAAAATG GTGGAGTTAT
ATGTTCTATA TCACTATCGT GTTTGGCTTG ATCTATCTAG CCTTATTTCC TGGTTTAGGT
AACTACAAAG GCCTTTTAGG CTGGACGAGT TCTAACCAGA GCATTGGTAC TGAACAAGGT
ATTAAAGCCG ATTCTGCCGC AGCCATTGAG CTTGCAGCAA AAGAAGGCCG TTATGTTCAG
TATGACCAAG AAGTAAAACA CGCTAGTGAA AAATATGGCC CAATCTTCGC GGCTTACTTG
GCAACACCAC TAGAAGAATT AGTGAAAAAC CAAGAAGCAT TGAAAGTGGG CGGCCGTTTG
TTCCTACAAA ACTGCGCACA GTGCCATGGC TCTGACGCAC GTGGTAGCAA AGGCTTCCCT
AATCTCACCG ATGGTGACTG GTTATATGGT GGCGACTTAG CCACGATTAA AACCACTATC
ATGGGTGGTC GTCATGGCAT GATGCCGCCG AAAGGTGGTT TGCCAATCGA TGACAGCGAA
ATTGCGGGTT TAGCTGAATA CGTTGTTAAA TTGTCTGGTC GTGAGCACGA TGAAACACTC
GCCGCTCAAG GTCAAGGCTC ATTCATGAAA GGTTGTTTCG CGTGTCATGG TATGGACGCT
AAAGGCAACA AGTTCATGGG TGCTCCTAAT TTAACTGACG ATGTTTGGTT ATATGGCGGT
AGCCGTGGCG TGATCGAAGA AACCATTAAA CATGGTCGCG CAGGTGTAAT GCCAGCGTGG
AAAGACGTTC TCGGTGAAGA GAAAGTTCAC GTAATCGCAG CTTATGTTTA TAGCTTGTCA
AACAAGTAA
 
Protein sequence
MSSFWSVWIT VLSLVVIAGC FLLLRVCSKN TTGVKEGESM GHSFDGIEEL NNPLPKWWSY 
MFYITIVFGL IYLALFPGLG NYKGLLGWTS SNQSIGTEQG IKADSAAAIE LAAKEGRYVQ
YDQEVKHASE KYGPIFAAYL ATPLEELVKN QEALKVGGRL FLQNCAQCHG SDARGSKGFP
NLTDGDWLYG GDLATIKTTI MGGRHGMMPP KGGLPIDDSE IAGLAEYVVK LSGREHDETL
AAQGQGSFMK GCFACHGMDA KGNKFMGAPN LTDDVWLYGG SRGVIEETIK HGRAGVMPAW
KDVLGEEKVH VIAAYVYSLS NK