Gene Information |
Locus tag | Sbal223_4153 |
Symbol | |
ID | 7088476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 4931620 |
End bp | 4933731 |
Gene Length | 2112 bp |
Protein Length | 703 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643463032 |
Product | cytochrome c biogenesis protein transmembrane region |
Protein accession | YP_002360047 |
Protein GI | 217975296 |
COG category | [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4232] Thiol:disulfide interchange protein; [COG4233] Uncharacterized protein predicted to be involved in C-type cytochrome biogenesis |
TIGRFAM ID | |
|
|
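The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sbal223_4153.
length = gene_length_bp(4931620, 4933731)
print(length)                     # 2112
print(protein_length_aa(length))  # 703
```

Both results agree with the Gene Length (2112 bp) and Protein Length (703 aa) fields.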
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00759916 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACAT TAAAATCCAT GTTAACCAGA TTCTGGATAC TCGCCTTGAT AGTGACGAGC CCGTCGCTAT TGGCCGCATC GACGGGGTGG TTAGTTAACT ATTATCACCC GCCAGCCAAA GTACGTTTTA TGCTGACGGG CGAAGTTGAT CCCACAACGA ATACTTTGCC CGCTGTGCTT GAAGTACAAC TTGAGGGTGA TTGGAAAACC TATTGGCGTA GCCCTGGGGA AGGCGGTATC GCGCCTACCA TCAAATGGGA TGGCTCACAC AATCTACAGC AAGTAGATTG GAGATGGCCT GCGCCAGAGG AATTCTCATT ATTGGGATTA CAAACCTTTG GTTACAAAGG TAATACCACC TTTCCCTTAA CCCTAAAAGT CGATGATATC GCTGCACCAA CCCAGCTGCG TGGCAAAGTG ACCTTGTCGA CCTGTACCAC TATCTGTGTA CTGACGGATT ATCAGATTAG TCTCGACTTC ACGCCCAATG CGCTGCAAGC AGATACCGAT GCCATGTTGG CCTACAACAA GGCCGTGTCG CTTGTGCCAC AAAAAGTGTT GTCCCAAGAG GGCCAGAATC CTGCCATGAC AATGGGTTGG GATGCCGCCA AAGGTCAACT TGAAGTCAGA CTCAATGATG CCAACTGGCA GCAACCCACA GTGATTATCG ATGGCGAGCC AGATACCACA TTCAAGCTAG TGAGCTTAAA GCCAAGCGAA AGTGATGCTA ACGGCAAGCA ATTGGTCGGT ATTTTCAGCG GTAAAAGTTG GCTGGGCGAG CCTGAAGTTT TGGGTAAATC GCTGAACATG ACTGTCGCCG ATAGCGAGCG AGCGTTGGAA TACAGCGCCG AAGTTAAGCC TGTGGTGATT ACCCAAGCAA GTACCTCGAT ATTTGGTATG ATTTTGTTAG CGCTTATCGG TGGTTTGATC TTAAACGTCA TGCCCTGCGT GTTACCCGTT CTCGGCATGA AGCTCAGTTC AGTGGTTGCT GCGCCTGATC TGAAGCGTAA TCAAATCCGC CAACAGTTCA TCGCATCAGC GCTGGGGATA TTGGCCTCAT TCTGGTTACT GGCTGGCTTT ATCTTAATGC TTAAGCTCAC GGGTCAAGCG ATTGGTTGGG GTGTGCAGTT CCAAAACCCT TGGTTTATCG GCTTTATGGC GCTGGTGACC ACAGTGTTTG CCTTCAATAT GTTAGGTGTG TTTGAGATCA ACTTACCCTC TAACGTGCAA ACCAAGTTGG CGACAACGGG CGGCAATAAT AACAGCGGCC ATTTTTTACA GGGCATGTTT GCCACGCTGC TGGCGACACC GTGCAGTGCA CCATTTTTGG GGACTGCGGT CGCTTTCGCC TTGGGCGCCG ATGTACTGAG CCTGTTTGCC ATCTTTACCG CGTTGGCCGT GGGTATGGCG TTCCCTTGGT TGATTGTGGC GGCATTTCCG CAAGTGGCGG GTTATTTCCC TAAACCCGGT CGCTGGATGA ATACAGTTAA AGTGTTTTTC TCCGGCATGT TGCTCATCAC CAGTTTGTGG CTCATTAGCC TGTTGGCCAG CTTTATTGAT GTGCTCTACC TGTGGCCGTT AGCGGGTGTC ATTACACTGA TATTTATGGT GTTTATGGCT AAAAAATATG GCGCGATTGC CATCGTGAGC TGCTTAGGTA TAGGCATTTT ACTGTCGGCG GTTATTGCAT TTATGACGGC GAACCAGTGG GCTAAACCTT TACCTGCGGA TCTTGCTTGG ACGCCATTGG ATCAAGCATT GATTAATCAG CAGGTTGCCC AAGGCAAAAC TGTGTTTGTC 
GATGTCACTG CCGATTGGTG CATTACCTGT AAGGCCAATA AAGTTGGCGT GATCTTGCAA GATCCAGTCT ATAGCCTGTT GCAACAGTCG CATATTTTGC CGATGAAGGG CGATTGGACT ACGCCTTCTG AGCCCATTAC TCACTACCTG CAAAGCCATA ATCGTTTCGG GGTGCCATTC AACGTGGTTT ACGGCCCTGG TGCGCCGCAA GGGATTGAGT TACCGGAAAT TCTATCGACT GACCTTGTGC TTGCTGCGAT TGAGCAGGCC TCTGCGTCCC CTGAAAAGTC ATCCGTAGCG GCAGGGAAAT AA
|
Protein sequence | MSTLKSMLTR FWILALIVTS PSLLAASTGW LVNYYHPPAK VRFMLTGEVD PTTNTLPAVL EVQLEGDWKT YWRSPGEGGI APTIKWDGSH NLQQVDWRWP APEEFSLLGL QTFGYKGNTT FPLTLKVDDI AAPTQLRGKV TLSTCTTICV LTDYQISLDF TPNALQADTD AMLAYNKAVS LVPQKVLSQE GQNPAMTMGW DAAKGQLEVR LNDANWQQPT VIIDGEPDTT FKLVSLKPSE SDANGKQLVG IFSGKSWLGE PEVLGKSLNM TVADSERALE YSAEVKPVVI TQASTSIFGM ILLALIGGLI LNVMPCVLPV LGMKLSSVVA APDLKRNQIR QQFIASALGI LASFWLLAGF ILMLKLTGQA IGWGVQFQNP WFIGFMALVT TVFAFNMLGV FEINLPSNVQ TKLATTGGNN NSGHFLQGMF ATLLATPCSA PFLGTAVAFA LGADVLSLFA IFTALAVGMA FPWLIVAAFP QVAGYFPKPG RWMNTVKVFF SGMLLITSLW LISLLASFID VLYLWPLAGV ITLIFMVFMA KKYGAIAIVS CLGIGILLSA VIAFMTANQW AKPLPADLAW TPLDQALINQ QVAQGKTVFV DVTADWCITC KANKVGVILQ DPVYSLLQQS HILPMKGDWT TPSEPITHYL QSHNRFGVPF NVVYGPGAPQ GIELPEILST DLVLAAIEQA SASPEKSSVA AGK
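As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates short excerpts of the gene with a hand-built codon table. It assumes the amino acid assignments of translation table 11, which match the standard code for internal codons (table 11 differs in which start codons it allows; this gene starts with ATG, so that difference does not matter here). The `translate` helper is hypothetical, and only the first 30 and last 12 nucleotides of the record are used.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence -> first 10 residues of the protein.
print(translate("ATGAGTACAT TAAAATCCAT GTTAACCAGA"))  # MSTLKSMLTR

# Last 12 nt of the gene sequence -> final three residues, then the TAA stop.
print(translate("GCAGGGAAAT AA"))  # AGK
```

The two outputs match the beginning and end of the protein sequence listed above. For full-length work, an established library such as Biopython would be a more robust choice than this hand-rolled table.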