Gene Information |
Locus tag | Sbal223_4298 |
Symbol | |
ID | 7089100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 5107219 |
End bp | 5109030 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643463172 |
Product | protein of unknown function DUF885 |
Protein accession | YP_002360187 |
Protein GI | 217975436 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
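The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention explicitly, and the function name `feature_lengths` is illustrative only.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = feature_lengths(5107219, 5109030)
print(gene_len, protein_len)  # 1812 603
```

Under these assumptions the result agrees with the Gene Length (1812 bp) and Protein Length (603 aa) fields of the record.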
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000000426014 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCATAAAC TCTTAACCCC CAGTTTATTG ATATGCGCAT TAAGCGCCTG CAGTGCCACT CAAACCACAG AAAATAGCCA AACCGCAGCG CCACAGGCAA TATCAGCAAC ACAATCGCAA GTTGAGCCCA TAGCTCCCGC GCTGCAGGCG ATTATCGACC AAAGCTGGCA ATTACAGCTC AGCGCGAGTC CTGAAATGGC TTACAGCATG GGCGATGCCA GTGCCGCAGG AAAGTTATCG GATCTCTCGC CAGCTGCACT CGCTAAGCTC AATCAAGGCC AAATCGCAGT ATTAGCGCAG CTCAAAGCAC TCGACCGCAG CAGTTTGAAT AAAGAAGATA AGATTAACGC TCAAATACTG GAAGATCAGG TCCAAAACGA TGTCGACCTG TATCGCTTTA AAGATTATTA CTTGCCCATT ACCGCGGAAA GCGGCTTCCA CGCTTATATC ACCTCAATCG CACAGGGAAG ATTCAACACC CTTGAGGATT ACCGCAACTA TATTGCCAAA CTCAATGCAC TCCCGACCTA TTTCGCCCAG CAAACTCACT GGCTAAAGCA AGGATTAGCC GAGGGAATAA CGCCGCCTAA AGTGACACTC AATGGCTTTG AAGACAGTAT CAGTGCTTAT ATCGTCCCTG TGGAAAAGAG CGGTTATTTC AAACCTTTTA CGCAATACCC CAGCTATTTC ACTGAAGCAC AAAAAACTCA GCTGACCCAA GAAGGTCGCG CCTTAGTCGA GCAAAAAGTC CTACCGCTAT ATCAAAACTT CTATGATTTT ATGACCAAAG AGTACATTCC CAATGCACGG GAAAATATCG CGGCCAGCAG CTTACCTAAC GGTGCTGAGT TTTATGAGAA TCGAGTACGT TACTATACGA CCTTGAATAT GACGTCGGCA GAAGTGCATG AACTGGGCTT AAAAGAAGTA AAGCGTATCC GCCAAGAGAT GGAGCAAATC ATCAAGTCCG TTGGCTTTAA AGGCAGCTTT GCCGACTTTT TACATTTCCT GCGGACCGAT CCCCAGTTCT ACACCACAAG TGCCGATCAG TTACTCAAAG AAGCGGCATT TATTGCCAAA AAAGCCGATG CCATGCTGCC TAAGTACTTC GGCAAACTTC CCCGTAAACC TTATGGCATA GCGCCAGTGC CCGCCGAAAT TGCGCCGAAA TACACCACAG GACGATATTC AGGATCGAAC AGCGATGACG AACCCGGTTA CTACTGGGTC AACACTTATG CATTGGATAA GCGCCCACTC TACGAGTTAG AAGCCTTAAC CTTGCACGAA GCCGTACCCG GTCATCATCT GCAAATTTCA CTCAACTCTG AGCTAACGTC ACTCCCTGAC TTCCGTCGTT ATGGCTATAT ATCTGCCTTC GGTGAAGGCT GGGGTCTGTA TTGCGAATAC TTAGGTTTGG AGGCTGGCTT CTACCAAGAT CCCTACAGTA ACTTTGGCCG CTTAACCTAT GAAATGTGGC GGGCGGCGCG TTTAGTCGTA GACACAGGCA TGCATGCCCA AGGGTGGAGC CGTCAACAAG CCATCGACTT TATGGCCAGC AATACCGCGC TATCACTGCA CAATGTCACG ACCGAAATTG ACCGTTATAT TTCGTGGCCA GGACAAGCTT TGTCCTACAA GATTGGCGAG TTAACGATCA AGCGGTTACG TGCTAAAGCA GAACAGGAAC TCGGCGACAA GTTTGATATT CGCGCCTTCC ACGACGCTGT GCTCGAAAAT GGCTCAGTAC CTATGTCGAT CCTCGAACAG CAAATCAATG ATTTTATTGA AGCTAAAAAG 
GCGACACGCT AA
|
Protein sequence | MHKLLTPSLL ICALSACSAT QTTENSQTAA PQAISATQSQ VEPIAPALQA IIDQSWQLQL SASPEMAYSM GDASAAGKLS DLSPAALAKL NQGQIAVLAQ LKALDRSSLN KEDKINAQIL EDQVQNDVDL YRFKDYYLPI TAESGFHAYI TSIAQGRFNT LEDYRNYIAK LNALPTYFAQ QTHWLKQGLA EGITPPKVTL NGFEDSISAY IVPVEKSGYF KPFTQYPSYF TEAQKTQLTQ EGRALVEQKV LPLYQNFYDF MTKEYIPNAR ENIAASSLPN GAEFYENRVR YYTTLNMTSA EVHELGLKEV KRIRQEMEQI IKSVGFKGSF ADFLHFLRTD PQFYTTSADQ LLKEAAFIAK KADAMLPKYF GKLPRKPYGI APVPAEIAPK YTTGRYSGSN SDDEPGYYWV NTYALDKRPL YELEALTLHE AVPGHHLQIS LNSELTSLPD FRRYGYISAF GEGWGLYCEY LGLEAGFYQD PYSNFGRLTY EMWRAARLVV DTGMHAQGWS RQQAIDFMAS NTALSLHNVT TEIDRYISWP GQALSYKIGE LTIKRLRAKA EQELGDKFDI RAFHDAVLEN GSVPMSILEQ QINDFIEAKK ATR
|
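The gene and protein sequences above are related by translation table 11. A minimal sketch of that relationship follows, assuming the codon-to-amino-acid assignments of table 11 match the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are illustrative and not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 is assumed to use
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that separates the 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon.

    Alternative start codons (e.g. GTG, TTG) are not converted to Met here.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 21 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCATAAAC TCTTAACCCC C"))  # MHKLLTP
print(translate("GCGACACGCT AA"))            # ATR*
```

The two fragments reproduce the first seven residues (MHKLLTP) and the last three residues (ATR) of the protein sequence listed in the record, followed by the stop codon. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.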