Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_1486 |
Symbol | |
ID | 7087539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 1733279 |
End bp | 1735219 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643460389 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_002357416 |
Protein GI | 217972665 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0306155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.36533 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTTT TGCAAGGCTT GTTCCGCTTA AAACGTGCTC ACAAACGTTT GGTGAGCGTT GTTATTGATT CGGTTTTCCT TTGTTTTGCT TTCTGGGCTG CATTATTGGT TCGAGTTGAC GATGTCAACG TACTTGTTAA TTGGGCTTAT TGGGCATTAC TGTTGCTTGT TGTCCCCTTG AGTGTTGTAG GTTTTGCAAA ATTAGGCTTG TATCGAGCCG TATTACGTTA CATGGGACTA CAAGCACTCA CTGCAATACT GTTTGGTGTT ATTGCCTCAA CAATAATTCT TGTTATCGTG GCTTATTACT CTGAGGCGCA ATTACCTCGT ACAGTGCCTA TCATTTACGC TGCTTTTGCC TTAGTTTTTG TTGGAGGGAC GCGTGCTATG GTGCGTTCGC TTGTCGGTTC AGGAATGAAG CGGGTTGGTG AGCCGGTAAT AATTTATGGC GCTGGGGTGA GTGGTCGCCA ATTAGTCACT GCATTAGTTC AGAGTCATGA ATATTATCCA TTCGCCTTTG TTGATGATGA TGAGTCGCTA CATGGCATAG TTATCCAAGG TGTTTATGTT CATTCCCCTT CAATTATTAA AAAATTAATC AATCAAAATT CTGCAACTAA AGTTCTTTTG GCTATGCCTA GTGCGCCGCG CTCGCGTCGA CAAGAAATTT TATTGCAGCT TGAGCCTTTG GCTGTGCAAG TGCTTACCTT GCCTGCAATG GCTGATTTGG TCAATGGCAC CAAACTTTAT AGTGATATTA AAGAAGTCGA AATTGATGAT TTATTAGGGC GAGATGCAGT AAATCCAAGA GGCGATCTTC TTAGTGCGAA TATTCGTAAT AAAGTAGTTA TGGTAACAGG GGCAGGTGGG TCTATCGGCT CAGAGCTTTG TCGCCAAATA CTCAAGCAAA ATCCTAAAAA GTTAGTACTC TTTGAACTTT CTGAATATGC CTTATATGCC ATTGAACGTG AATTACGCTT AACGGCGCAT GAGTTGGGTT TGGATGTCGA AATTTTCCCC ATGATGGGTT CAGTTCAACG TGCTAATCGT ATAGAAGCAG TGATGAAGGC TTTTAGTGTA CAGACTGTCT ATCATGCTGC TGCCTACAAA CATGTTCCGT TAGTTGAGCA CAATGTGGTT GAAGGTGTTC GTAATAATGT GTTTGGTACC TTGTATACAG CTCAAGCCGC CATTGCTGCG AAGGTTGAAA CTTTTGTATT GATTTCTACG GATAAAGCTG TGAGGCCGAC CAATATCATG GGAACCACTA AGCGTATGGC TGAGTTGGTT CTGCAAGCAT TATCAAATTT AAATAATAAT ACTCGTTTTT GTATGGTTCG TTTTGGTAAT GTTTTAGGCT CATCGGGTTC AGTAGTGCCT TTGTTCAGAA GCCAAATTGC CAACGGTGGT CCGGTAACAG TCACTCACCC TGAAATCACT CGTTTTTTTA TGACAATACC AGAGGCTTCT CAATTAGTGA TCCAAGCCGG TGCTATGGGT AAAGGCGGTG ATGTATTTGT GCTGGATATG GGCAAATCGG TTAAAATCGT CGATTTGGCT GCAAAAATGA TCCGTCTAAG TGGATTTGAA GTTAAGGATG AAAAAAATCC TGATGGTGAT ATTGCGATTG AGTTTAGTGG TTTACGTCCT GGTGAAAAGT TATATGAGGA ATTGCTGATT GGTGATGATG TTACTGGTAC TGAGCATGAG CGCATTATGA CTGCAAATGA AATATGTTTG CCTTGGAATG AGCTCGAAAC TATTCTTTAT AGACTAGATA AAGCCTGTCA TGAATTCAAT CATGAGGTTA TCCGCGACAT TTTGCTAACA ACGCCAACAG GATTTAATCC TACAGATGGA ATTTGTGATC TAGTTTGGTT GCAAAAGAAA TCAATCACCG CAGTTGAAGA TAAAAATAAA GTCGTTACTT TAGTTTCTTA G
|
Protein sequence | MEFLQGLFRL KRAHKRLVSV VIDSVFLCFA FWAALLVRVD DVNVLVNWAY WALLLLVVPL SVVGFAKLGL YRAVLRYMGL QALTAILFGV IASTIILVIV AYYSEAQLPR TVPIIYAAFA LVFVGGTRAM VRSLVGSGMK RVGEPVIIYG AGVSGRQLVT ALVQSHEYYP FAFVDDDESL HGIVIQGVYV HSPSIIKKLI NQNSATKVLL AMPSAPRSRR QEILLQLEPL AVQVLTLPAM ADLVNGTKLY SDIKEVEIDD LLGRDAVNPR GDLLSANIRN KVVMVTGAGG SIGSELCRQI LKQNPKKLVL FELSEYALYA IERELRLTAH ELGLDVEIFP MMGSVQRANR IEAVMKAFSV QTVYHAAAYK HVPLVEHNVV EGVRNNVFGT LYTAQAAIAA KVETFVLIST DKAVRPTNIM GTTKRMAELV LQALSNLNNN TRFCMVRFGN VLGSSGSVVP LFRSQIANGG PVTVTHPEIT RFFMTIPEAS QLVIQAGAMG KGGDVFVLDM GKSVKIVDLA AKMIRLSGFE VKDEKNPDGD IAIEFSGLRP GEKLYEELLI GDDVTGTEHE RIMTANEICL PWNELETILY RLDKACHEFN HEVIRDILLT TPTGFNPTDG ICDLVWLQKK SITAVEDKNK VVTLVS
|
| |