Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssed_4018 |
Symbol | |
ID | 5612506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sediminis HAW-EB3 |
Kingdom | Bacteria |
Replicon accession | NC_009831 |
Strand | + |
Start bp | 4919430 |
End bp | 4922609 |
Gene Length | 3180 bp |
Protein Length | 1059 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640934973 |
Product | beta-galactosidase |
Protein accession | YP_001475750 |
Protein GI | 157377150 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000000751059 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTCAAAAT CCATCACCCT AGTCAGCACT CTACTACTCG GCTTATGCAG TCTCACCTCC CAAGCAACTG AGCGATGGCA GGATCACTCG GTATTTGAGG TCAATAAAGA GCCTCCACAC GCCAGCTTCT TCGCCTATGA CTCAATCCTG AAGGCGAGTG CCGACAATTA TCGAAGTAGC AAAAACTTCA TGGATCTCAA TGGTACATGG CAGTTTCATT ATGCGCCCAA CCCTGCATCA GTCCCTGTAG ACTTTGCAAA CAACAGCTAT GATCCCACGA ACTGGGGCAG CATTCAGGTT CCCGGTAACT GGGAGACTCA AGGTTACGGT CATGCCATCT ACCTCGATGA GCGCTACCCC TTCACCACCA GCTGGCCCGA CGCCCCCCAA GATCATAATC CGACGGGCAG TTATCGACGC GAATTCACCC AGCCCGAAGA TTGGCAAGGC AAACAGATTT TCTTCCATGT GGGCGCCGCA AGATCCTCAC TGACTCTATT TGTTAACGGC GCAGAGGTTG GCTATAGCCA GGGTGCTAAG ACGCCGGCCG AGTTTAATAT TACTCAGTAC CTCAAGCCGG GTAAAAACCT GCTTTCGATG AAGATAATTC GCTGGAGTGA TGCCAGTTAT CTGGAAAGCC AGGATATGCT TCGAATGAGC GGCATCGAGC GTGATGTCTA CCTCTTTGCC ACACCTCAAC AGAGGATCAG CGATATTGAT GCACTCTACA GTCTCAATAA CAAATTAGAT AAGGCCGAAC TGGCATTAAG CTTCAAGCTA AAAAACCATC AAGCTACAGC GCCAGTCACG GTTGATTACC AATTACTTTC ACCGGCCGGA GAGTTGGTGG CAAACGGCAG CCAGAGCCTG ATATTGAAGG GGGATAACAG CGTCATTTTC GAGGATAAGC TGGTATCACC ATTGCTGTGG AGCGCCGAAA CACCAGAGCT CTACCAACTT ATCGTCAGCA TGAAAGATGA TAAAGGCAAG TTGTTGCAGG CCAGTAGCCA ACATATAGGT TTTCGTCATA TAGAGATTAA AGCGGGGCAA TTGTTAATCA ATAAGCGGGC CATCACCATC CGTGGTGTCG ACCGCCATGA AACCGATCCT CTTACCGGTC ATGTCGTCAG CCGTGAGAGC ATGGAGCGAG ATATCCGCTT GATGAAGTTA AACAACATCA ACGCCGTTCG CTCATCTCAC TATCCTAACG ATCCATACTG GCTCCAGCTG GCCGACAGAT ACGGCATGTA TATCGTCGAT GAAGCCAATA TCGAATCTCA TCCCCTCGCC ATCGATGAAA AGACCCAACT CGGTAACGAA ATGAGTTGGT ATCCGGCCCA TCTGGCCCGC GTCGAGCGCA TGCTTGAACG GGATAAGAAC CACCCATCGG TGATCATCTG GTCTCTGGGT AACGAGGCCG GCGAAGGAAA ACTGTTTGAA CAGCTCTATC GGTGGGTCAA ACAGCGCGAT TCCAGTCGTC CGGTTCAGTA TGAACCCGCT GGCATGGCCG CCTATACCGA TATTGTCGCC CCTATGTATC CCTCCATTGA GCGGATAGAA ACGTACGCCA AAGCACATAC CGACCGACCA CTGATCATGA TTGAGTATGC CCACGCCATG GGCAACTCGG TTGGCAACCT TCAGGACTAC TGGGATGTCA TAGAAAAGTA TCCGAACCTG CAAGGCGGCT TTATCTGGGA CTGGGTGGAT CAATCGCTAA AGTTCACCAA TGAAAAGGGA GAAGATTATT GGGCATACGG AAAGGATTAC CACCCGGATA TGCCCACAGA CGGTAACTTT CTCAATAATG GTTTAGTCGA TCCGGACCGA AATCCGCACC CTCACCTGAG TGAAGTAAAG AAGGTCTATC AAGCCATAGG GTTCGATAAT TTCAAATTCA ATGGCAATTT CGATGACAAT CTCGATGACA AGAGTGCCAG TATCAGGCTA ACCAACAAAT ATGACTTTCA AAGCACTAAG GGACTGACGC TGAGCTGGTC GCTGCAAAAA GATGGCATAG CCGTCGCGAG TGGCAGCGAG CCTATGCCCG TGATAGCGGC CGGTGAAGCT AAATCTGTCA CTATCAAGCT CACAAACAAG CCAGCGCTTG TATTCGATAG CAGATACGAA TACCGGCTAC TCGTTTCGGT TTCACTGGCG GAGCCCAGAC CTTTGATCCC CGCAGGTCAT GAGTTGGCAT TTGCACAATA TTTACTGCTG GAAGCCAGCC CAAACATCTC GCCAGTGAGC ACGGCCACCA GCTTGGCGGA GACTGAGACT CGATGGCAAC TCAGCCACGG GGATAACCAC TATTCAATAT CCCAAATTTC CGGCTGGTTA ACCGAGATAG AGATTGATGG TGAACCGCAG ATCCAGGCCC CCTTAATGGC CAACTTCTGG CGAGCTCCCA CGGATAATGA CTTAGGTAAT GGTATGCCCG ACTGGGGAGG TGTATGGCAA GATGCCTCGG CTCAACTCAC CCTTGAGTCC ATCGAGCCTC TCGAGGAGCA CGGCCTGATA GTGACCCATC AGCATCCCAA GTTAGGTATC ACCTTAGTGA CACAATACCG TGTAAATAAG CGTGGCGAAC TCATGGTTAA CATAAGCTTT AATCCAGGCA AGACCGCCCT TCCGGATCTG CCCAAGTTTG GCTTTACGAC TCGCCTTCCC TTCGAACAGC GTTACCTGAG TTATTACGGA CGCGGCCCCG AAGAGACCTA TGCAGACAGA GCTTCAGGTA ATCCTTTCGG TTGGTATCAA CTGCCGATAG AGAAATTATT TCACCGTTAT TCACGTCCCC AGGAGACCGG GCAGCGAACC CAGATCAGAT ATGCAGCAGT CAGAAACAGT GCTGGGACCG GTCTGATGGC CGTAGCCCAA AACGAGATGC AAACCAGTTT ATGGCCCTTT AGTCTGGCTG ATATCGACTT TAGAGAGGGT GATGCGCAGG GCTCGGCATC GGGCTTAGTC CCGGTGACAC AGAACCATGG CGGTGAGGTC CCCATTCGTG ATTTTGTCAC CTGGAATATA GACCTGAAAC AGATGGGAGT CGGTGGCGAT ACCTCATGGG GTCGCCCCGT ACATGACAAG TACAGAATTA AGGCCGAGAA GATGAGTTTC AAGTTCAGGC TCGTCCCCGT AACGGCGCAG AGCCAACTTC AACAACTGGC CCGAGAGTAA
|
Protein sequence | MSKSITLVST LLLGLCSLTS QATERWQDHS VFEVNKEPPH ASFFAYDSIL KASADNYRSS KNFMDLNGTW QFHYAPNPAS VPVDFANNSY DPTNWGSIQV PGNWETQGYG HAIYLDERYP FTTSWPDAPQ DHNPTGSYRR EFTQPEDWQG KQIFFHVGAA RSSLTLFVNG AEVGYSQGAK TPAEFNITQY LKPGKNLLSM KIIRWSDASY LESQDMLRMS GIERDVYLFA TPQQRISDID ALYSLNNKLD KAELALSFKL KNHQATAPVT VDYQLLSPAG ELVANGSQSL ILKGDNSVIF EDKLVSPLLW SAETPELYQL IVSMKDDKGK LLQASSQHIG FRHIEIKAGQ LLINKRAITI RGVDRHETDP LTGHVVSRES MERDIRLMKL NNINAVRSSH YPNDPYWLQL ADRYGMYIVD EANIESHPLA IDEKTQLGNE MSWYPAHLAR VERMLERDKN HPSVIIWSLG NEAGEGKLFE QLYRWVKQRD SSRPVQYEPA GMAAYTDIVA PMYPSIERIE TYAKAHTDRP LIMIEYAHAM GNSVGNLQDY WDVIEKYPNL QGGFIWDWVD QSLKFTNEKG EDYWAYGKDY HPDMPTDGNF LNNGLVDPDR NPHPHLSEVK KVYQAIGFDN FKFNGNFDDN LDDKSASIRL TNKYDFQSTK GLTLSWSLQK DGIAVASGSE PMPVIAAGEA KSVTIKLTNK PALVFDSRYE YRLLVSVSLA EPRPLIPAGH ELAFAQYLLL EASPNISPVS TATSLAETET RWQLSHGDNH YSISQISGWL TEIEIDGEPQ IQAPLMANFW RAPTDNDLGN GMPDWGGVWQ DASAQLTLES IEPLEEHGLI VTHQHPKLGI TLVTQYRVNK RGELMVNISF NPGKTALPDL PKFGFTTRLP FEQRYLSYYG RGPEETYADR ASGNPFGWYQ LPIEKLFHRY SRPQETGQRT QIRYAAVRNS AGTGLMAVAQ NEMQTSLWPF SLADIDFREG DAQGSASGLV PVTQNHGGEV PIRDFVTWNI DLKQMGVGGD TSWGRPVHDK YRIKAEKMSF KFRLVPVTAQ SQLQQLARE
|
| |