Gene Ssed_4018 details


Gene Information

Locus tag: Ssed_4018
Symbol:
ID: 5612506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 4919430
End bp: 4922609
Gene Length: 3180 bp
Protein Length: 1059 aa
Translation table: 11
GC content: 51%
IMG OID: 640934973
Product: beta-galactosidase
Protein accession: YP_001475750
Protein GI: 157377150
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
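
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4919430
END_BP = 4922609
GENE_LENGTH_BP = 3180
PROTEIN_LENGTH_AA = 1059


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, ASSUMING one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the reported gene length (3180 bp) and protein length (1059 aa) agree with the coordinates.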


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000751059
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTCAAAAT CCATCACCCT AGTCAGCACT CTACTACTCG GCTTATGCAG TCTCACCTCC 
CAAGCAACTG AGCGATGGCA GGATCACTCG GTATTTGAGG TCAATAAAGA GCCTCCACAC
GCCAGCTTCT TCGCCTATGA CTCAATCCTG AAGGCGAGTG CCGACAATTA TCGAAGTAGC
AAAAACTTCA TGGATCTCAA TGGTACATGG CAGTTTCATT ATGCGCCCAA CCCTGCATCA
GTCCCTGTAG ACTTTGCAAA CAACAGCTAT GATCCCACGA ACTGGGGCAG CATTCAGGTT
CCCGGTAACT GGGAGACTCA AGGTTACGGT CATGCCATCT ACCTCGATGA GCGCTACCCC
TTCACCACCA GCTGGCCCGA CGCCCCCCAA GATCATAATC CGACGGGCAG TTATCGACGC
GAATTCACCC AGCCCGAAGA TTGGCAAGGC AAACAGATTT TCTTCCATGT GGGCGCCGCA
AGATCCTCAC TGACTCTATT TGTTAACGGC GCAGAGGTTG GCTATAGCCA GGGTGCTAAG
ACGCCGGCCG AGTTTAATAT TACTCAGTAC CTCAAGCCGG GTAAAAACCT GCTTTCGATG
AAGATAATTC GCTGGAGTGA TGCCAGTTAT CTGGAAAGCC AGGATATGCT TCGAATGAGC
GGCATCGAGC GTGATGTCTA CCTCTTTGCC ACACCTCAAC AGAGGATCAG CGATATTGAT
GCACTCTACA GTCTCAATAA CAAATTAGAT AAGGCCGAAC TGGCATTAAG CTTCAAGCTA
AAAAACCATC AAGCTACAGC GCCAGTCACG GTTGATTACC AATTACTTTC ACCGGCCGGA
GAGTTGGTGG CAAACGGCAG CCAGAGCCTG ATATTGAAGG GGGATAACAG CGTCATTTTC
GAGGATAAGC TGGTATCACC ATTGCTGTGG AGCGCCGAAA CACCAGAGCT CTACCAACTT
ATCGTCAGCA TGAAAGATGA TAAAGGCAAG TTGTTGCAGG CCAGTAGCCA ACATATAGGT
TTTCGTCATA TAGAGATTAA AGCGGGGCAA TTGTTAATCA ATAAGCGGGC CATCACCATC
CGTGGTGTCG ACCGCCATGA AACCGATCCT CTTACCGGTC ATGTCGTCAG CCGTGAGAGC
ATGGAGCGAG ATATCCGCTT GATGAAGTTA AACAACATCA ACGCCGTTCG CTCATCTCAC
TATCCTAACG ATCCATACTG GCTCCAGCTG GCCGACAGAT ACGGCATGTA TATCGTCGAT
GAAGCCAATA TCGAATCTCA TCCCCTCGCC ATCGATGAAA AGACCCAACT CGGTAACGAA
ATGAGTTGGT ATCCGGCCCA TCTGGCCCGC GTCGAGCGCA TGCTTGAACG GGATAAGAAC
CACCCATCGG TGATCATCTG GTCTCTGGGT AACGAGGCCG GCGAAGGAAA ACTGTTTGAA
CAGCTCTATC GGTGGGTCAA ACAGCGCGAT TCCAGTCGTC CGGTTCAGTA TGAACCCGCT
GGCATGGCCG CCTATACCGA TATTGTCGCC CCTATGTATC CCTCCATTGA GCGGATAGAA
ACGTACGCCA AAGCACATAC CGACCGACCA CTGATCATGA TTGAGTATGC CCACGCCATG
GGCAACTCGG TTGGCAACCT TCAGGACTAC TGGGATGTCA TAGAAAAGTA TCCGAACCTG
CAAGGCGGCT TTATCTGGGA CTGGGTGGAT CAATCGCTAA AGTTCACCAA TGAAAAGGGA
GAAGATTATT GGGCATACGG AAAGGATTAC CACCCGGATA TGCCCACAGA CGGTAACTTT
CTCAATAATG GTTTAGTCGA TCCGGACCGA AATCCGCACC CTCACCTGAG TGAAGTAAAG
AAGGTCTATC AAGCCATAGG GTTCGATAAT TTCAAATTCA ATGGCAATTT CGATGACAAT
CTCGATGACA AGAGTGCCAG TATCAGGCTA ACCAACAAAT ATGACTTTCA AAGCACTAAG
GGACTGACGC TGAGCTGGTC GCTGCAAAAA GATGGCATAG CCGTCGCGAG TGGCAGCGAG
CCTATGCCCG TGATAGCGGC CGGTGAAGCT AAATCTGTCA CTATCAAGCT CACAAACAAG
CCAGCGCTTG TATTCGATAG CAGATACGAA TACCGGCTAC TCGTTTCGGT TTCACTGGCG
GAGCCCAGAC CTTTGATCCC CGCAGGTCAT GAGTTGGCAT TTGCACAATA TTTACTGCTG
GAAGCCAGCC CAAACATCTC GCCAGTGAGC ACGGCCACCA GCTTGGCGGA GACTGAGACT
CGATGGCAAC TCAGCCACGG GGATAACCAC TATTCAATAT CCCAAATTTC CGGCTGGTTA
ACCGAGATAG AGATTGATGG TGAACCGCAG ATCCAGGCCC CCTTAATGGC CAACTTCTGG
CGAGCTCCCA CGGATAATGA CTTAGGTAAT GGTATGCCCG ACTGGGGAGG TGTATGGCAA
GATGCCTCGG CTCAACTCAC CCTTGAGTCC ATCGAGCCTC TCGAGGAGCA CGGCCTGATA
GTGACCCATC AGCATCCCAA GTTAGGTATC ACCTTAGTGA CACAATACCG TGTAAATAAG
CGTGGCGAAC TCATGGTTAA CATAAGCTTT AATCCAGGCA AGACCGCCCT TCCGGATCTG
CCCAAGTTTG GCTTTACGAC TCGCCTTCCC TTCGAACAGC GTTACCTGAG TTATTACGGA
CGCGGCCCCG AAGAGACCTA TGCAGACAGA GCTTCAGGTA ATCCTTTCGG TTGGTATCAA
CTGCCGATAG AGAAATTATT TCACCGTTAT TCACGTCCCC AGGAGACCGG GCAGCGAACC
CAGATCAGAT ATGCAGCAGT CAGAAACAGT GCTGGGACCG GTCTGATGGC CGTAGCCCAA
AACGAGATGC AAACCAGTTT ATGGCCCTTT AGTCTGGCTG ATATCGACTT TAGAGAGGGT
GATGCGCAGG GCTCGGCATC GGGCTTAGTC CCGGTGACAC AGAACCATGG CGGTGAGGTC
CCCATTCGTG ATTTTGTCAC CTGGAATATA GACCTGAAAC AGATGGGAGT CGGTGGCGAT
ACCTCATGGG GTCGCCCCGT ACATGACAAG TACAGAATTA AGGCCGAGAA GATGAGTTTC
AAGTTCAGGC TCGTCCCCGT AACGGCGCAG AGCCAACTTC AACAACTGGC CCGAGAGTAA
 
Protein sequence
MSKSITLVST LLLGLCSLTS QATERWQDHS VFEVNKEPPH ASFFAYDSIL KASADNYRSS 
KNFMDLNGTW QFHYAPNPAS VPVDFANNSY DPTNWGSIQV PGNWETQGYG HAIYLDERYP
FTTSWPDAPQ DHNPTGSYRR EFTQPEDWQG KQIFFHVGAA RSSLTLFVNG AEVGYSQGAK
TPAEFNITQY LKPGKNLLSM KIIRWSDASY LESQDMLRMS GIERDVYLFA TPQQRISDID
ALYSLNNKLD KAELALSFKL KNHQATAPVT VDYQLLSPAG ELVANGSQSL ILKGDNSVIF
EDKLVSPLLW SAETPELYQL IVSMKDDKGK LLQASSQHIG FRHIEIKAGQ LLINKRAITI
RGVDRHETDP LTGHVVSRES MERDIRLMKL NNINAVRSSH YPNDPYWLQL ADRYGMYIVD
EANIESHPLA IDEKTQLGNE MSWYPAHLAR VERMLERDKN HPSVIIWSLG NEAGEGKLFE
QLYRWVKQRD SSRPVQYEPA GMAAYTDIVA PMYPSIERIE TYAKAHTDRP LIMIEYAHAM
GNSVGNLQDY WDVIEKYPNL QGGFIWDWVD QSLKFTNEKG EDYWAYGKDY HPDMPTDGNF
LNNGLVDPDR NPHPHLSEVK KVYQAIGFDN FKFNGNFDDN LDDKSASIRL TNKYDFQSTK
GLTLSWSLQK DGIAVASGSE PMPVIAAGEA KSVTIKLTNK PALVFDSRYE YRLLVSVSLA
EPRPLIPAGH ELAFAQYLLL EASPNISPVS TATSLAETET RWQLSHGDNH YSISQISGWL
TEIEIDGEPQ IQAPLMANFW RAPTDNDLGN GMPDWGGVWQ DASAQLTLES IEPLEEHGLI
VTHQHPKLGI TLVTQYRVNK RGELMVNISF NPGKTALPDL PKFGFTTRLP FEQRYLSYYG
RGPEETYADR ASGNPFGWYQ LPIEKLFHRY SRPQETGQRT QIRYAAVRNS AGTGLMAVAQ
NEMQTSLWPF SLADIDFREG DAQGSASGLV PVTQNHGGEV PIRDFVTWNI DLKQMGVGGD
TSWGRPVHDK YRIKAEKMSF KFRLVPVTAQ SQLQQLARE
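
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence. It is not the tool that produced this record: the helper names are hypothetical, and it relies on the fact that NCBI translation table 11 assigns codons to amino acids exactly as the standard code does (the two tables differ only in the permitted start codons).

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G; third
# position varies fastest). Table 11 shares these assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate frame 1 of a cleaned DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as displayed in this record.
first_line = "ATGTCAAAAT CCATCACCCT AGTCAGCACT CTACTACTCG GCTTATGCAG TCTCACCTCC"
print(translate(clean(first_line)))  # MSKSITLVSTLLLGLCSLTS
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `translate` should allow the same comparison over the whole protein.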