Gene Sbal223_3412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3412 
SymbolmdoD 
ID7086026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp4051431 
End bp4053119 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content49% 
IMG OID643462298 
Productglucan biosynthesis protein D 
Protein accessionYP_002359319 
Protein GI217974568 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000028825 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCCA ATTTATTTCC CGAACAAAGA AACGTTTACC AACAATCGAT GGTGACAGTA 
CCTTCGATTG AAGCCCAATC CTCCTCTCGT CGCGGCCGTT TGCAGCGCAT GGTTGGCAGC
GTATTGTTAA CCTGTAGCGT ATTGCTGAGT CTTTCGGCGT GTGCGGCGGG TAAGCAGGCG
TCACCACAAG AGGGTGAAGA GTTCAGTTAC GCTTGGCTTA AGGGCTATGC CCGTACTATG
GCTGCACAAC CCTATGAAAG CCATAAAGGC GAATTGCCTA AAAGCCTTCA GGGCATGAGC
TGGGATGATT ACCAACAATT CCACTTTAAG AAAAAAGCCG CACTGTGGCG CGATCAGGAC
TCGGAGTTTC GTGCCGAGCT GTTCCACCTA GGGCTGTATT TCGACACGCC AATTCACATT
TATGAATTGA ATGAAGGCAA GGCTAAGTTA ATCGACTATT CGCCTTCTAT GTTCGATTAC
GGCAAGTCAA AGGTGAAAGG CAGTCAGTTG CCTAAGGATC TTGGCTTTGC GGGTTTCCGT
ATGCAGTTCA ATACCGATTG GGAACGTGAC GTTGTCGCTT TCCTTGGCGC CAGTTATTTC
CGCGCTGTGG GTCATGAGAT GCAATACGGC CTGTCGGCTC GTGGTTTAGC GGTGGATACT
GCGCTGCCAA AACCTGAAGA GTTTCCAATG TTCACCGACT TCTGGTTAGA AAAACCTAAA
CCAGGATCGA ACATCACGAC TGTGTATGCC TTGTTAGACT CTCCGAGTGT GACAGGTGCT
TACCGTTTCG ATATCGAGCC GGGTGAGCGT TTAAAAATGA AAGTGGATGT GGCTGTTTAT
CCACGTAAAG CGATTGAACG TTTAGGTGTA GCGCCACTGA CCAGTATGTT TATGGTGGGT
GAGAACGATC GCCGCACCGG TTACGACTGG CGTCCAGAAA TCCATGATAC CGATGGCTTA
GCCATGCACA CGGGTAATGG TGAATGGATT TGGCGTCCAC TAGGTAACCC AGAAAACCTG
CGATTCAACG CCTATAGCGA TGAAAATCCA AAAGGTTTTG GTTTACTGCA ACGTGACCGT
AACTTCGATC ACTATCAGGA TGACGGCGTG TTTTACGATA AACGCCCAAG CCTGTGGATT
GAACCAACAA GCAGTTGGGG CAAAGGTTCG GTGCAGTTAG TTGAAATCCC AACACTGGAT
GAAACCTTCG ATAACATAGT GGCTTTCTGG AATCCTGCTG AGCCGATTGT TCCGGGTCAA
GAGCTGCTGT ATAGCTACAA CATGTATTGG GGCGGCATTC CTCCGGTTCA ATCACCCCGT
GCTCGCGTTG TTGATACCTT CACCGGGATT GGCGGCGTAG TTGGCCAGAA ACGCAAATAC
TACAGTAAGC GTTTTGTGGT GGATTTCGCG GGCGGCACTT TGCCTATGAT AGGCAAAGAC
ACCCAAGTAA AAGCCGTGAT CACTGCATCT GAAGGTAAAG TCGAAATTGA ATCTGCCCGT
CCTCTAGCGT CCATTAATGG TTACCGCGCT ATGTTTGACG TAGTGCCACC CGGAGACGGC
ACTGAGCCGA TTAACCTGAG AGTCTATTTA GAAGTCGATG GTCAGCCATT GTCTGAAACT
TGGATGTATC AGTGGAATCC GCCTGCTAAG GATGATCGTG AATTGCACAA TGCGGGTCAT
CTGCAATAA
 
Protein sequence
MQPNLFPEQR NVYQQSMVTV PSIEAQSSSR RGRLQRMVGS VLLTCSVLLS LSACAAGKQA 
SPQEGEEFSY AWLKGYARTM AAQPYESHKG ELPKSLQGMS WDDYQQFHFK KKAALWRDQD
SEFRAELFHL GLYFDTPIHI YELNEGKAKL IDYSPSMFDY GKSKVKGSQL PKDLGFAGFR
MQFNTDWERD VVAFLGASYF RAVGHEMQYG LSARGLAVDT ALPKPEEFPM FTDFWLEKPK
PGSNITTVYA LLDSPSVTGA YRFDIEPGER LKMKVDVAVY PRKAIERLGV APLTSMFMVG
ENDRRTGYDW RPEIHDTDGL AMHTGNGEWI WRPLGNPENL RFNAYSDENP KGFGLLQRDR
NFDHYQDDGV FYDKRPSLWI EPTSSWGKGS VQLVEIPTLD ETFDNIVAFW NPAEPIVPGQ
ELLYSYNMYW GGIPPVQSPR ARVVDTFTGI GGVVGQKRKY YSKRFVVDFA GGTLPMIGKD
TQVKAVITAS EGKVEIESAR PLASINGYRA MFDVVPPGDG TEPINLRVYL EVDGQPLSET
WMYQWNPPAK DDRELHNAGH LQ