Gene Sama_1691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1691 
SymbolmdoG 
ID4603942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2067034 
End bp2068656 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content55% 
IMG OID639781054 
Productglucan biosynthesis protein G 
Protein accessionYP_927567 
Protein GI119774827 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3131] Periplasmic glucans biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.138891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0773488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCGTT CTCCTCGCAC CGCCAGTATC AAACCCAAGG CCGTGGCAGC CTTGCTGCTT 
GGCATGTCTG CTCTTTCCCC CATGCATCTC TTCGGCGCTG AGCCAGAGCA GGTGCAAACG
GTAAAGCCTG CCGTTAAAGC GGAAATGCCG CCAAAGCCAA CCAAGCCAAC TCAGGTGCGT
TTTGCCAAAA CCGGTAATTT TGATGCGGAT ACCGTTGTGC GCATCGCAAG ACAGCTTGCA
GCCAAACCCT ATGTGGCTTT GAGCGATCCG CTCCCGCCTG GACTGGCTAA CATCAGCTAT
GATGAGTACC GCGATATTCG CTTCAAGCCA GAGCAGGCCA TTTGGAAGCA GGACGGGTTG
CCCTATCAGA TGCAACTCTT CCACCGTGGG TTCTATTTTC AGGATTTGAT TGAAATCGCC
ATCGTCGAAG GCAAAAAGTC GACTCACCTG TCTTATGACC CGTCACTCTT CAGTGCCGGT
GAAGTCATTC GTGAAAAGCT GCCCAATGAA GACATTGGTT ACAGTGGTTT GCGGGTACAT
TATCCGCTGA ACAGCAGCGA GTATTTTGAT GAGCTCTTTG TGTTCCAGGG CGCCAGTTAC
TTCCGCGCCC TCGGTAAAGG CAATGCCTAC GGCCTGTCTG CCCGTGGCCT TGCCATTAAA
ACCGCCGATC CGGCCGGTGA AGAATTCCCG GTCTTTCGCG CGTTTTGGAT AGAAAAGCCC
AATAACGAAA CTAACCTCAT TGTGGTTCAT GCGCTGCTGG ATAGCCCCAG CGTGGCCGGT
GCATATCGTT TCTCCATCCG TCCCGGTGAC AACACCCGTA TGGATGTGGA AGCCGTGCTC
TTCCCACGGG TAGAGCTTGC CAAAGTGGGT TTGGCCCCGA GCACCAGCAT GTACATGCAT
TCTCCCAATG GCCGCCATCT CACAGACGAT TTTCGCCCAG CGGTGCATGA CTCAGACGGC
CTGTTGATGA TCAACGGCCG GGGTGAGCGT TTGTGGCGTC CGCTGGCAAA TCCAAAGGAT
CTGCAGGTAA GTGCCTTTAT GGATAACTCC CCACAGGGCT TTGGTTTGCT GCAGCGTGAG
CGCAACTATG TGAACTACCA GGATCTGGAA GCCAACTATG AGCGTCGTCC AAGCCTTTGG
GTTGAGCCCG TGGGTAACTG GGGTGCCGGT GCCGTGGTTC TGACTGAAAT CCCGACTCAG
TCTGAAATTC ACGACAACAT TGTGGCCTTC TGGAAGCCTC GTCAGCCACT TGCGGCCGGC
AGCGAGTACC GCTTTGCTTA CCATCTGTCC TGGGGTGCCA ATCCTGTGCC AGTGGATAAC
AGCATTATCG TAAGCCGCAG TGCCAGTGGC CGTGCCGACA TTGCCAAGCC AACGCCAAAA
CGCCTGTTCG TGGTGGATTA TGAGGTGAAA GGCGAGAAGC CCGCCAAGTT GCCCACGCCC
AAGGTGGAAA CCTCCGCAGG CGTTGTCAGC AATGTGGTTA TCCGCGAAAA CCCCAAATCA
AAAGGCTATC GTTTGTCGTT TGAGTTTGAC CCGGGTGAAA CCAAACTGGC TGAGTTTCGT
GCCGAGCTTA AGTTTGACGA ACCCCGCAGC GTAGAAACCT GGCTGTACCG TTGGACGCTC
TGA
 
Protein sequence
MVRSPRTASI KPKAVAALLL GMSALSPMHL FGAEPEQVQT VKPAVKAEMP PKPTKPTQVR 
FAKTGNFDAD TVVRIARQLA AKPYVALSDP LPPGLANISY DEYRDIRFKP EQAIWKQDGL
PYQMQLFHRG FYFQDLIEIA IVEGKKSTHL SYDPSLFSAG EVIREKLPNE DIGYSGLRVH
YPLNSSEYFD ELFVFQGASY FRALGKGNAY GLSARGLAIK TADPAGEEFP VFRAFWIEKP
NNETNLIVVH ALLDSPSVAG AYRFSIRPGD NTRMDVEAVL FPRVELAKVG LAPSTSMYMH
SPNGRHLTDD FRPAVHDSDG LLMINGRGER LWRPLANPKD LQVSAFMDNS PQGFGLLQRE
RNYVNYQDLE ANYERRPSLW VEPVGNWGAG AVVLTEIPTQ SEIHDNIVAF WKPRQPLAAG
SEYRFAYHLS WGANPVPVDN SIIVSRSASG RADIAKPTPK RLFVVDYEVK GEKPAKLPTP
KVETSAGVVS NVVIRENPKS KGYRLSFEFD PGETKLAEFR AELKFDEPRS VETWLYRWTL