Gene Sama_3197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_3197 
Symbol 
ID4605444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp3789009 
End bp3790049 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content58% 
IMG OID639782613 
Producthypothetical protein 
Protein accessionYP_929069 
Protein GI119776329 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR00661] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAC TGTACGGAGT GCAGGGGACG GGTAATGGCC ACTTGAGCCG GGCGAGGGTG 
ATGGCCAAAG CCCTTGCCGA GCGCGGCGCC GAGGTTGATT ACCTGTTCAG TGGCAGACCC
CAATCGCAGT TTTTTGATAT GGACATCTTC GGTGACTACC GAGTTGCCAC CGGGCTGACC
TTTATCAGCA AGGCGGGACG CATCAGCTCG GTGGAGACGG TGCGCCATAA TCTCAGTTGC
CGCTGGTGGC AGGACATGCG CGGCCTGGAT CTCTCATCTT ACGATCTGGT GCTCAACGAT
TTTGAGCCCG TGAGCGCCTG GGCAGCCCGG CGGCAAAAGG TGCCCTGTAT TGGCATCAGC
CATCAGGCGG CGCTGCGCTT TGATGTACCC AAGGTGGGCA ACACCTGGTT TAACGAACGA
TTACTGCAAT ACTTTGCCCC CGTGGATGTG GCGCTTGGTT GTCACTGGCA TCATTTCGGT
TTCCCGCTGC TGCCACCCTT TGTGGATGTG GGTGAAGTCA GTGAAGAGCA TGGTCACGAT
ATCCTGGTCT ACCTGCCCTT TGAGGCTGCC GACGACATCA TTGATTTTCT GCGTCCCTTC
GAAAACTATC GGTTTTTGGT GTACCACGCC CAGTCGCCCA ATGGCCCTGT GCCCGAGCAT
ATTCAATGGC ATGGCTTCTG CCGCGAGGGA TTTCGCCGTC ATCTGGCTGA GGCCGGTGGT
GTGGTGGCCA ATGCGGGGTT TGAGCTTGCC AGCGAGGCAC TGACGCTGGG GAAAAAACTC
TTGGTAAAGC CACTGCTTGG ACAGTTTGAG CAGCTCTCCA ACGTGGCGGC ATTGCAATTG
TTGGGAGCGG CCCAAACCAT GATGCACCTG AGTCGCGATG TTATGCGCTC CTGGCTTAAG
TCAGCCAGCC CTGAGCCGGT GCGTTATCCC GCCGTGGGCG ACGCCCTGGT GCAATGGGTT
GAGCAGGGAG AATGGCACAG GGGCGATGTA CTCAGTGAGC AGCTGTGGTC ACAGGTGCGC
CTGCCCGACA CCTGGCGCTG A
 
Protein sequence
MKILYGVQGT GNGHLSRARV MAKALAERGA EVDYLFSGRP QSQFFDMDIF GDYRVATGLT 
FISKAGRISS VETVRHNLSC RWWQDMRGLD LSSYDLVLND FEPVSAWAAR RQKVPCIGIS
HQAALRFDVP KVGNTWFNER LLQYFAPVDV ALGCHWHHFG FPLLPPFVDV GEVSEEHGHD
ILVYLPFEAA DDIIDFLRPF ENYRFLVYHA QSPNGPVPEH IQWHGFCREG FRRHLAEAGG
VVANAGFELA SEALTLGKKL LVKPLLGQFE QLSNVAALQL LGAAQTMMHL SRDVMRSWLK
SASPEPVRYP AVGDALVQWV EQGEWHRGDV LSEQLWSQVR LPDTWR