Gene Sama_3030 details


Gene Information

Locus tag: Sama_3030
Symbol:
ID: 4605277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 3604771
End bp: 3606261
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 58%
IMG OID: 639782441
Product: YjeF protein
Protein accession: YP_928902
Protein GI: 119776162
COG category: [G] Carbohydrate transport and metabolism; [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region
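
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3604771
END_BP = 3606261
GENE_LENGTH_BP = 1491
PROTEIN_LENGTH_AA = 496


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Codons that encode amino acids, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Under these assumptions, 3606261 - 3604771 + 1 gives 1491 bp, and 1491 / 3 - 1 gives 496 aa, which matches the listed values.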


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATT CTTGCCTGTC CTTGCCTCAA ACACTGTATT CGGCCGCCCA GGTGAAGGCT 
GCTGAAGTCA ATGCAGTCAA TTCAGGGGCT TGTAGCCTGT ACGAGCTGGT CGAGCAGGCT
GGCAGTGCTG CTATGGCGAT TTTAGAGTCG TCCGAGCACT GTAATCTCCC CACAGTGATT
TTGGTGGGCA AGGGGAATAA TGGCGCTGAT GCGCTGGTGG TCGCCAGGTT GCTGCTGACC
AAGGGCAATC GTGCCCGGGT GCTGCTGCTT GGGCAAGGCA AAACCCAGGA ATATGTCACA
GCCATGGCCG CCTTTGTCGA GGCTGGTGGT GTCTCAGAAG CATTTACCGA ATCTGCCCTG
GAAGGTGCTG ACGTCATCAT TGATGGGGTA CTGGGTACCG GAAGCTGCGG CGAACTGAGC
GATGACTTGC AGCAGTGTTT CAACGCGGTA AACGCGACGC CAGCCTGGGT GCTCAGCCTG
GATATGCCTT CCGGTGTGAA TGCGGATACC GGCAACGTGA ATCCGGTTGC GATTCAAGCC
GATGTCACCC TGTGTTTTGG CGGGCTAAAG CAAGGGTTGT TTACTTCCAG AGCACGGCAT
TTTGCTGGGC TTATTCGCTA CGCATCCCTC GGACTTACTG ATTTTCTCGC CCAGCTCCCG
GCAGAGGCCC TGCGGGTTGA TGCAGGGTAT CTGAAGGATC TGCTCGGACG TCGTCCACGC
GACAGCCACA AGGGGAAATC CGGCAAGGTT ACCATCATGG GGGGCAACTA TGGCATGGCC
GGTGCTGTGC GGTTGGCTGG GGAGGCCTGT CTTCGCGCCG GTGCGGGGTT GGTAACGGTT
ATCAGCCGAC CGGAACATCA GCTGACCGTG AATGCCAACC GGCCAGAGCT GATGTTTTGG
GGGTGCGAAT TGGTGGACAT GGAGGTGTAT CTGCGTTTGG GTTGGGCTGA TGTGCTGGTG
CTTGGCCCCG GCCTTGGCAA GGATGACTGG GGATATAACC TGTACAAGGC GGTTGGTTTA
TCGGATAAAC CCTGTGTGCT CGATGCCGAT GCGCTGAACC TGTTATCAAA AGAGCCCTGT
CGTCAGGTCA ATCGCGTTTT GACGCCCCAC CCCGGCGAAG CGGCAAGGCT GCTGGGGGTG
TCCACTGCCC AGATAGAGGT CGACAGGTTT GCTGCTGTCA GGCAGTTACA GGAGAGATTC
GGGGGCGTTG TCTTGCTCAA AGGTGCAGGT ACGCTTATCT ATGACGGCGA TAGTCTGGTG
GTGGCTCCCG TTGGCAACCC GGGCCTGGCC AGTGGTGGGT GCGGAGATGT TTTATCTGGT
ATAATCGCCG CCCTTATGGC TCAGGGGCTC GATACCATGA CGGCGACTAT CGCTGGTGTG
GTGGTCCACG GTGAGGCGGC AGACCTCGCA GCACTCGCCG GGGAGCGGGG CATGCTCGCC
AGTGACCTGA TGCCTTTTAT TCGCCAATTG GTCAATAGTG ATTTAATCTA G
 
Protein sequence
MSDSCLSLPQ TLYSAAQVKA AEVNAVNSGA CSLYELVEQA GSAAMAILES SEHCNLPTVI 
LVGKGNNGAD ALVVARLLLT KGNRARVLLL GQGKTQEYVT AMAAFVEAGG VSEAFTESAL
EGADVIIDGV LGTGSCGELS DDLQQCFNAV NATPAWVLSL DMPSGVNADT GNVNPVAIQA
DVTLCFGGLK QGLFTSRARH FAGLIRYASL GLTDFLAQLP AEALRVDAGY LKDLLGRRPR
DSHKGKSGKV TIMGGNYGMA GAVRLAGEAC LRAGAGLVTV ISRPEHQLTV NANRPELMFW
GCELVDMEVY LRLGWADVLV LGPGLGKDDW GYNLYKAVGL SDKPCVLDAD ALNLLSKEPC
RQVNRVLTPH PGEAARLLGV STAQIEVDRF AAVRQLQERF GGVVLLKGAG TLIYDGDSLV
VAPVGNPGLA SGGCGDVLSG IIAALMAQGL DTMTATIAGV VVHGEAADLA ALAGERGMLA
SDLMPFIRQL VNSDLI
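
The sequences above are printed in space-separated blocks of 10 characters, so they need to be normalized before any analysis. The following is a minimal sketch, not part of the record itself, of how the blocked text could be cleaned and how a GC fraction could be computed from it. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is included as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied as printed in the record.
    first_line = (
        "ATGAGTGATT CTTGCCTGTC CTTGCCTCAA "
        "ACACTGTATT CGGCCGCCCA GGTGAAGGCT"
    )
    seq = clean_sequence(first_line)
    print(len(seq))               # six blocks of 10 bases
    print(seq.startswith("ATG"))  # the CDS begins with a start codon
    print(f"{gc_fraction(seq):.2f}")
```

Applied to the full gene sequence, the cleaned length should match the listed gene length of 1491 bp, and the GC fraction can be compared against the listed GC content.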