Gene Sbal195_3898 details

Gene Information

Locus tag: Sbal195_3898
Symbol:
ID: 5755713
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS195
Kingdom: Bacteria
Replicon accession: NC_009997
Strand:
Start bp: 4592721
End bp: 4594205
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 52%
IMG OID: 641290240
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_001556318
Protein GI: 160877002
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
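
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4592721, 4594205)
print(length)                  # 1485
print(protein_length(length))  # 494
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields reported for this gene.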


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000196302
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAAG ATCTATCGCA ATTCCCCAAG GCCTTATTTA CCCAAGCACA AGTTCGCCAA 
ACTGAACTGA GTGCTGTGTC GCAGGGAGCC TCTAGCCTCT ATGAATTGGT CGAGCGTGCT
GGCGCGGCGG CTTTTGAATG TTTAACAAAA CATAACCCCA ACGCCTCTTC TGTATTTATC
CTCGCTGGCA GCGGTAACAA TGGTGCCGAT GCCTTAGTGT GCGCCCGTTT AGCCCGTGCA
AGTGGTATGG CCGTTTCAGT GATGATGACG TCAGCGGCAG GCACGCCTGA GTGTCAGCAA
GCATTGGCCC ATTATCTTAA AGACGGCGGA GAGTTACTGC CGAAAGCCGT TGCGCCCATT
CTTGCGGCCA AGATCATCGT CGATGGTTTA CTCGGCACTG GTGTGCGAGA TGCGGTGCGC
GACGATATGG CTGAGTATAT TCGTGCCATT AATGATAATG CAGCTTGGGT ATTGAGCCTT
GATTTACCGT CTGGAGTGAT TGCCGATACC GGCGCTGTGG CGGGTGTTGC AGTGATGGCG
GATGTCACCT TGTGTTTTGG TGGCTGGAAG CAAGGTTTGT TAACGGGTAA GGCGCGGCAT
TACAGCGGCG AACTTGAATT TGCCGCTTTG GGATTAGCAC CTTTCTTCGC TGAAGCGAGT
GCACAAAGAG TCGGAAAAGA GACGCTTAAG GATTATTTTG CCGCCAGAGC GCGCGATAGC
CATAAAGGCC AGTCGGGCAA GGTCACTGTT ATTGGCGGTG ATATGGGTAT GGCTGGCGCC
GTGCGACTCG CCAGTGAGGC ATGTCTGCGT GCCGGTGCTG GATTAGTCAC AGTGATCAGT
CGACCTGAGC ATCAATTGAC TGTGAATGTC TCTCGTCCGG AATTGATGTT CTGGGGTTGT
GACTTAGTCG ATATGGAAGT GTATCTGCGC CTTGGTTGGG CGCAGGTTAT CGTGCTTGGC
CCTGGTTTAG GTAAGCATGA CTGGGGCTAT AACTTGTTTA AAGCCGTGGG CTTAAGCGAT
AAACCCTGTG TTTTGGATGC CGATGCCTTA AATCTATTGA GCAATGAACC GCGCCAGCAA
ACCAATTGGG TGCTAACACC GCATCCGGGT GAGGCTGCGC GTTTACTTGG CTGCTCTGTG
GCAGAAATTG AGCAGGATAG GTTTGCGGCT GTAAGGGCGA TACAGCAAAA GTATGGCGGT
GTTGTACTGC TAAAAGGCGC GGGAACGGTC ATTTTTGATG GCAAACAAAT GGTTGTTGCA
CCTGTAGGTA ATCCTGGGCT TGCCAGTGGT GGGTGTGGCG ATGTGTTATC TGGTATTATA
GGCGCGCTCA TGGCTCAGGG AATGGATAAC ATGCAGGCGA CTGTGCTGGG TGTTGTTGTA
CATGGTTGCG CTGCCGATCT TGCGGCAATA CAGGGAGAAC GTGGCATGTT AGCGAGCGAT
TTAATGCCTT TTATTCGCCA ATTAGTGAAT AGTGATTTAC TCTAG
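
The gene sequence is displayed in blocks of 10 bases, 60 per line, so the spacing has to be removed before any computation. A minimal sketch, assuming the sequence is pasted as plain text; `clean_sequence` and `gc_percent` are hypothetical helper names. Running `gc_percent` on the full cleaned sequence gives a value that can be compared with the GC content field of the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied as shown on the page.
first_line = "ATGGCACAAG ATCTATCGCA ATTCCCCAAG GCCTTATTTA CCCAAGCACA AGTTCGCCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_percent(seq))
```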
 
Protein sequence
MAQDLSQFPK ALFTQAQVRQ TELSAVSQGA SSLYELVERA GAAAFECLTK HNPNASSVFI 
LAGSGNNGAD ALVCARLARA SGMAVSVMMT SAAGTPECQQ ALAHYLKDGG ELLPKAVAPI
LAAKIIVDGL LGTGVRDAVR DDMAEYIRAI NDNAAWVLSL DLPSGVIADT GAVAGVAVMA
DVTLCFGGWK QGLLTGKARH YSGELEFAAL GLAPFFAEAS AQRVGKETLK DYFAARARDS
HKGQSGKVTV IGGDMGMAGA VRLASEACLR AGAGLVTVIS RPEHQLTVNV SRPELMFWGC
DLVDMEVYLR LGWAQVIVLG PGLGKHDWGY NLFKAVGLSD KPCVLDADAL NLLSNEPRQQ
TNWVLTPHPG EAARLLGCSV AEIEQDRFAA VRAIQQKYGG VVLLKGAGTV IFDGKQMVVA
PVGNPGLASG GCGDVLSGII GALMAQGMDN MQATVLGVVV HGCAADLAAI QGERGMLASD
LMPFIRQLVN SDLL
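
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual NCBI base ordering (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which this sketch does not handle. The `translate` helper is hypothetical, and the input is the last displayed line of the gene sequence, which begins in frame because 1440 bases precede it.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # remove display spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


last_line = "TTAATGCCTT TTATTCGCCA ATTAGTGAAT AGTGATTTAC TCTAG"
print(translate(last_line))  # LMPFIRQLVNSDLL
```

The output matches the final 14 residues of the protein sequence listed above.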