Gene GSU1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1802 
Symbol 
ID2686333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1967601 
End bp1969160 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content63% 
IMG OID637126489 
ProductYjeF family protein 
Protein accessionNP_952852 
Protein GI39996901 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0891524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTGG TGAGCGGCGA AACCATGCAG CGGATGGACC GTCGCGCCAT CGACGAGTTC 
GGTATCCCCG GCCTTGTCCT CATGGAAAAC GCCGGCCGCG GATGTGCCGA CGCCATCAGA
GAGATGTTCG GCCGCGACGG CTGCATACCG GTCCTTGTGG TCGCGGGCAA GGGTAACAAC
GGCGGCGACG GCTATGTGAT CGCTCGACTG CTGGCTGGGG AAGGGTGGCC GGTGCATACG
GTTGTGCTTG CCCGTAAGGA CGAAATTGGC GGTGATGCCC GCGAGAATCT GGACCGCCTC
GACCCGTCGA CGGTTTCTTA CCTCCCTGCA GGGGGAACTC TTTCGTCCCT TACGGCTAGG
CTCGACGCTG CGGCATTGGT GGTGGATGCC CTGCTGGGGA CAGGGCTGAA GAACGAGGTG
CAGGGCGCGT ATGCCGAAGC GATCAGGCAC ATCGCAGCTT CAGCTCGGCC GGTTGTCTCG
GTCGACATCC CGTCGGGTAT CGACGCCGCA ACGGGCAAGG TGCTCGGGGT TGCGGTCACG
GCCAGTCTGA CGGTAACCTT CGCCCTTGCA AAATACGGAC ACGTTCTTTA TCCGGGGGCG
CTGCACTGCG GCCGGCTGAG AGTTGTCAAT ATCGGTATTC CGGAGTCAGT CGCCCGGGAA
GCGGACGGCA TTCTCTACGT TGACGCGGCT GAGGCGGCAG CAGTGGTTAA GCGGCGGGAC
CCATGCTCCC ACAAGGGGAG CTTCGGGCAC AGCCTCGTTA TAGCCGGGTC TGTCGGGAAG
ACGGGAGCCG CGGCAATGGC CGCGAACAGC GCCGTTCGAA GCGGAGCAGG CCTCGTTTCG
CTGGCAGTTC CGGCAAGCCT TAATGCAATT CTCGAGTTAA AGACCACCGA GGCTATGACA
ATCCCGCTGG CTGACGGTGG GGTCGGTTTC CTCGGGGATG AGTCACTTGT TCCGCTGAGG
GACGCGATTC GAGGCCGGGA TGCGATAGCC CTTGGTCCCG GCCTGTCGTG GCAGCCAGCT
ACTGCCGCCC TTGTGCGGCA CCTGCTGGCT GATATTATGG TGCCCCTCGT CCTTGACGCC
GATGGCCTCA ATGCAATTTC CGAGCAGACC GAACTCTTGA AGGGCGCGCG CCCCGACACC
GTTGTCCTCA CGCCCCATCC TGGGGAGATG GCGCGTCTTG CGGGAACCAC CACTGCGGCG
GTCGAAGCGG ACCGTATCGG CGTTGCGCGA GACTTTGCCG CACAGTTCGG CGTCTATCTT
ATCCTCAAGG GCGCACGGTC CGTCATCGCG GCGCCCGATG GGCGCATCGC CCTCAACGGC
AGCGGCAATC CGGGCATGGC CTCGGGGGGG ATGGGGGATG TGCTCACTGG TGTTGTCACG
GCACTCCTTG GACAGGGGTA CGAACCATTC GACGCCTGCA TACTGGGTGC TTTCGTTCAC
GGTCATGCCG CTGACCTGGT TGCTGCGGAC AAGGGCGAGA CAGGCATGTC CGCTCTCGAT
GTCCAGGAGC GGCTACCCTA TGCATTCAAT TCACTGATCC GTTTGAAGGG AGAACAATAA
 
Protein sequence
MKVVSGETMQ RMDRRAIDEF GIPGLVLMEN AGRGCADAIR EMFGRDGCIP VLVVAGKGNN 
GGDGYVIARL LAGEGWPVHT VVLARKDEIG GDARENLDRL DPSTVSYLPA GGTLSSLTAR
LDAAALVVDA LLGTGLKNEV QGAYAEAIRH IAASARPVVS VDIPSGIDAA TGKVLGVAVT
ASLTVTFALA KYGHVLYPGA LHCGRLRVVN IGIPESVARE ADGILYVDAA EAAAVVKRRD
PCSHKGSFGH SLVIAGSVGK TGAAAMAANS AVRSGAGLVS LAVPASLNAI LELKTTEAMT
IPLADGGVGF LGDESLVPLR DAIRGRDAIA LGPGLSWQPA TAALVRHLLA DIMVPLVLDA
DGLNAISEQT ELLKGARPDT VVLTPHPGEM ARLAGTTTAA VEADRIGVAR DFAAQFGVYL
ILKGARSVIA APDGRIALNG SGNPGMASGG MGDVLTGVVT ALLGQGYEPF DACILGAFVH
GHAADLVAAD KGETGMSALD VQERLPYAFN SLIRLKGEQ