Gene Information |
Locus tag | GSU1802 |
Symbol | |
ID | 2686333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 1967601 |
End bp | 1969160 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637126489 |
Product | YjeF family protein |
Protein accession | NP_952852 |
Protein GI | 39996901 |
COG category | [G] Carbohydrate transport and metabolism; [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
|
|
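The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table for GSU1802.
START_BP = 1967601
END_BP = 1969160
GENE_LENGTH_BP = 1560
PROTEIN_LENGTH_AA = 519


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coordinates_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
lengths_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the span 1967601..1969160 covers 1560 bp, which corresponds to 520 codons: 519 residues plus the stop codon.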
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0891524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTGG TGAGCGGCGA AACCATGCAG CGGATGGACC GTCGCGCCAT CGACGAGTTC GGTATCCCCG GCCTTGTCCT CATGGAAAAC GCCGGCCGCG GATGTGCCGA CGCCATCAGA GAGATGTTCG GCCGCGACGG CTGCATACCG GTCCTTGTGG TCGCGGGCAA GGGTAACAAC GGCGGCGACG GCTATGTGAT CGCTCGACTG CTGGCTGGGG AAGGGTGGCC GGTGCATACG GTTGTGCTTG CCCGTAAGGA CGAAATTGGC GGTGATGCCC GCGAGAATCT GGACCGCCTC GACCCGTCGA CGGTTTCTTA CCTCCCTGCA GGGGGAACTC TTTCGTCCCT TACGGCTAGG CTCGACGCTG CGGCATTGGT GGTGGATGCC CTGCTGGGGA CAGGGCTGAA GAACGAGGTG CAGGGCGCGT ATGCCGAAGC GATCAGGCAC ATCGCAGCTT CAGCTCGGCC GGTTGTCTCG GTCGACATCC CGTCGGGTAT CGACGCCGCA ACGGGCAAGG TGCTCGGGGT TGCGGTCACG GCCAGTCTGA CGGTAACCTT CGCCCTTGCA AAATACGGAC ACGTTCTTTA TCCGGGGGCG CTGCACTGCG GCCGGCTGAG AGTTGTCAAT ATCGGTATTC CGGAGTCAGT CGCCCGGGAA GCGGACGGCA TTCTCTACGT TGACGCGGCT GAGGCGGCAG CAGTGGTTAA GCGGCGGGAC CCATGCTCCC ACAAGGGGAG CTTCGGGCAC AGCCTCGTTA TAGCCGGGTC TGTCGGGAAG ACGGGAGCCG CGGCAATGGC CGCGAACAGC GCCGTTCGAA GCGGAGCAGG CCTCGTTTCG CTGGCAGTTC CGGCAAGCCT TAATGCAATT CTCGAGTTAA AGACCACCGA GGCTATGACA ATCCCGCTGG CTGACGGTGG GGTCGGTTTC CTCGGGGATG AGTCACTTGT TCCGCTGAGG GACGCGATTC GAGGCCGGGA TGCGATAGCC CTTGGTCCCG GCCTGTCGTG GCAGCCAGCT ACTGCCGCCC TTGTGCGGCA CCTGCTGGCT GATATTATGG TGCCCCTCGT CCTTGACGCC GATGGCCTCA ATGCAATTTC CGAGCAGACC GAACTCTTGA AGGGCGCGCG CCCCGACACC GTTGTCCTCA CGCCCCATCC TGGGGAGATG GCGCGTCTTG CGGGAACCAC CACTGCGGCG GTCGAAGCGG ACCGTATCGG CGTTGCGCGA GACTTTGCCG CACAGTTCGG CGTCTATCTT ATCCTCAAGG GCGCACGGTC CGTCATCGCG GCGCCCGATG GGCGCATCGC CCTCAACGGC AGCGGCAATC CGGGCATGGC CTCGGGGGGG ATGGGGGATG TGCTCACTGG TGTTGTCACG GCACTCCTTG GACAGGGGTA CGAACCATTC GACGCCTGCA TACTGGGTGC TTTCGTTCAC GGTCATGCCG CTGACCTGGT TGCTGCGGAC AAGGGCGAGA CAGGCATGTC CGCTCTCGAT GTCCAGGAGC GGCTACCCTA TGCATTCAAT TCACTGATCC GTTTGAAGGG AGAACAATAA
|
Protein sequence | MKVVSGETMQ RMDRRAIDEF GIPGLVLMEN AGRGCADAIR EMFGRDGCIP VLVVAGKGNN GGDGYVIARL LAGEGWPVHT VVLARKDEIG GDARENLDRL DPSTVSYLPA GGTLSSLTAR LDAAALVVDA LLGTGLKNEV QGAYAEAIRH IAASARPVVS VDIPSGIDAA TGKVLGVAVT ASLTVTFALA KYGHVLYPGA LHCGRLRVVN IGIPESVARE ADGILYVDAA EAAAVVKRRD PCSHKGSFGH SLVIAGSVGK TGAAAMAANS AVRSGAGLVS LAVPASLNAI LELKTTEAMT IPLADGGVGF LGDESLVPLR DAIRGRDAIA LGPGLSWQPA TAALVRHLLA DIMVPLVLDA DGLNAISEQT ELLKGARPDT VVLTPHPGEM ARLAGTTTAA VEADRIGVAR DFAAQFGVYL ILKGARSVIA APDGRIALNG SGNPGMASGG MGDVLTGVVT ALLGQGYEPF DACILGAFVH GHAADLVAAD KGETGMSALD VQERLPYAFN SLIRLKGEQ
|
| |
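The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned and compared is given below. It assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for sense codons. The helper names are hypothetical, and alternative start codons permitted by table 11 are not handled.

```python
# Codon table built from the conventional TCAG ordering.
# Sense-codon assignments are the same in table 1 and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a cleaned CDS; one trailing stop codon is dropped.

    Limitation: a non-ATG start codon is not converted to Met here.
    """
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence from the record.
head = clean_sequence("ATGAAGGTGG TGAGCGGCGA AACCATGCAG")
head_protein = translate_cds(head)  # "MKVVSGETMQ", the first protein block
```

Passing the full Gene sequence field through `clean_sequence` and `translate_cds` and comparing the result with the cleaned Protein sequence field would be one way to confirm that the two fields agree; `gc_fraction` can be compared against the reported GC content in the same manner.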