Gene Haur_4181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4181 
Symbol 
ID5736043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5332317 
End bp5334020 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content54% 
IMG OID641281336 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001546941 
Protein GI159900694 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAA TTGTGCCTGG GTATCACATT ACAACCGCTG CCCAAATTCG GGCAATCGAG 
CAACATGCTG TTGACGAAGG GGCGACGTGG GCTGGCCTGA TGGCCGAGGC TGCACGCGGC
ATGGCCGATG TTGGATTAAC CGTGATCGCC AAGCAAACCA ACCCCAGCGT GTTAGTGCTG
GTTGGTTCTG GCAACAATGG CGGCGATGCT TTGGTGATTG CACGGCATAT CAAAGAGGCT
GGCTTTCCAG TCACCTGCTA TCTATATAAA CGCAAGCCGC ACGCTGATGA TTGGCCGTTT
GCTGCTGCCC AAGCCGAAGC CATTCCAATG ATCTTCGCTG CTGATGATCC ACAAAATCAG
CAACTCCAGC AACTGTTGCA AACCAACACA TTCATCATTG ATGGGTTGTT TGGCATTGGC
CTGAGTCGTT CGTTGGCGGC TGAGGTAGTC CAAATTATCG ATTTGGTCAA TGCCAGCAAA
TTGCCAGTTT TGGCAGTCGA TGTGCCTTCG GGCCTTGATG CTGACAATGG CAAGATTTGG
GGCACAATCA TCAATGCTGC CTACACCGTA GCGGCTGGCC TAACCAAACG CGGCCATCAT
TTGTACCCAG GTGCGGCCTA TGTTGGCAAA TTGGCAATCG CCCCCTTTAC CCTACCAGAT
TCTATGGAGG AGCCGATGAC TACAACTGAA CTGAATCTTG CGACAATCCG TTCGCTGGTG
CCGGCCCGTC CGGTTGATGG CCATAAAGAT ACTTTCGGGC GCGTGATGGT TGTGGCTGGC
TCGTATCTCT ATCCTGGCGC GGCTTGGCTT GCGGCCACGG CGGCGGCTCG TTCGGGTGCT
GGGGTAGTAA CCTTGGCTTG CCCACGCTCA ATTTATGGCA GCACCGCCGC CCATCTGCAT
GAAGTAACCT ATTTGCCGTT GCCAGAAGTT GAACCAGGCG AGTTGACCGA AGCCGCTGCT
AAGTTGGTGC ACGAAAAACT AGCCAAATAT AAAGCCTTGC TGGTTGGACC AGGCCTTGGC
ACCGAAACAG GCACTGGCGA TTTTCTACGG GCGCTGATTG GCTTGGCCTC AAGCAAACGT
CGGTTAGGCG TAGGTTTTCT TGGCTCAAGC GAGCTTGAGT TGCCGACCAA ACGCAAAGGC
GGGGTTGGTT TCGGCCTAGC TGCTCGGCCC AAAGAAGAAG CAAAGGCTGA AGAAGATGGC
CCAGTCGTGC TGCCACCGCT GGTTATCGAT GCCGATGGCT TGAATATTTT GGCCACAATT
GAACATTGGG AAGAAAAATT ACGCGATCAG CCTGTCGTGC TCACACCACA CATCGGCGAG
ATGGCCCGCT TGTTGGGCGA AGAAAAAATC GGTGAAGATC ACCCACAAAT TGCACTCGAA
GCGGCAGCCC GCTGGGGCGT GACCGTGGTG CTGAAATCGG CCCACACAAT TATTGCAAGC
CCTGATGGCC GCTTGGCGTT GCATGGTTTG GCTAACCCAG CATTGGCAAC CGCCGGATCT
GGTGATATCC TTGCAGGACT AACTGCTGGC TTGATGGCAC AAGGCTTAGC CCCATTTGAA
GCCGCCCAGC TTGCAGTTGG CGTGCATGGC GTTGCAGGCG CTCTGGTTCG TGAAGAACTC
GGCGAACGCG GCACCATTGC TAGCGATATT CTTAATCGCC TGCCCTTAGC ATGGCGAAAA
CTTACAGAAG GCGGATTAAA ATAA
 
Protein sequence
MPKIVPGYHI TTAAQIRAIE QHAVDEGATW AGLMAEAARG MADVGLTVIA KQTNPSVLVL 
VGSGNNGGDA LVIARHIKEA GFPVTCYLYK RKPHADDWPF AAAQAEAIPM IFAADDPQNQ
QLQQLLQTNT FIIDGLFGIG LSRSLAAEVV QIIDLVNASK LPVLAVDVPS GLDADNGKIW
GTIINAAYTV AAGLTKRGHH LYPGAAYVGK LAIAPFTLPD SMEEPMTTTE LNLATIRSLV
PARPVDGHKD TFGRVMVVAG SYLYPGAAWL AATAAARSGA GVVTLACPRS IYGSTAAHLH
EVTYLPLPEV EPGELTEAAA KLVHEKLAKY KALLVGPGLG TETGTGDFLR ALIGLASSKR
RLGVGFLGSS ELELPTKRKG GVGFGLAARP KEEAKAEEDG PVVLPPLVID ADGLNILATI
EHWEEKLRDQ PVVLTPHIGE MARLLGEEKI GEDHPQIALE AAARWGVTVV LKSAHTIIAS
PDGRLALHGL ANPALATAGS GDILAGLTAG LMAQGLAPFE AAQLAVGVHG VAGALVREEL
GERGTIASDI LNRLPLAWRK LTEGGLK