Gene ECH_0751 details


Gene Information

Locus tag: ECH_0751
Symbol:
ID: 3926990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 757111
End bp: 758502
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 34%
IMG OID: 637901869
Product: YjeF family protein
Protein accession: YP_507549
Protein GI: 88658205
COG category:
    [G] Carbohydrate transport and metabolism
    [S] Function unknown
COG ID:
    [COG0062] Uncharacterized conserved protein
    [COG0063] Predicted sugar kinase
TIGRFAM ID:
    [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
    [TIGR00197] yjeF N-terminal region
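
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the Gene Information record.
START_BP = 757111
END_BP = 758502


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1392 463
```

Under these assumptions the computed values agree with the reported gene length (1392 bp) and protein length (463 aa).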


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.553827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTATTC TAAATGGTCA ACAAGTTGTT TCTTTTGAAA AGAGTTGTGG TGTGGCCATT 
GATGAGCTTA TTTGTAGAGC TGGAAAGGCT ATAAGTGATG TTATATTTAA GCTATTCCCA
AAACAGCCTG TTGCAATAAT TTCTGGACCT GGTAATAATG GTAAAGATGG TATTGTTACT
GCAAAAATAC TGAAAGCTCA TGGCTGGCCA GTAGTCTTGA TGTTGTATAA TTGTACTACC
GATATAGATG AGGATTGGGT TGTTCCACTA ACTTATGATA ATGTTGTTAA TTGTCAGAGT
TGTTTAGTTA TAGATGCCTT ATTCGGAATA GGGTTATCGC GCAATATACC TGAGAATTTA
TCGTGTATAT TTCACTATAT TAACGATTCT AATAATAAAG TTGTTGTTGC AGTTGATATT
CCTAGTGGTA TAAATTGTGA TACAGGTCAG GTTATGGGCT GTGCAATACG TGCTGATGTT
ACTGTAACTT TTTCGGTATT AAAAATAGGG CATGTTTTGT TTCCTGGTTG TGATTATTCT
GGAAAAGTAC ATGTTGTAGA TATTGGCATT GATATTGATG ATAGTAAAGT TGTAATACGT
AAAAATGCTC CTGCTTTGTG GAAACACAAA ATGCCTAAGT TGGAGTATAC ATCAAATAAG
TATAATAGGG GATACACATT AGTGTGTTCT GTTGGTAATA AGTCTATAGG TGCTTCAAAA
CTTGTTGCAA TGTCTGCTTT GAGAGTAGGT TCTGGTATAG TAAGTATTGC TTGTGACAGT
AATGCTGTTG CATTTTATGC AAGTTGTCTA ACGTCGATTA TGTACAAGCT TTATGATGAT
GTAATTAATG ATGATAGAAT TACATCTATT GTAATAGGAC CAGGATGTGG AATAAATGAT
ATTACTAAAC AGCGTACTAT GGATATCCTA AATAAACAAA ATTGTGTTTT AGATGCTGAT
TCTATCTCAG TATTTTCTGA TTCTTATGAA GTTCTTTTTT CCAAAATTCA ACATAATGTT
GTTATGACAC CACATGAAGG AGAGTTTAAG CGTATATTTC CATTTTTAAC TGGTGGTAAA
ATAGAAATGG CAAGGGAAGC AGCAAGTTTA TCGAAAGCTG TGATAGTGTT AAAAGGTCCG
GATACTGTAA TTGCTGATCC TATAGGTAAT GTTGTAGTAA ATAATGCTCC ATTTAGTTTA
GCTACAGCAG GTAGTGGTGA TGTTTTATCT GGAATTATTG GTGGATTATT ATCTTCTGGT
ATGAGTCCAT TTGATGCTGC ATGTTGTGGA GTATGGATAC ATACAGAATG TGCAAGGAAG
TATGGTATTG GTTTGATTGC AGATGATATA ATACTGGAAA TACCTCAAGT ATTAAAAAAA
TTATTTTGTT AA
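
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch: `gc_percent` is a hypothetical helper, and it assumes the sequence contains only A, C, G and T once whitespace is removed. How the source database rounds the value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence; whitespace is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First line of the gene sequence, used here only as a short demonstration.
first_line = "ATGGTTATTC TAAATGGTCA ACAAGTTGTT TCTTTTGAAA AGAGTTGTGG TGTGGCCATT"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to the same function gives the figure to compare with the reported GC content.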
 
Protein sequence
MVILNGQQVV SFEKSCGVAI DELICRAGKA ISDVIFKLFP KQPVAIISGP GNNGKDGIVT 
AKILKAHGWP VVLMLYNCTT DIDEDWVVPL TYDNVVNCQS CLVIDALFGI GLSRNIPENL
SCIFHYINDS NNKVVVAVDI PSGINCDTGQ VMGCAIRADV TVTFSVLKIG HVLFPGCDYS
GKVHVVDIGI DIDDSKVVIR KNAPALWKHK MPKLEYTSNK YNRGYTLVCS VGNKSIGASK
LVAMSALRVG SGIVSIACDS NAVAFYASCL TSIMYKLYDD VINDDRITSI VIGPGCGIND
ITKQRTMDIL NKQNCVLDAD SISVFSDSYE VLFSKIQHNV VMTPHEGEFK RIFPFLTGGK
IEMAREAASL SKAVIVLKGP DTVIADPIGN VVVNNAPFSL ATAGSGDVLS GIIGGLLSSG
MSPFDAACCG VWIHTECARK YGIGLIADDI ILEIPQVLKK LFC
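
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from the usual compact 64-letter string; table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as start codons. `translate_cds` is an illustrative helper, not a library function, and it assumes an unspliced, in-frame CDS. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cleaned), 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence from this record.
prefix = "ATGGTTATTC TAAATGGTCA ACAAGTTGTT"
print(translate_cds(prefix))  # MVILNGQQVV
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way allows the whole protein to be compared.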