Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0751 |
Symbol | |
ID | 3926990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 757111 |
End bp | 758502 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901869 |
Product | YjeF family protein |
Protein accession | YP_507549 |
Protein GI | 88658205 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.553827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTATTC TAAATGGTCA ACAAGTTGTT TCTTTTGAAA AGAGTTGTGG TGTGGCCATT GATGAGCTTA TTTGTAGAGC TGGAAAGGCT ATAAGTGATG TTATATTTAA GCTATTCCCA AAACAGCCTG TTGCAATAAT TTCTGGACCT GGTAATAATG GTAAAGATGG TATTGTTACT GCAAAAATAC TGAAAGCTCA TGGCTGGCCA GTAGTCTTGA TGTTGTATAA TTGTACTACC GATATAGATG AGGATTGGGT TGTTCCACTA ACTTATGATA ATGTTGTTAA TTGTCAGAGT TGTTTAGTTA TAGATGCCTT ATTCGGAATA GGGTTATCGC GCAATATACC TGAGAATTTA TCGTGTATAT TTCACTATAT TAACGATTCT AATAATAAAG TTGTTGTTGC AGTTGATATT CCTAGTGGTA TAAATTGTGA TACAGGTCAG GTTATGGGCT GTGCAATACG TGCTGATGTT ACTGTAACTT TTTCGGTATT AAAAATAGGG CATGTTTTGT TTCCTGGTTG TGATTATTCT GGAAAAGTAC ATGTTGTAGA TATTGGCATT GATATTGATG ATAGTAAAGT TGTAATACGT AAAAATGCTC CTGCTTTGTG GAAACACAAA ATGCCTAAGT TGGAGTATAC ATCAAATAAG TATAATAGGG GATACACATT AGTGTGTTCT GTTGGTAATA AGTCTATAGG TGCTTCAAAA CTTGTTGCAA TGTCTGCTTT GAGAGTAGGT TCTGGTATAG TAAGTATTGC TTGTGACAGT AATGCTGTTG CATTTTATGC AAGTTGTCTA ACGTCGATTA TGTACAAGCT TTATGATGAT GTAATTAATG ATGATAGAAT TACATCTATT GTAATAGGAC CAGGATGTGG AATAAATGAT ATTACTAAAC AGCGTACTAT GGATATCCTA AATAAACAAA ATTGTGTTTT AGATGCTGAT TCTATCTCAG TATTTTCTGA TTCTTATGAA GTTCTTTTTT CCAAAATTCA ACATAATGTT GTTATGACAC CACATGAAGG AGAGTTTAAG CGTATATTTC CATTTTTAAC TGGTGGTAAA ATAGAAATGG CAAGGGAAGC AGCAAGTTTA TCGAAAGCTG TGATAGTGTT AAAAGGTCCG GATACTGTAA TTGCTGATCC TATAGGTAAT GTTGTAGTAA ATAATGCTCC ATTTAGTTTA GCTACAGCAG GTAGTGGTGA TGTTTTATCT GGAATTATTG GTGGATTATT ATCTTCTGGT ATGAGTCCAT TTGATGCTGC ATGTTGTGGA GTATGGATAC ATACAGAATG TGCAAGGAAG TATGGTATTG GTTTGATTGC AGATGATATA ATACTGGAAA TACCTCAAGT ATTAAAAAAA TTATTTTGTT AA
|
Protein sequence | MVILNGQQVV SFEKSCGVAI DELICRAGKA ISDVIFKLFP KQPVAIISGP GNNGKDGIVT AKILKAHGWP VVLMLYNCTT DIDEDWVVPL TYDNVVNCQS CLVIDALFGI GLSRNIPENL SCIFHYINDS NNKVVVAVDI PSGINCDTGQ VMGCAIRADV TVTFSVLKIG HVLFPGCDYS GKVHVVDIGI DIDDSKVVIR KNAPALWKHK MPKLEYTSNK YNRGYTLVCS VGNKSIGASK LVAMSALRVG SGIVSIACDS NAVAFYASCL TSIMYKLYDD VINDDRITSI VIGPGCGIND ITKQRTMDIL NKQNCVLDAD SISVFSDSYE VLFSKIQHNV VMTPHEGEFK RIFPFLTGGK IEMAREAASL SKAVIVLKGP DTVIADPIGN VVVNNAPFSL ATAGSGDVLS GIIGGLLSSG MSPFDAACCG VWIHTECARK YGIGLIADDI ILEIPQVLKK LFC
|
| |