Gene Information |
Locus tag | ECH_0504 |
Symbol | engA |
ID | 3927981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 506393 |
End bp | 507721 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 637901627 |
Product | GTP-binding protein EngA |
Protein accession | YP_507319 |
Protein GI | 88657837 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA |
|
|
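The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted as an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 506393, End bp 507721.
print(cds_lengths(506393, 507721))  # (1329, 442)
```

Under these assumptions the result agrees with the reported Gene Length (1329 bp) and Protein Length (442 aa).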
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.696313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAAAA TAGCTATAGT TGGGTTACCT AATGTTGGAA AATCTACTAT TTTTAATAGA TTAACTAGTC AAAAATCTGC AATTGTTAGC AATATTCCTA ATTTGACAAG AGATAGAAGG GAAGGGGATG CTGATTTATG TGGATTGAAA TTTAAGATTG TTGATACGGG TGGAGTGGAT AATACAATAA AATTATCTTC ATTAGTACTC GATCAAGTAA AATTATCTAT AGAAGAGTCT GATATAATAT TTTTCGTTGT TGATGCAAGG ATTGAACATG ATGATAAAAA TATAGAGTTT GCAAAATATT TAAGAAAGAA GGTAAAAAAA CCTATAATAT TAATTGCGAA TAAATGTGAA AGTCAAAAGA GATGTTATAG TATAGATTAT TTAGGGTATT TTGATTTTAT AGGTCCAGTT TATATTTCTG CAGAACATAA TTTAGGTTTA GTGGATTTAT ATGAAGCGTT GTTGCCATTT ATTAAAGAAT ATGATTTAAA TACATTAGAT TTACATAATA TTAAGCTTTC TATTGTTGGG AGACCAAATG CTGGTAAATC AACTTTTATT AATAGGTTGC TTGCTGAAAA TAGAATGATT GTAAGTCCTG AACCTGGAAC AACTAGAGAT TCTATAGATG TAGAATATAC ATATAGAGGT CAGAAATTTA CATTGATTGA TACTGCTGGC ATGCGTAAAA AAGCAAAGGT AACTGAAAGT ATAGAAGTGA CTTCTGTTCA CAAGACTATT GAGTCAATTA ATAGGTCTGA TATTGTGATT TTGATGATAG ATTCTGTTTA TGGTATAGAG CAACAAGATT TATCTATTGC TGAACTTGCT ATACAAAAAG GTAAAGCTAT TATTATTGCT TTAAACAAAT GGGATATGAT TGCTAAAAAA GATAGATCTG AGTTGCTAAA AGATATTTGT AATTATAATA AATTGAATTT TAAAGTTCCG GTTATTGAAG TTTCTGCACT TAAAAATATT AATTGTAATA AAATAATAGA AACAAGCATA GAGCTATATA AATGTCTGAC AATGCGTATT AGTACTTCTG TGCTGAATAA ATGGTTAAAA TTAGCTGTAG AATACCATAA ACCTCCATTG TGTAATGGCA AGGTAGTGAA GATCAAATAC ATTACACAAG TTAAAGTAAT ACCTCCAACT TTTATTGCAG TTGTTAATGG TTCTTATAGT ATTGATTTAA CTTATAGGCA ATATTTAATG AGTAGTTTGA GGAAGCATTT TTCTATTGAT GGGATTCCAA TAAGGTTAAA TTTTAAAAAG AATAAGAATC CATATGATGA TAGTTATTCT AAATCTTAA
|
Protein sequence | MLKIAIVGLP NVGKSTIFNR LTSQKSAIVS NIPNLTRDRR EGDADLCGLK FKIVDTGGVD NTIKLSSLVL DQVKLSIEES DIIFFVVDAR IEHDDKNIEF AKYLRKKVKK PIILIANKCE SQKRCYSIDY LGYFDFIGPV YISAEHNLGL VDLYEALLPF IKEYDLNTLD LHNIKLSIVG RPNAGKSTFI NRLLAENRMI VSPEPGTTRD SIDVEYTYRG QKFTLIDTAG MRKKAKVTES IEVTSVHKTI ESINRSDIVI LMIDSVYGIE QQDLSIAELA IQKGKAIIIA LNKWDMIAKK DRSELLKDIC NYNKLNFKVP VIEVSALKNI NCNKIIETSI ELYKCLTMRI STSVLNKWLK LAVEYHKPPL CNGKVVKIKY ITQVKVIPPT FIAVVNGSYS IDLTYRQYLM SSLRKHFSID GIPIRLNFKK NKNPYDDSYS KS
|
| |
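The sequences above are displayed in space-separated blocks of ten characters. The sketch below is one possible way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is an illustration under stated assumptions, not the database's own method: the helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, the codon table is built from the standard assignments (which translation table 11 shares for elongation codons), and alternative start codons are not handled. That last point does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same amino-acid assignments and differs only in permitted starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for 10-character display grouping."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGTTGAAAA TAGCTATAGT TGGGTTACCT")
print(translate(fragment))  # MLKIAIVGLP
```

The translated fragment matches the first block of the protein sequence (MLKIAIVGLP). Passing the full gene sequence through the same functions would be the way to check the complete translation and the reported GC content.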