Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0237 |
Symbol | |
ID | 3927133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 223716 |
End bp | 225119 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637901361 |
Product | hypothetical protein |
Protein accession | YP_507058 |
Protein GI | 88658494 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGTAAAA AAATTAAGAA GGTATTGATT TCCGGTAAAA GTGTTTGGCC AATTATAGAA GGTGGTAAAG GTATAGGTGC GAGTGATGGT AGAACTGCTG GTGCTTTTGC TGCTGCTTCT GCTGTAGGTA CTTTTTCTGG AGCATGTGCT AGATTAGTTG ATGATAATGG GGAGCATGTT CCTTTAATAT ATCGTGGAAA GACTAGGTTA GAACGCCATA ATGAGCTCAT AAACTATAGT ATTGATGCAG CAGTGAGTCA GGCTAGAATG GCTCATGAGA TATCAAAGGG ATTGGGAAGA ATACATATGA ATGTACTGTG GGAAATGGGA GGTGTTCAGC GGGTACTTCA TGGTGTGCTT GATAAGGCAA AAGGATTAAT CCATGGTATT ACTTGTGGTG CTGGTATGCC ATATAAGTTA GTTGAAATAG CTGCTCAATA TCAAGTTTAT TACTATCCTA TTGTTTCTTC AATGAGAGCA TTTAAAATTT TGTGGCAACG TTCTTATCAG AAATTTTCAA AAACACTTCT TGGTGGAGTT GTATATGAAG ATCCTTGGTT AGCTGGTGGA CATAATGGAC TTAGTAATAG TGAATCTCCT GGTCATCCAC AAGATCCTTT TGAAAGAGTT GCAGCAATTC GCGCATACAT GAATGAAGTT GGATTATCTG ATGTTGTATT AATTATGGCA GGTGGCGTTT GGCATTTGAA GGACTGGGAA TCATGGTTAG ATAATGATTT AATTGGTCCA ATAGCATTTC AGTTCGGGAC CAGACCTTTA TTAACTCAGG AAAGTCCAAT TTCTCCTGGG TGGAAAAAGA AGTTGATGTC TTTGAAACCT GGTGATGTGT TTTTAAATAG ATTTAGTCCT ACAGGGTTTT ATTCTTCTGC GATTGAAAAT GAGTTTATAA AAGAGTTACA AGCACGTAAT ACACGTCAGA TTGCATTTGA AAATGAGATG ACTGAAAAAT GCAGTGCAGA GCTTTCTATT GGTAGTAGAG GTAGAAAAGT ATATGTAGAT CCTAAGGATA AAAAATTGTC CGAGTCTTGG GTAGCAATGG GTTATACAGA TGCTTTAAAA ACTCCCGATA ATACTTTAAT ATTTGTTAGT CAGAGTCAGT CAAGAAGTAT TAGGGAAGAT CAAATAAATT GTATGGGATG TCTAAGTCAT TGCAAATTTA GTAATTGGAA AGATCACGGG GATTATACTA CAGGTATTAA ACCAGATGTT CGTAGTTTTT GTATTCAGAA AACGTTACAA AATATTATTG CTGGGGTAGA CCATGAACAT GAGCTTATGT TTTCTGGCCA TAATGCATAT AAGTTTGTAC AAGATGAGTT TTATAGAGAT GGTTATATTC CTACAATCAA GGAGCTTGTT GATAGAATTT TGACTGGATA TTGA
|
Protein sequence | MRKKIKKVLI SGKSVWPIIE GGKGIGASDG RTAGAFAAAS AVGTFSGACA RLVDDNGEHV PLIYRGKTRL ERHNELINYS IDAAVSQARM AHEISKGLGR IHMNVLWEMG GVQRVLHGVL DKAKGLIHGI TCGAGMPYKL VEIAAQYQVY YYPIVSSMRA FKILWQRSYQ KFSKTLLGGV VYEDPWLAGG HNGLSNSESP GHPQDPFERV AAIRAYMNEV GLSDVVLIMA GGVWHLKDWE SWLDNDLIGP IAFQFGTRPL LTQESPISPG WKKKLMSLKP GDVFLNRFSP TGFYSSAIEN EFIKELQARN TRQIAFENEM TEKCSAELSI GSRGRKVYVD PKDKKLSESW VAMGYTDALK TPDNTLIFVS QSQSRSIRED QINCMGCLSH CKFSNWKDHG DYTTGIKPDV RSFCIQKTLQ NIIAGVDHEH ELMFSGHNAY KFVQDEFYRD GYIPTIKELV DRILTGY
|
| |