Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0121 |
Symbol | |
ID | 3927246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 107054 |
End bp | 108160 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901245 |
Product | hypothetical protein |
Protein accession | YP_506949 |
Protein GI | 88658059 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACATA CTGCAGTACC TGGAATGGTA GCTCCAACAA GCGTTATTTC TGCTAAGCAT GTTGTAATTA AAGGACTTGT TTACAAGCAT GTGAAGCATT ATTCGATAGA GGAATATAAA TCTCAAATAA AAGAGTTTAG GGAATCTATA ACGTGTTTTG CAAGAATGCA TATGTCCTAT ATGTATCATA TGCTGCATAA TACGTTCGTT GTAAGGAATG GAAGGATTAT GTTGAAGTCT GAAATTGAAC AGTGTCTATC AAAAATAACC AGTAATATAA GGCTGTGTGC CTTTGTGATT AAGATAGGAA TAGTAGACCA CGTTATGAGT AGGCTTTGCA GGTTTTATGG TTCTGACAGC ATAAAGTATT GTGCAAGTCA TTACCATGAT CCAAGGGTTA TAGATTCGAT ACTTATTGGG TTATATGGTG CGTCATATTC TGATTTTTCA AGGATGTCAT ATCAAGTACG TAGTAATATA GTTTATTGTG TTGGAAAACA TGGTATTGCA GGTGTTTTTA AGCTACATAA TAGTGGTTTT TACTCAGAAT TATTAGGTAT GTGTTATGAT TTTGTTCATG CAAGGGGTAA GGGTGTAAAA TTGCAAGAAT TATGTGATTT TATGAAGTTG TCTTGTAGTA TACAACTTGG GCAAATGTAT CACATGATGG TAAAAGTCAA ATGTTCTATT GGAGATGAGC AAAGTGATAT ACGGAAACTT GTATCCCAAG AATGTAGTGT AGGGTATCTA GTATATCGTT CTTTACTTTT TGGTAGGTAT GCTTATCATG TAAGAAAAGC GTTTAGGCAT TTATATGCTC CAAGTGATAA AAACCCTGTA CGTACAGTAT CTGGGTTAAA CATTCCGCAT AGTCTAATTC GACTAAATCA TAGAGGAATT TTTACAAAAA TTGAACATTG TATAAACGCA GAAAAAATGA GTTTTAATGT TTTTGTTGTT GATATAGTGC GTCATATTGA CAAGCTATTA TTGCATCCGC GTGAAGAAGT TTATATAAGA GAAGATATAA GTACATATTG CGCTATAGTG AGTAGTAGAT ATAGTACTAT GGGGCCTGAC ATAGATTCTT CTTATCATAT ATTGTAG
|
Protein sequence | MQHTAVPGMV APTSVISAKH VVIKGLVYKH VKHYSIEEYK SQIKEFRESI TCFARMHMSY MYHMLHNTFV VRNGRIMLKS EIEQCLSKIT SNIRLCAFVI KIGIVDHVMS RLCRFYGSDS IKYCASHYHD PRVIDSILIG LYGASYSDFS RMSYQVRSNI VYCVGKHGIA GVFKLHNSGF YSELLGMCYD FVHARGKGVK LQELCDFMKL SCSIQLGQMY HMMVKVKCSI GDEQSDIRKL VSQECSVGYL VYRSLLFGRY AYHVRKAFRH LYAPSDKNPV RTVSGLNIPH SLIRLNHRGI FTKIEHCINA EKMSFNVFVV DIVRHIDKLL LHPREEVYIR EDISTYCAIV SSRYSTMGPD IDSSYHIL
|
| |