Gene Information |
Locus tag | ECH_0830 |
Symbol | |
ID | 3927773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 844398 |
End bp | 845549 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 637901947 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_507626 |
Protein GI | 88657903 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
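
The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed values) and that the unspliced CDS ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(844398, 845549)
print(length)                  # 1152
print(protein_length(length))  # 383
```

Both results agree with the Gene Length and Protein Length rows of the record.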
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACAT CAATGCTAGA AATTCAAGAA TCACTACATA ATCTTACATC TTCATGGGAA GAGTTTAAAA ATCTAAATGA AAAAAAATTA TCAGATATAA GTAAAAAAAG TTCTACATAC CCATCCATAT TAGAAAAATT AGAAAAAACA GAAAGCATGC TAGAAATACA AAATGAAAGA TTGGATCAAT TAGAAATATC ATCACAGCGC CCTGTTACCC ATGATAACAA ACACTATAAT GAAGATTATA ATTATAAAAC GTTCACACAA TACCTATGTA AAGGCACTAA TCTTCAAGGT CAAAGTAGTG TTGTGCAGCC CTTAGATGAA TCGTACTTTA TACCACATAA CATATCATCT TATATAGAAA CAAATCTAAC TAAGAACTCA GTAATGCGTC AATTATGTTC TATAGAAAAA ATATCTGGAG ATAGTTTAGA CTGCTTTATC AGAAACAACA ATGAAAAGTC TGTTAACTGG AGAACAAGCA ATAACGTAGA AGATACAGAA TCACCAAAAA TTGATAAAAT TACTATCAAT TTACATGAAT TATATACTCA ACCAAAAATA ACAAAGAAAT TACTAGAAGA TTCTACTATA GATGTAGCCA GCTGGTTAAT TAATCACTTA GTTGATGATT TCAGCAGAGC AGAAAATACC GCTTTTATAT CTGGTGATGG AAACAACAAG CCATATGGTA TCTTAACATA TGCTTCAGAC ATAGAAAATA ACACCATAAC ATCTACAACA CTAAATAGTG ATATAATTAT CAAACTTTAT TATTCACTTG ATGAATATTT CTCCAGAAAG GCAGCATTCA TAATGCACAG AAGTGTACTA CAAGAAATTA GATCTTTAAA GTTAGCATCT GGCCAATATA TATGGCATCC AGGACTAACA TCAGGAAGTC CTGACACATT AATGGGGTTA CCAGTATATC AAACATCCGA TATGCCTCAA TTAGACAATA AAACATTACC AACCATAGCA CTTGCAGACT TTAAAAATGC ATATAAGATA GTAGAAAACC GTTCAATTAA AACATTAAGA GATCCTTTTA CAAGCAAACC CTTTGTCAAG TTTTATACAA CAAAACGAGT AGGAGGAGCA GTCATTAATA AAAATGCTAT AAAATTTTTA ACAATAAAAT AA
|
Protein sequence | MTTSMLEIQE SLHNLTSSWE EFKNLNEKKL SDISKKSSTY PSILEKLEKT ESMLEIQNER LDQLEISSQR PVTHDNKHYN EDYNYKTFTQ YLCKGTNLQG QSSVVQPLDE SYFIPHNISS YIETNLTKNS VMRQLCSIEK ISGDSLDCFI RNNNEKSVNW RTSNNVEDTE SPKIDKITIN LHELYTQPKI TKKLLEDSTI DVASWLINHL VDDFSRAENT AFISGDGNNK PYGILTYASD IENNTITSTT LNSDIIIKLY YSLDEYFSRK AAFIMHRSVL QEIRSLKLAS GQYIWHPGLT SGSPDTLMGL PVYQTSDMPQ LDNKTLPTIA LADFKNAYKI VENRSIKTLR DPFTSKPFVK FYTTKRVGGA VINKNAIKFL TIK
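
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal Python sketch of how such blocks might be joined, translated, and checked for GC fraction. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), and all function names are hypothetical. Only the first three blocks and the final twelve nucleotides of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table built in the conventional TCAG order (third base varies fastest).
# Assumption: table 11 shares the standard code's codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = join_blocks(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = join_blocks(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


head = "ATGACAACAT CAATGCTAGA AATTCAAGAA"  # first three blocks of the gene
tail = "ACAATAAAAT AA"                     # last twelve nucleotides of the gene

print(translate(head))  # MTTSMLEIQE
print(translate(tail))  # TIK*
print(gc_fraction(head))
```

The translated fragments match the first block and the last three residues of the listed protein sequence. The GC fraction of a 30-nucleotide fragment is not representative of the whole gene, so the full sequence would be needed to compare against the GC content row.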