Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0033 |
Symbol | |
ID | 3927815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 28890 |
End bp | 30071 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637901158 |
Product | HK97 family phage portal protein |
Protein accession | YP_506866 |
Protein GI | 88658099 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.13061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTCT TTAAAAAAAA GTCAATACAA GACAATTCTT ACACATTCTC AGTTCCCATA CAACTATTTA CAGAGGCTGT ATGGAAAAAC AGAAGCTATG CAAATTTCGC AGAAAATGGC TACATAAAAA ACGTAATTGC TTTTAGATCT ATTCACATGA TTGCATCAGC TGCAGCATCT GTTTCTTTAT TACTAAATAA GACTATCAAA AACAACACAT TCCAAATAAA AAACCATCCT TTATTAAAAT TAATATCTAA ACCAAACAAC ACCACCTCAA AATCAGAATT CATCGAAGGA ATTCTTACTT ATAAACTTAT TAGCGGTAAT GCTTATATTT TGACAATAGA AAATCATGAT ATGATTCCCA AAGAATTACA TCTTTTGCGG CCAGATAGAA TTGAGATTAC TCCAGGAAAA GATAATAGGC CATATTCATA TCGCTATTCT ATAAATAATT ATCACTATGA CTATAAGATT AATAAATCAA CAAATTATTC ACAAATCCTA CACATAAAAA ACTTTCATCC ACTTAATGAT TATTATGGAT TATCCCCTAT AGAAGCAGCT TCATACAGTA TAGATCAACA TAACCAGGCT GGATCTTGGA ACCAAGCAAT GCTACAAAAT GGAGCTAGAC CAAGCGGGGC ATTAATTGTA AACGCAAAAA GTAACAACAA TGGTAATTTA ACACAAGAAC AATACACTCG TTTAAAATCA CAAGTAGATG AATTCTATTC TGGTCCAAGA AATGCTGGAA GACCAATATT ACTTGAAGGG GGATTAGATT GGAAAGAAAT GAGTTTATCC CCTAAAGATA TGGATTTTAT CGAATCAAAG CATAGTTCAG CACGTGACAT TGCATTAGCA TTTGGTGTAC CACCCCAATT GCTTGGTATT CCTGGAGATA ATACCTATAA TAATCTTATC GAAGCAAGGC TTTCACTATG GGAACAAACA ATATTACCTC ACTTAGACAA TATTATTTCA CATTTTAACA ATTGGTTAAT ACCCAGGTTC GGAAGCAATA TGTTTTTATC ATATGATAAA GATTCCATCT CTGTATTAAC AGAAAAAAGA AAACAGCTCT GGCAATACGT AGAAAATGCA ACTTTCATGA CTATCAATGA AAAAAGAGCA GCTTTTGGGT TACCACCAAT AGAGAATGGA AATACTCTAT AA
|
Protein sequence | MNFFKKKSIQ DNSYTFSVPI QLFTEAVWKN RSYANFAENG YIKNVIAFRS IHMIASAAAS VSLLLNKTIK NNTFQIKNHP LLKLISKPNN TTSKSEFIEG ILTYKLISGN AYILTIENHD MIPKELHLLR PDRIEITPGK DNRPYSYRYS INNYHYDYKI NKSTNYSQIL HIKNFHPLND YYGLSPIEAA SYSIDQHNQA GSWNQAMLQN GARPSGALIV NAKSNNNGNL TQEQYTRLKS QVDEFYSGPR NAGRPILLEG GLDWKEMSLS PKDMDFIESK HSSARDIALA FGVPPQLLGI PGDNTYNNLI EARLSLWEQT ILPHLDNIIS HFNNWLIPRF GSNMFLSYDK DSISVLTEKR KQLWQYVENA TFMTINEKRA AFGLPPIENG NTL
|
| |