Gene Information |
Locus tag | ECH_0087 |
Symbol | |
ID | 3927009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 76933 |
End bp | 78783 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901211 |
Product | hypothetical protein |
Protein accession | YP_506917 |
Protein GI | 88658224 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
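The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes 1-based inclusive coordinates and a single terminal stop codon that counts toward the gene length but not the protein length. The helper name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with one stop codon that is counted in the
    gene length but not translated into an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = expected_lengths(76933, 78783)
print(gene_len, protein_len)  # 1851 616
```

Under these assumptions the result agrees with the record's Gene Length (1851 bp) and Protein Length (616 aa).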
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.616781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAATAA TGCGTAGAAT GGCTATAGCA AGGTATACTA AGCCCCTTAT ATTTAGTAAA ATATTAAACT TTGATCCATC ATTTATTAAC ATGTTGGAAT CTTTAGAAGC AGAAGACATC CTCTTGTTAA AAGATGAACT TGGTTCGGTA TTTTATTATT TAATCATGTC ATTGTATGAT GTATCTGGTA TTCAAAATGA TTCTAGTGGT GCTATAAGAG AACGCCTTAA AAGGGTTTTG TTGGATGCAA TTGAGCAATC TAATTCAGAT CAAAGTGGAG CTGTAGGTTC TCAAGTTAGT GAAATAAGAG AAATTAGAGA TAGGTTAAGA TCTGCATACC AATCCATTTC TAATAGAGTA TGCAGAACTT TAATAAGTGC ATTGGGTAAT GAAACTCTGC AAGAGGAGAC AGACTCAAAT ATTACACAGC AAAGTAATCT ACAACGCAGG TTACAAATGT TTCAGTATGT AATACAAGCT GTGGAAATAT TGGCTGATAA ACTTAATACA GCTGTTGTGG AGGGTAAGGT TTCGCCTGAA CAAGTTTCTG AATACTTGTC TTGTACAAAT GAATCTCATG ATTCTATTGC TCCTGATACT ATGAGTGCTT TGGTTAAGTT ATATTCCTTA ACGGATGATC CTCAACTTAC AGATTTAGCG TATTCATTGG TAAAAACATT CTTGATGAAG TTTGGCCGAT ATAAGTTGGA TGGTCAAGGT AGGAATTCTA TGCATTATGC TGTAAACATG TGTAGTCCGG AAAGGCAAGA AAGTTTTTTG TGTGAGATGA TACAGCCATC AATATATGAA CGGGTTTCAA TAGTTAATGA GGTGGATGCA TCTCGCAATA ATCTAATGCA TTATGCAGCA TGTGCTCCAT ACATGAATTA TCAAATTTTA AAGTATTTAG TAAAAAATTT TCCTGCAATG ATGACACAAC AGAATTGCTA TGGAGATACT CCATTACATA TTATGTCATA TGTATATTTT GTTAATTTTG CAAAAATACT GTCATCTTAT AACATTACAT ATAGAGAGAA TATGAATGCT TTAAAGGAGG TTGTAGATCG TGGGTTGCCT CTTTCTCAGA TGAGAGAAAG AGTTATGTCA ATTAGAAGAA ATGATGAAGC CTTATCAAGA CAACTTAAAG CTTACGTAGA TGAGAGTGTA GGAACTTATC AATTGTTGCT CACTATGGTG CCTTTGCGGC AGATATTTGA AGTAAGAAAT AATGCAGGGC ATACTGTATA TGATATAATG AATGCTAGTA TGAGCAACAT AGGCAATGAG CGTCTTGAGG CTTTATTGCA AGATTTTTCC CAAGCAAGCA GTAGATTACC TATATATGAT TGTAGGATTG ATTCACAGCA TGAACTATGT GTGAATTTAT GTTTTTCAAA TAAATATAGA GTTGTTGGTA AAAGTGGATA TGGACATGTA TTGTATACGC ATGTGAAGCG TATGTATGAC TTGATATCCT ATAAGTTTTC TGAGATCTCT GATTCTAGAA TGAGATGCAT AAAGATTGAA AATGAGCGTG GTAACAACAG GTATTTAAGT ATGCTTGTTG TTATGATGTT AATGGCTTTG TGTGTTCTAA ATACCGTTTT ACATTTTAAA ACAAGATCTA TTTTAGGTAT TGAGCAAGGC TTATATAGAT CTGTATTATT TTCAGCGATA TCTGTGGTTT TCTTTGTGAG TATTTGTGTA TTTTGTATTG TATATGCTAA ATATGTTGAT GTTGCTGATA AGAAGCTAAT AATCGAGGAA GAAGGGTATG CACGAAGTAT ATTATTATCA 
CATTTAGATG TTCAAGAAAC AGATACTAGT CAGAGAAGAA GAGAAGGTTA A
|
Protein sequence | MLIMRRMAIA RYTKPLIFSK ILNFDPSFIN MLESLEAEDI LLLKDELGSV FYYLIMSLYD VSGIQNDSSG AIRERLKRVL LDAIEQSNSD QSGAVGSQVS EIREIRDRLR SAYQSISNRV CRTLISALGN ETLQEETDSN ITQQSNLQRR LQMFQYVIQA VEILADKLNT AVVEGKVSPE QVSEYLSCTN ESHDSIAPDT MSALVKLYSL TDDPQLTDLA YSLVKTFLMK FGRYKLDGQG RNSMHYAVNM CSPERQESFL CEMIQPSIYE RVSIVNEVDA SRNNLMHYAA CAPYMNYQIL KYLVKNFPAM MTQQNCYGDT PLHIMSYVYF VNFAKILSSY NITYRENMNA LKEVVDRGLP LSQMRERVMS IRRNDEALSR QLKAYVDESV GTYQLLLTMV PLRQIFEVRN NAGHTVYDIM NASMSNIGNE RLEALLQDFS QASSRLPIYD CRIDSQHELC VNLCFSNKYR VVGKSGYGHV LYTHVKRMYD LISYKFSEIS DSRMRCIKIE NERGNNRYLS MLVVMMLMAL CVLNTVLHFK TRSILGIEQG LYRSVLFSAI SVVFFVSICV FCIVYAKYVD VADKKLIIEE EGYARSILLS HLDVQETDTS QRRREG
|
| |