Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0525 |
Symbol | |
ID | 3927551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 524844 |
End bp | 526844 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637901648 |
Product | hypothetical protein |
Protein accession | YP_507340 |
Protein GI | 88657870 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTA CTTTAGTAGC ATCTGCTTTA GCTTCTTTTT TTGTGTTATA TGGAAATATT TTATTTTCAG CAGATATGTT AGGTGATGTT GGTATTTTTA ATCCTAATGA TGGAAATAAC AAACAATCTA ATGGTATTAA TGGAAAAAAT CTTTTTAAGA CAAAGGATTC AGGAAATGGC GAGTATGATC GTTCATCAAA AGTAGGTCGT GCTTTAGTAA GTGGGCAAGC TATATCTTAT ATGTGGGCTT CATCAGATGA AAAGAAAAGT TTCTATGATA TATCTGGTAG TGCAGATGAT TGGGGTATGA ATTGTGATGC TATATTACGT TTAGGGGCTG AAATAAAATC AAATGATTCT GGTGTTAAAT ATGGTGCAGA TTTTCAGATA GCTATACCTC ATGTTCAAGG GAAAAATTTT GAAAAAAAAG CTGCTTTAAA TAGAGGATCT AGAATATTTG CTTCTACTCC ATATGGTGAT TTTTCAATGG GGTATCAAGA AGGAGTAGAA TCTATGATGA AGATAGATTC TTCTAATATA GTGGCTGGAG ATGAAAGTAG CAGTTGGACA CAGCATCTGA GAGGTGTATT ATCTGAAAAA AAGAATGCTT TAGGGTATAC AATGTATCCA TTTCTTTTTT CTGCTGGATT ATATAGTGAA AACGTATTTC GGAATAATGA TAATATGACA TCTACTGTGG ATGGTAGTAA GGATTTTATT AATAATTTAC CTTTTAGGAT ATCCTATCAA TCTCCTAATT TTATGGGATT GAGGTTTGGT TTTAGTTATT CTCCTTTAGG TTATAAATTT CAGCATTTTG GTAGAACACT TGATATATAT TCTGTTAAGG CAAACAAAGT GGTTAATCAA CTTCCTACTT TTGAAGTAAA GTTACCATCT GCTACTGTCA ATTTAAGTGA ATTAGTTAAA AAGTTGGGAA CAGCTAAATT AGATGCTGAA TTAGAGTTGT CTAAGCAGCA AGAACTGGAT AAAAATGCTC TTCATATTAA GTTTAAGGAC AAAGAAGAAG GATATCTACC ATGTAAAGGT AAAATTGAGT TACTTGGTGT GCCAGGAGCA GGAGTGTCTT CTCATGATCC TATTGAATTT GGTATTGAAG TAGATAGTAT AGAAGAAGTT AAGATCCCTA TATCTCAGAT TGAGTTAAAA AACTTGACAA TTACATTACC AGATTTAGAA AAGTTTGACA GTAACTTACC AGTTATTCAA AAAGGAGATA CTCAAGTCAC TAAAAAAGGG GAAAGGGAAT CTATAGTAAC AAAACGTTCC AAAAATAAAT CAGATCAGGT GTTTTTTGGT GCAAAATATG AACATATATT AAGTGGTAGT ATAGCTTATA GCTATGATTT AAGTAATGGC TTTAAGTTTA GTACTTCTCT TGTAGGTGAG TATGCTCATC CTAGGCTACA TTTTAATACA CAAAATTATG ATATTTATCC GGAAAACTAT AATTTACAAG GTATATCTAT TGGTTCTCTA TTGAGTTATA GTAATGTTAG CTTTGCTATT GCTTATGGGT ATTTGGGGCA TTCTGGTTTT GCAAAGCACT ACATTCTGCA TAAAAAGACA GCACGAAATG GGGAATCTAA TGCTATATAT ACAATGTGTG AAAGGTCTAA CACTTATTAT TGGGATATAG CTTTTGGGTA TCAATATAAA TCTTCTAATA TTAGTGTTAC ATATTTTAAG AGTAATAGGA GTAGTAATAT ATTACAGGAT ATTAGTTTAG GTGTTGAGTA TCATTTATTA AAAGAGCAAA GTAAAATGAA GTGCAAATTA TTTGGAAATT ATCACCACTA TAAATTTTCG GAAATTACTA TTGCTGTAGA TAATGTTCGT TATGACGATG GTTATGATAA TAGTGCAAAA CCAGGTATTT CTCCTGTCAT AGGTACAGGA CTTGATAAAA GTAATGTTAT CAGAAATGCA AGTAATGGAT CAGGTAACAT ATTTTTAGTA GGTGCTAAAT TAGAGTTTTA A
|
Protein sequence | MKFTLVASAL ASFFVLYGNI LFSADMLGDV GIFNPNDGNN KQSNGINGKN LFKTKDSGNG EYDRSSKVGR ALVSGQAISY MWASSDEKKS FYDISGSADD WGMNCDAILR LGAEIKSNDS GVKYGADFQI AIPHVQGKNF EKKAALNRGS RIFASTPYGD FSMGYQEGVE SMMKIDSSNI VAGDESSSWT QHLRGVLSEK KNALGYTMYP FLFSAGLYSE NVFRNNDNMT STVDGSKDFI NNLPFRISYQ SPNFMGLRFG FSYSPLGYKF QHFGRTLDIY SVKANKVVNQ LPTFEVKLPS ATVNLSELVK KLGTAKLDAE LELSKQQELD KNALHIKFKD KEEGYLPCKG KIELLGVPGA GVSSHDPIEF GIEVDSIEEV KIPISQIELK NLTITLPDLE KFDSNLPVIQ KGDTQVTKKG ERESIVTKRS KNKSDQVFFG AKYEHILSGS IAYSYDLSNG FKFSTSLVGE YAHPRLHFNT QNYDIYPENY NLQGISIGSL LSYSNVSFAI AYGYLGHSGF AKHYILHKKT ARNGESNAIY TMCERSNTYY WDIAFGYQYK SSNISVTYFK SNRSSNILQD ISLGVEYHLL KEQSKMKCKL FGNYHHYKFS EITIAVDNVR YDDGYDNSAK PGISPVIGTG LDKSNVIRNA SNGSGNIFLV GAKLEF
|
| |