Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0526 |
Symbol | |
ID | 3927366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 527303 |
End bp | 528790 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 637901649 |
Product | hypothetical protein |
Protein accession | YP_507341 |
Protein GI | 88658140 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGT TTTCGTTTGC TACTGCTTTT GTTTCTTTCT TGCTTTTGTA TAATGTTGAT GCTTTTTCAG TAGATAATAT AGATAAAAAT CATGAAAATC AGAAAAAACT AGTAAACCTT ATAAAAAATA ATGAATATGT AAAACAAGCA AGTTTGCATC CATCAATTAA GAGTTATCAT AGAATTGATC TTGGGGGAAA AGTATTGTCT TATGCTTGGT TTACTAATAA TTCAGAAAAA ATGCAAGATC ATGGAGTAAA ATTAGATGGT GTTTTAAATA TAAAATCTAT AAATAATAAT TCTGATCTTG GAATTTTCTA TGGTGGTAAT TTTCAATTGG CTATACCTGC TATGAAAAGT GAGAATTTTA TTCCTTCGAT GAAAGCATAT AATAGAGGAG CACAATTATT TGTTGAATCT GGTTATGGGA ATTTATCATT TGGATATCAA GAAGGTGTTG AGTCTATAAT GAAAATAGAT GCTTCTAGTA TAGGAGCTGG AGATAATAGT ATTGCTTGGT TACAATACAC AAACTTATCA AACCTTGATG GAAAAGTACA GTATCAAGTA TTTCCTGGAT TATATAGTGA AAGTGTGTTT AACAGGAGTA ATAATAATGT TATTTCTATT AAAGATAAGG ATTTTGTTAA TAATTTGCCA TTCAGGATAT CTTATCAATC TCCGAATTTT ATGGGTGTAA AATTTGGTAT TAGTTATTCT CCAACAGGTT ATGATAGTAA CTTATTCGAA AGTGTGAGTT CTTATAATAT TAAAAAATTA ACACTACCTC CTGTAATAAG TGCAGATACA ACTTCTGAGA TACAAGCAAA TCCTAATGAT AAAGATAATA TTTTATATAA TGCATTAGGT AAAGAAAAAG TTCAAGATGT CACTATAGAA GGTATAGTTC CGTCTAAAAT AGAGTTTTTG CAAGCACGTT ATGAAAATAT TGTAAGTGCT GGATTATCGT ATAATCATTC TTTTAATGAT ATTGATTTTC AAGCATCTGT TGTTGGAGAA TATGGATCTA CTGATATTGA TAAGTTAAAG TCATACTCAA AGTATCCATC TGCTGAAAAT TTAGCAGCTT TTGCTATTGG TACATCTGTT ACTTATCGTG ATGTTATAGT TGCAGGTTCT TATGGATATT TGGGAAAATC AGGTTATATT AATACGATTT ATTCTGCTAC TGAAGCTCCT TTAAAAATGT TTTCTCCTGA TAATCAGTAT ACTTATTATT GGAACATTGG TGCAAAATAT GTGTACAGTA ACGCTTCAAT TAGTACATCT TATTTTAGAA GTAATAAAGT TAATACTCAC TTCTATGACT TTAGTTTAGG TATTGATTAT AATTTATCTC TAAGTAGTAG TCATAAGGGA CAATACAAAG TTTTTGGAAA TTATCATTAT TTTAATATAG ATAATAAGAA TTTTAAAGTT TCACGTGATG GTAGTGTGCT ATTACTAGGT GTTAAGTATG AATTCTAA
|
Protein sequence | MKKFSFATAF VSFLLLYNVD AFSVDNIDKN HENQKKLVNL IKNNEYVKQA SLHPSIKSYH RIDLGGKVLS YAWFTNNSEK MQDHGVKLDG VLNIKSINNN SDLGIFYGGN FQLAIPAMKS ENFIPSMKAY NRGAQLFVES GYGNLSFGYQ EGVESIMKID ASSIGAGDNS IAWLQYTNLS NLDGKVQYQV FPGLYSESVF NRSNNNVISI KDKDFVNNLP FRISYQSPNF MGVKFGISYS PTGYDSNLFE SVSSYNIKKL TLPPVISADT TSEIQANPND KDNILYNALG KEKVQDVTIE GIVPSKIEFL QARYENIVSA GLSYNHSFND IDFQASVVGE YGSTDIDKLK SYSKYPSAEN LAAFAIGTSV TYRDVIVAGS YGYLGKSGYI NTIYSATEAP LKMFSPDNQY TYYWNIGAKY VYSNASISTS YFRSNKVNTH FYDFSLGIDY NLSLSSSHKG QYKVFGNYHY FNIDNKNFKV SRDGSVLLLG VKYEF
|
| |