Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0116 |
Symbol | |
ID | 3927607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 102482 |
End bp | 103588 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901240 |
Product | hypothetical protein |
Protein accession | YP_506944 |
Protein GI | 88658018 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.057396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACATG CTGCAGTACC TGGAGTGGTA GCTTCAGCAA ACGTTATTCC TGCTAAGCAT CTTGTAATTA GAGGAAAGGT TTTCAAACAT GTGAAGCGTT ATTCGATAGA GGAATATAAA TCTCAAATAA AAGAGTTTAG GGAATCTATA GCGTGTTTTG CAAGAATGCA TATGTCCTAT ATGTATCATA TGCTGCATAA TACGTTTGTT GTAAACAATG GAAGGATTAT GTTTAAGCCT GAAGTTGAAC AGTTTCTATT AGGAATAACC AGTAATATGA AGCTGTGTAC TTTTGTGATT AAGATAGGAA TAGTAGAGCA TGTTATGAGT AGGATTTGCA GGTTTTATGG TTCTGACAGC ATAAAGTATT GTGCGAGTCA TTACCGTGAT CCAAAGTTCA TAGATTCGAT ACTTGTTATA CTGCATGATG CATCACATTT TGATTTTTCA ACGATGTCAT ATCAAGTACG CAACAGTATG GCTAATTGTG TTAGGCGATA TAACATTACA AGTGTTTATG AGCTACATGA TAGTAGTTTT TACTCAGAAT TATTAAGTAT GTGTTATGAT TTTGTTCGTG CAAAGAGTAA TCAGAGTGTA CAATTTCAAG AATTATGTGA TTTTATAAAG TTGTCTTCTA CTGTGCAACT TGCGCAAATG TATCACATGA TACTAAAAAC CAAATGTTCT ACTGGAAATG AACAAGATAA TCTGCAAGGA CTTTTATTAC AAGAACGTAA TATAGAGAGT CTGATATATA GTTCTGTATT TTTTGGTAAG TATGCTTGTC GTGTAAGAAA AGCGTTTAGG CATTTATATG CTCCAAGTGA TAAAAACCCT GTACGTACAG TATCTGGGTT AAACATTCCA TATGTTATGA TTCAACTGAA TAGTAAGGGA ATTTTTGCAG AAATTGAACA TTGTGTAAAC GCAGAAAAAA TGGATTTTAA TGTTTTTGTT CGTGATATAG TGCGTTATAT CAACAAGCTA TTATCGTATC CACGTGAAGA AGGTTATATA AGAGCAGATA TAGGTAAATA TTGCGCTATG GTAAGTAGTC GATATAGTAC TATGGGAGCT GACATAGTTC CTTCTTCTCT TCATTGA
|
Protein sequence | MQHAAVPGVV ASANVIPAKH LVIRGKVFKH VKRYSIEEYK SQIKEFRESI ACFARMHMSY MYHMLHNTFV VNNGRIMFKP EVEQFLLGIT SNMKLCTFVI KIGIVEHVMS RICRFYGSDS IKYCASHYRD PKFIDSILVI LHDASHFDFS TMSYQVRNSM ANCVRRYNIT SVYELHDSSF YSELLSMCYD FVRAKSNQSV QFQELCDFIK LSSTVQLAQM YHMILKTKCS TGNEQDNLQG LLLQERNIES LIYSSVFFGK YACRVRKAFR HLYAPSDKNP VRTVSGLNIP YVMIQLNSKG IFAEIEHCVN AEKMDFNVFV RDIVRYINKL LSYPREEGYI RADIGKYCAM VSSRYSTMGA DIVPSSLH
|
| |