Gene ECH_0116 details

Gene Information

Locus tag: ECH_0116
Symbol:
ID: 3927607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 102482
End bp: 103588
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 34%
IMG OID: 637901240
Product: hypothetical protein
Protein accession: YP_506944
Protein GI: 88658018
COG category:
COG ID:
TIGRFAM ID:
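
The length fields above follow from the coordinates by simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the reading under which the listed 1107 bp is reproduced) and that the final codon is a stop codon that contributes no residue. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record; the names below are hypothetical.
START_BP = 102482
END_BP = 103588


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1107
print(protein_length(gene_length(START_BP, END_BP)))  # 368
```

Both results agree with the Gene Length and Protein Length fields listed above.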


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.057396
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACATG CTGCAGTACC TGGAGTGGTA GCTTCAGCAA ACGTTATTCC TGCTAAGCAT 
CTTGTAATTA GAGGAAAGGT TTTCAAACAT GTGAAGCGTT ATTCGATAGA GGAATATAAA
TCTCAAATAA AAGAGTTTAG GGAATCTATA GCGTGTTTTG CAAGAATGCA TATGTCCTAT
ATGTATCATA TGCTGCATAA TACGTTTGTT GTAAACAATG GAAGGATTAT GTTTAAGCCT
GAAGTTGAAC AGTTTCTATT AGGAATAACC AGTAATATGA AGCTGTGTAC TTTTGTGATT
AAGATAGGAA TAGTAGAGCA TGTTATGAGT AGGATTTGCA GGTTTTATGG TTCTGACAGC
ATAAAGTATT GTGCGAGTCA TTACCGTGAT CCAAAGTTCA TAGATTCGAT ACTTGTTATA
CTGCATGATG CATCACATTT TGATTTTTCA ACGATGTCAT ATCAAGTACG CAACAGTATG
GCTAATTGTG TTAGGCGATA TAACATTACA AGTGTTTATG AGCTACATGA TAGTAGTTTT
TACTCAGAAT TATTAAGTAT GTGTTATGAT TTTGTTCGTG CAAAGAGTAA TCAGAGTGTA
CAATTTCAAG AATTATGTGA TTTTATAAAG TTGTCTTCTA CTGTGCAACT TGCGCAAATG
TATCACATGA TACTAAAAAC CAAATGTTCT ACTGGAAATG AACAAGATAA TCTGCAAGGA
CTTTTATTAC AAGAACGTAA TATAGAGAGT CTGATATATA GTTCTGTATT TTTTGGTAAG
TATGCTTGTC GTGTAAGAAA AGCGTTTAGG CATTTATATG CTCCAAGTGA TAAAAACCCT
GTACGTACAG TATCTGGGTT AAACATTCCA TATGTTATGA TTCAACTGAA TAGTAAGGGA
ATTTTTGCAG AAATTGAACA TTGTGTAAAC GCAGAAAAAA TGGATTTTAA TGTTTTTGTT
CGTGATATAG TGCGTTATAT CAACAAGCTA TTATCGTATC CACGTGAAGA AGGTTATATA
AGAGCAGATA TAGGTAAATA TTGCGCTATG GTAAGTAGTC GATATAGTAC TATGGGAGCT
GACATAGTTC CTTCTTCTCT TCATTGA
 
Protein sequence
MQHAAVPGVV ASANVIPAKH LVIRGKVFKH VKRYSIEEYK SQIKEFRESI ACFARMHMSY 
MYHMLHNTFV VNNGRIMFKP EVEQFLLGIT SNMKLCTFVI KIGIVEHVMS RICRFYGSDS
IKYCASHYRD PKFIDSILVI LHDASHFDFS TMSYQVRNSM ANCVRRYNIT SVYELHDSSF
YSELLSMCYD FVRAKSNQSV QFQELCDFIK LSSTVQLAQM YHMILKTKCS TGNEQDNLQG
LLLQERNIES LIYSSVFFGK YACRVRKAFR HLYAPSDKNP VRTVSGLNIP YVMIQLNSKG
IFAEIEHCVN AEKMDFNVFV RDIVRYINKL LSYPREEGYI RADIGKYCAM VSSRYSTMGA
DIVPSSLH
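
The relationship between the two sequences can be checked with a short script. This is a minimal sketch rather than the database's own method: it strips the spacing used in the blocks above, translates codon by codon until the first stop codon, and computes the GC percentage. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code (the two differ in which codons may act as start codons, which this sketch does not model). All names are illustrative.

```python
# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):  # a trailing partial codon is ignored
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGCAACATG CTGCAGTACC"))  # MQHAAV
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives a quick consistency check on the record, and `gc_percent` can be compared with the listed GC content.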