Gene ECH_0526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0526 
Symbol 
ID3927366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp527303 
End bp528790 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content28% 
IMG OID637901649 
Producthypothetical protein 
Protein accessionYP_507341 
Protein GI88658140 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGT TTTCGTTTGC TACTGCTTTT GTTTCTTTCT TGCTTTTGTA TAATGTTGAT 
GCTTTTTCAG TAGATAATAT AGATAAAAAT CATGAAAATC AGAAAAAACT AGTAAACCTT
ATAAAAAATA ATGAATATGT AAAACAAGCA AGTTTGCATC CATCAATTAA GAGTTATCAT
AGAATTGATC TTGGGGGAAA AGTATTGTCT TATGCTTGGT TTACTAATAA TTCAGAAAAA
ATGCAAGATC ATGGAGTAAA ATTAGATGGT GTTTTAAATA TAAAATCTAT AAATAATAAT
TCTGATCTTG GAATTTTCTA TGGTGGTAAT TTTCAATTGG CTATACCTGC TATGAAAAGT
GAGAATTTTA TTCCTTCGAT GAAAGCATAT AATAGAGGAG CACAATTATT TGTTGAATCT
GGTTATGGGA ATTTATCATT TGGATATCAA GAAGGTGTTG AGTCTATAAT GAAAATAGAT
GCTTCTAGTA TAGGAGCTGG AGATAATAGT ATTGCTTGGT TACAATACAC AAACTTATCA
AACCTTGATG GAAAAGTACA GTATCAAGTA TTTCCTGGAT TATATAGTGA AAGTGTGTTT
AACAGGAGTA ATAATAATGT TATTTCTATT AAAGATAAGG ATTTTGTTAA TAATTTGCCA
TTCAGGATAT CTTATCAATC TCCGAATTTT ATGGGTGTAA AATTTGGTAT TAGTTATTCT
CCAACAGGTT ATGATAGTAA CTTATTCGAA AGTGTGAGTT CTTATAATAT TAAAAAATTA
ACACTACCTC CTGTAATAAG TGCAGATACA ACTTCTGAGA TACAAGCAAA TCCTAATGAT
AAAGATAATA TTTTATATAA TGCATTAGGT AAAGAAAAAG TTCAAGATGT CACTATAGAA
GGTATAGTTC CGTCTAAAAT AGAGTTTTTG CAAGCACGTT ATGAAAATAT TGTAAGTGCT
GGATTATCGT ATAATCATTC TTTTAATGAT ATTGATTTTC AAGCATCTGT TGTTGGAGAA
TATGGATCTA CTGATATTGA TAAGTTAAAG TCATACTCAA AGTATCCATC TGCTGAAAAT
TTAGCAGCTT TTGCTATTGG TACATCTGTT ACTTATCGTG ATGTTATAGT TGCAGGTTCT
TATGGATATT TGGGAAAATC AGGTTATATT AATACGATTT ATTCTGCTAC TGAAGCTCCT
TTAAAAATGT TTTCTCCTGA TAATCAGTAT ACTTATTATT GGAACATTGG TGCAAAATAT
GTGTACAGTA ACGCTTCAAT TAGTACATCT TATTTTAGAA GTAATAAAGT TAATACTCAC
TTCTATGACT TTAGTTTAGG TATTGATTAT AATTTATCTC TAAGTAGTAG TCATAAGGGA
CAATACAAAG TTTTTGGAAA TTATCATTAT TTTAATATAG ATAATAAGAA TTTTAAAGTT
TCACGTGATG GTAGTGTGCT ATTACTAGGT GTTAAGTATG AATTCTAA
 
Protein sequence
MKKFSFATAF VSFLLLYNVD AFSVDNIDKN HENQKKLVNL IKNNEYVKQA SLHPSIKSYH 
RIDLGGKVLS YAWFTNNSEK MQDHGVKLDG VLNIKSINNN SDLGIFYGGN FQLAIPAMKS
ENFIPSMKAY NRGAQLFVES GYGNLSFGYQ EGVESIMKID ASSIGAGDNS IAWLQYTNLS
NLDGKVQYQV FPGLYSESVF NRSNNNVISI KDKDFVNNLP FRISYQSPNF MGVKFGISYS
PTGYDSNLFE SVSSYNIKKL TLPPVISADT TSEIQANPND KDNILYNALG KEKVQDVTIE
GIVPSKIEFL QARYENIVSA GLSYNHSFND IDFQASVVGE YGSTDIDKLK SYSKYPSAEN
LAAFAIGTSV TYRDVIVAGS YGYLGKSGYI NTIYSATEAP LKMFSPDNQY TYYWNIGAKY
VYSNASISTS YFRSNKVNTH FYDFSLGIDY NLSLSSSHKG QYKVFGNYHY FNIDNKNFKV
SRDGSVLLLG VKYEF