Gene ECH_0129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0129 
Symbol 
ID3928005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp118392 
End bp119642 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content28% 
IMG OID637901253 
ProductHemY domain-containing protein 
Protein accessionYP_506957 
Protein GI88658307 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3071] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTACAA GCGTTTTAAT CTTTGTTATT GTTGCACTTA CTATAGGGCT ATGGGTAATA 
GATTGCGATG GTATCATCAG AATAGATTGG TTAGGCTATG ATATAGAAGT AAATATCTTA
TTTACGTTGT TTGTCATTGC TGTTGTTTTC CTTCTTTTAA TTCTATTAGT AAGATTCATT
TTTTGTTTTT CTCGATGTGT GTATCGATAT AAAAGAGATT TACAGAATAA GAAAATGGTC
TTGTTGGAAC AGGGTTACAT GTATTTAAAT TGTGGAGATG TTGAAAGAGT AGAAAAGATT
ATAGTAAAGA TAGGAAATTT TGATCATCCA TCTTTATTTT TGTTAAAGGG AAGGGTTTAT
TTTGATACTG GAAAATACAT ATTGGCTGAA AAGTATTTTA CGCAATTTGT GAAAGTTGTA
CCAGTTATAG ATGCTTCGTT GGGTATACAC TTGTTGAATG TTATTATGCA GATAGAAGAT
CAGATTCAAC AGCTGAGTTT ATTGAGGAAA ATGCTGGAGA TTTTTTTTAA ACAATCTTGG
TCAGCTATTT TTAAGTTAAC TATATATCGG ATTTCTAGAG ATTGGGGTAA TGCAATTGAA
GAAATGAAAA AAATAATAAA GTTAAAAATA AATCTGCCTT TACCTTATAA TACACAAGAG
ATGCTTAATG TGTTTTATTA TGCATTAGCA AAACAATGTT ATGATATTCA AAAGTATGAT
GATGGTTTAA GAGTACTTGA TAACATTAAG AATTGTTCTC AACAGTGTAG TACTGCTGTT
ACATTGTTGA AAGCTAAATT TTATATAGAT ACTGATAAGA AACGTAAAGC GGTGAATATT
CTTGAACATG AATACCGTAT TAATCCACAC CCTGATATTG CAAATTTCTA TTTAGATATT
ATGCAGCATA GTAGTCATGC TATACATAAA TTATATAGTT TTAATACTGG ATATTACTTT
AGTATATATC TTATAGCACA GGATGCTATA AATTCAGGTG AATATGATAC AGCAATGAAA
TATTTAAATC ATAGTTTCAA AACTAAAACT TATATTTCTT TGTATTTTTT AGTGTTAAAA
CTAAAAGTAT TATCACAAAA CTATAATGAA CTTTTGTATT GGACAGATAA AATTGCAAAA
GATGCTATAG CAGATAAGTA TTGGAGTTGT ACAAAGTGTA AATATACCCC TACCTGTTGG
CATTATGAGT GTGATGGTTG TAAAAGTTTT AATACCATAA TTTGGGTTTA A
 
Protein sequence
MITSVLIFVI VALTIGLWVI DCDGIIRIDW LGYDIEVNIL FTLFVIAVVF LLLILLVRFI 
FCFSRCVYRY KRDLQNKKMV LLEQGYMYLN CGDVERVEKI IVKIGNFDHP SLFLLKGRVY
FDTGKYILAE KYFTQFVKVV PVIDASLGIH LLNVIMQIED QIQQLSLLRK MLEIFFKQSW
SAIFKLTIYR ISRDWGNAIE EMKKIIKLKI NLPLPYNTQE MLNVFYYALA KQCYDIQKYD
DGLRVLDNIK NCSQQCSTAV TLLKAKFYID TDKKRKAVNI LEHEYRINPH PDIANFYLDI
MQHSSHAIHK LYSFNTGYYF SIYLIAQDAI NSGEYDTAMK YLNHSFKTKT YISLYFLVLK
LKVLSQNYNE LLYWTDKIAK DAIADKYWSC TKCKYTPTCW HYECDGCKSF NTIIWV