Gene ECH_0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0045 
Symbol 
ID3927524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp42310 
End bp43407 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content30% 
IMG OID637901169 
Productputative GTP cyclohydrolase II 
Protein accessionYP_506877 
Protein GI88657727 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATT TATTTAACAG ATTTCACTCC AACAAGTATA TTACAGAAAA GTCTATCGCA 
GAACTAAGAT GTGGTATCCC AGTATTGCTA TATAACAGCA ATGATAGCTT ACTCATCTTC
CCCAGCGAAT TAATAGATAA TCAATTACTA AACACATTGA AAAAACATTT TAAAGATATC
AACATATTAG TAACCGGTAA TAGATTAAAC TTCATCTTCC AATCTTCAGG AAACCAACTC
TCAAGAATCA AAATTAAAGA ATCTCATGAT CTAGAATATA TATCCTCCCT ACTAACAGGT
CAAGAATTAC ACAAAGGATC TCTTGTTATT GATAATGTGA CAGTAAGTAC AAACTCTTTA
GATATAACTG CTATTTCCCT AATAAAATTA ACAAAACTAT TACCCTCAGC AGTTGTTGTT
GATATCAATG ATTCTGATGT ACTACATTGG TGTACTAAAA ACAACATCAC ACCTATAAGA
CAAGAAATAA TCGAGAATTA TAATAAAGAA TATGAAATTC AGGAAGTATG TAGTTCACCT
TTATTTCTAA AAGACTGTTC CAATGCTAAA ATAAATGTTT ACAGATCACA TACTGGGGAA
CTTGAACATT ACGCAATTAT TATAGAAAAT CCAGATTATA GCAATCCTAT CATTAGGATT
CATTCTTCAT GTTACACTGG TGACCTACTT AACAGCTTAT CTTGTGATTG TCGATGTCAA
CTACATACTG CCATAAAGTT AATGATAGAA AATAAAGGTG GAATAATTTT ATACTTAGCC
CAGGATGGTC GTGGTATAGG GCTAGCTAAC AAAATAAGAA CATATCAACT ACAAATAAAA
CACAATTTTG ATACTGTAGA TGCTAATAGA TTCTTTGGAT TTGAAGATGA TGAAAGGGTA
TTCATCCCAG CTATAAAAAT ACTACAGAAA TTGGGAATTT CAAGATTGCA ATTATTAACA
AACAATCCAA ATAAAATTTC AGAAATTCAG AATCACGGCA TACAAGTTAC AAAAATATTA
CCTATTTTTG TTGACACAAA TCAACATAAT ATTAATTATA TCAATACTAA AGCTAAAAGG
TTAGGTCACG TTTGCTAG
 
Protein sequence
MENLFNRFHS NKYITEKSIA ELRCGIPVLL YNSNDSLLIF PSELIDNQLL NTLKKHFKDI 
NILVTGNRLN FIFQSSGNQL SRIKIKESHD LEYISSLLTG QELHKGSLVI DNVTVSTNSL
DITAISLIKL TKLLPSAVVV DINDSDVLHW CTKNNITPIR QEIIENYNKE YEIQEVCSSP
LFLKDCSNAK INVYRSHTGE LEHYAIIIEN PDYSNPIIRI HSSCYTGDLL NSLSCDCRCQ
LHTAIKLMIE NKGGIILYLA QDGRGIGLAN KIRTYQLQIK HNFDTVDANR FFGFEDDERV
FIPAIKILQK LGISRLQLLT NNPNKISEIQ NHGIQVTKIL PIFVDTNQHN INYINTKAKR
LGHVC