Gene ECH_0154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0154 
SymbolychF 
ID3928021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp145220 
End bp146308 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content33% 
IMG OID637901278 
ProductGTP-dependent nucleic acid-binding protein EngD 
Protein accessionYP_506981 
Protein GI88658475 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0012] Predicted GTPase, probable translation factor 
TIGRFAM ID[TIGR00092] GTP-binding protein YchF 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTGA ACTGCGGTAT TGTTGGACTA CCTAATGTAG GAAAATCAAC TCTATTTAAT 
GCATTAACAC AAACTATGGT AGCAGAAGTA GCAAACTACC CATTCTGTAC AATAGAACCA
AATATAGGCA AAGCAATAGT ACAAGATCAT AGGTTAAAAA CTTTAGCAAA CATAGCATCT
TCTAAGAAAA TTATATACAA CCAAGTTGAA TGTGTCGATA TTGCTGGATT GGTCAGTGGA
GCAAGTCAAG GTGAAGGATT AGGAAATAAA TTCTTAAGTC ATATAAGAGA AGTGGATGCT
ATTATTCATG TATTACGATG TTTCGGAGAT CAAAACATTA GCCATGTTAA CCAAACTGTA
GACCCAATAA GTGATGCAGA AATTGTAGAA ATGGAGCTGA TTTTAGCTGA TATTGAAAGT
TTAAAACGTC GCTTACCTGC AACAGAAAAA GCTGTAAAAG CTAATAAAGA ACCCAGAAAG
AAATTAGATA CTATATTAGA AGTATTAAGT GTGCTAGAAG CTGGCAATTT AGCAAAAAGT
GCAAAACACC TAGGCGATGA TTTAAAACAA CTACAACTCA TTACAACAAA ACCTATGATG
TACGTATGTA ATGTAGAAGA ATCGAATGTA ACCACTGGAA ATGCTTTATC AGAAAAAGTA
AAACTTATGG CAGAAAAAAA ACATAATAAG TTTTGTTGTA TTTCAGCAAA ATTAGAAGCA
GATGTATCAA GTTTAGAAAC AGAAGAAGAA AAACAAATTT TTCTAGCTGA GTTTAATTTA
CAAGAATCTG GTACAACATC TGTAGTAAAA ACAATGTATG ATCTGCTAGA TATGATTACA
TTTTTTACAT TAGGTCCACA AGAAGCACGT GCCTGGCCAA TAAAAAGATT CTCTACTGCT
AGCAGTGCTG CAGGCGTAAT ACACACTGAC TTTGAAAAAG GGTTCATAAA AGCAGAGCTC
ATTAGTTTTG ATGACTATAT AAAATACAAT GGAGAGGCAA AGTGCAAAGA AGCTGGAAAA
GTCAGGTTTG AAGGCAGAGA TTATATCGTA CAAGATGGAG ATATAATACA CTTTAGGCAT
AATAAATAA
 
Protein sequence
MGLNCGIVGL PNVGKSTLFN ALTQTMVAEV ANYPFCTIEP NIGKAIVQDH RLKTLANIAS 
SKKIIYNQVE CVDIAGLVSG ASQGEGLGNK FLSHIREVDA IIHVLRCFGD QNISHVNQTV
DPISDAEIVE MELILADIES LKRRLPATEK AVKANKEPRK KLDTILEVLS VLEAGNLAKS
AKHLGDDLKQ LQLITTKPMM YVCNVEESNV TTGNALSEKV KLMAEKKHNK FCCISAKLEA
DVSSLETEEE KQIFLAEFNL QESGTTSVVK TMYDLLDMIT FFTLGPQEAR AWPIKRFSTA
SSAAGVIHTD FEKGFIKAEL ISFDDYIKYN GEAKCKEAGK VRFEGRDYIV QDGDIIHFRH
NK