Gene ECH_1116 details


Gene Information

Locus tag: ECH_1116
Symbol:
ID: 3927017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1140567
End bp: 1141772
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 30%
IMG OID: 637902230
Product: polyA polymerase/tRNA nucleotidyltransferase family protein
Protein accession: YP_507900
Protein GI: 88657671
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
TIGRFAM ID:
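
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the last codon of this unspliced CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1140567
end_bp = 1141772
gene_length_bp = 1206
protein_length_aa = 401

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks agree with the listed values (1206 bp, 402 codons, 401 aa).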


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.172722
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACGATG ACTTTATAAA TAATGAGAAT ATTCTGTTAA TCATAAATGC AATAAAAAAA 
TTTCAAGGAG ATATTAGGTT AGTAGGTGGA TGTGTAAGAG ATAGTCTTCT AAAAAGACAG
ACCATAGATA TTGATTTTGC AACTACTTTA TTACCCAACC AAACAATAAA TGCTCTTACT
GCAGCTCATA TTAAAGCTAT TCCAACAGGT ATAAAACATG GTACAATAAC AGCCTTAGTT
AATAATACAG CATATGAAAT TACAACACTA AGATCTGATA TTAGCTGTGA TGGAAGACAT
GCTGAAGTAA AATTTACAAA CAATTGGCAG CAAGACGCTT CAAGAAGAGA TTTCACCTTC
AATGCTCTAT ATTGTGATGA AAAAGGAATA GTATATGATT ATTTTTCTGG TATCCAAGAT
CTAGAAAAAA AACATCTAAA TTTTATTGGA GATCCAGAAA TTAGAATACA AGAAGACTAC
CTACGCATAC TTCGAGCATT TAGATTTTAT GCTTCTATAT GTAGTCAAAA CAAATTGAGT
GATGAAATAG TGCACTCTTG CACAAAATAT TCATCTTATA TCAATAACCT ATCCAGAGAA
CGCATTCGCG ATGAGTTCTT TAAACTTTTA TTATGTCCTA ACTTATCAAA CACATTAAAG
ATTATGCAAA AATGCCACGT GCTAGATAAA ATCATTCCCT TTGAAGTCAT ACCAGACATA
ATGTCATCTG AGACCTTATC AAACACAGAT CCACTAACAA AATTAGCAGC TCTTTTAAGA
ACAAACAATA ACAACCACTC TCTAGATAAA ATTAAAGCTT CTTTATGCTT ATCAAACTAC
AGTCAAAAAA CACTTGTGTC ACTATTAAAC AATAATTTAG AACTTCCACT TTCAACTACC
GCACAACACA AACACATTAA CAAGCTTGGA AAAGAAATAT ACTGCAATCT ACTGAGAATA
ATACATGCAG AATTAAATTT AAATTATCAT GACCTAATGC AATATATAGA GTACGCAGAT
CAATTAATTA TTCCTGAATT TCCTATCTCT GGAAAAGATT TACTTAATAT AGGATACCAA
CCAGGAAAAA ATCTTGGTAT CACTTTAGAA AAAATCAAAG ATCTATGGGA AAATAGTTCA
TATCAACTAA CAAAAACCCA ATTATTAGAT TACGCGAGAG GAAAATTATT AAAAAGTAAG
AATTAA
 
Protein sequence
MYDDFINNEN ILLIINAIKK FQGDIRLVGG CVRDSLLKRQ TIDIDFATTL LPNQTINALT 
AAHIKAIPTG IKHGTITALV NNTAYEITTL RSDISCDGRH AEVKFTNNWQ QDASRRDFTF
NALYCDEKGI VYDYFSGIQD LEKKHLNFIG DPEIRIQEDY LRILRAFRFY ASICSQNKLS
DEIVHSCTKY SSYINNLSRE RIRDEFFKLL LCPNLSNTLK IMQKCHVLDK IIPFEVIPDI
MSSETLSNTD PLTKLAALLR TNNNNHSLDK IKASLCLSNY SQKTLVSLLN NNLELPLSTT
AQHKHINKLG KEIYCNLLRI IHAELNLNYH DLMQYIEYAD QLIIPEFPIS GKDLLNIGYQ
PGKNLGITLE KIKDLWENSS YQLTKTQLLD YARGKLLKSK N
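
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the DNA, and compute a GC fraction. It is an illustration under stated assumptions, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table uses the amino acid assignments of translation table 11 (which are the same as those of the standard code), and alternative start codons are not handled because this gene starts with ATG. Only the first 60 bases of the gene sequence are used, so the GC value it prints describes that excerpt, not the whole gene.

```python
from itertools import product

# Amino acid assignments for translation table 11, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record above.
first_block = "ATGTACGATG ACTTTATAAA TAATGAGAAT ATTCTGTTAA TCATAAATGC AATAAAAAAA"

print(translate(first_block))    # MYDDFINNENILLIINAIKK
print(gc_fraction(first_block))  # GC fraction of this excerpt only
```

The translated excerpt matches the first 20 residues of the protein sequence listed above. Passing the full 1206-base gene sequence to the same functions would give the complete protein and the gene-wide GC fraction.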