Gene ECH_0139 details


Gene Information

Locus tag: ECH_0139
Symbol: purF
ID: 3928019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 130634
End bp: 132022
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 33%
IMG OID: 637901263
Product: amidophosphoribosyltransferase
Protein accession: YP_506967
Protein GI: 88658320
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase
TIGRFAM ID: [TIGR01134] amidophosphoribosyltransferase
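
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 130634
END_BP = 132022


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1389 462
```

Both values agree with the Gene Length and Protein Length fields of the record.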


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTTTA ATGAAATATA TGAAGAATGC GGAGTATTTG CCATACAAAA TAATAATTGT 
GCTGCTATTA ATTGCATCCT AGGGCTACAT GCACTTCAAC ATAGAGGCCA AGAATCATTT
GGCATAGTAA CGTCAGAAGA TAACAAACTT CATTTTCACT ATTCAAATGA ACAAGTCAAC
AGCATATTCA ATCAACAATC AAAAATAGAC TCTTTACTAG GAAACACTGC AATTGGACAT
ATACGTTATT CTACAAGTGG AAGTAAAGTT GGTGTTCAAC CAATAACCTT AGATTGTAAA
TTTGGGAAAT TAGCAATAGC ACACAATGGT AACCTCACTA ATGCAGCACA AATAAGAAAA
TCACTTACTG AAAGGGGATG TATATTTTCA TCAGATATCG ATACAGAAGT CATTGCTCAC
CTAATTGCTA TTAATACTGA GAATACTCTC TTGGACAATG TTATTAATGC ATTAAAAACC
ATTAAGGGCG CATATTCTTT AGTTATATTA ATAAACGGTA CTATAATATG TTGTCGCGAT
CCAGCTGGAA TTAGACCACT AGTATTAGGT ATGTTAGATA ATTCATATAT TGTAGCTTCA
GAAACTTGTG CTCTAGATAT AGTTGGAGCC CAATTTATAA GAGATGTATT ACCAGGAGAA
TTCATCACAA TTGACCAAGG TAACACTTTA ACAAGTTCTT TTCCATTTAA AAAACAAAAA
TCAAGTTTTT GTATTTTTGA ATACGTATAT TTTGCACGAC CAGATAGTAT AATAGACAAT
AAGTCTATAT ATGAAATACG CAAAAACATA GGTAAAGAAT TAGCAATAGA AAATCCTATT
CCAAAAGATA CACACATGAT AGTACCAGTA CCAGATTCGG GAGTACCAGC TGCATTGGGG
TTCGCAGAAT ACACAAAAAT ACCTTTTGAA TTTGGCATTA TTAGGAACCA TTATATTGGA
AGAACTTTTA TCCAACCCAA CGATCACATT CGTAGTATGG GAGTGAAGTT AAAACACAAT
GCTAATTCTT CCATACTAAA AGACAAGGTA ATAGTTTTAA TAGATGATAG CCTAGTTAGA
GGCACAACAC TAAAAAGTAT AATAACACTA CTACACAAAG CAGGAGTTCA ACAAATACAT
TTAAGAATTT CTAGCCCACC AACTATAAAT TCTTGCTTTT ATGGAATAGA CACTCCAGAA
GAATCAAAAT TAATAGCTAA CAGGTTATCA CAGTTAGAGA TCAAAAATGC TCTTGGTTGC
GATAGTCTAC ACTTTTTAAG TATAGATGGT CTCTACAAGG CAATATGCAA CACAAAACGT
AATAATAGTA TACCTCAATA CTGTGATGCC TGTTTTACTG GGGATTATCC AATAGGAAAA
ATAGAATAA
 
Protein sequence
MQFNEIYEEC GVFAIQNNNC AAINCILGLH ALQHRGQESF GIVTSEDNKL HFHYSNEQVN 
SIFNQQSKID SLLGNTAIGH IRYSTSGSKV GVQPITLDCK FGKLAIAHNG NLTNAAQIRK
SLTERGCIFS SDIDTEVIAH LIAINTENTL LDNVINALKT IKGAYSLVIL INGTIICCRD
PAGIRPLVLG MLDNSYIVAS ETCALDIVGA QFIRDVLPGE FITIDQGNTL TSSFPFKKQK
SSFCIFEYVY FARPDSIIDN KSIYEIRKNI GKELAIENPI PKDTHMIVPV PDSGVPAALG
FAEYTKIPFE FGIIRNHYIG RTFIQPNDHI RSMGVKLKHN ANSSILKDKV IVLIDDSLVR
GTTLKSIITL LHKAGVQQIH LRISSPPTIN SCFYGIDTPE ESKLIANRLS QLEIKNALGC
DSLHFLSIDG LYKAICNTKR NNSIPQYCDA CFTGDYPIGK IE