Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1006 |
Symbol | purD |
ID | 3927978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1032412 |
End bp | 1033677 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637902122 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_507793 |
Protein GI | 88657834 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.283424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTAT TGGTGATTGG TTCTGGCGGC CGTGAGCACT CAATGTTGCA TCACATTCGT AAATCTACAT TACTAAACAA GCTATTTATC GCCCCAGGGC GTGAAGGAAT GTCTGGGTTA GCAGATATAA TAGATATAGA TATCAATAGC ACAATAGAAG TAATTCAAGT ATGTAAGAAA GAAAAAATTG AATTAGTAGT CATCGGACCA GAAACTCCAT TAATGAATGG ATTATCAGAC GCATTAACAG AAGAAGGCAT ATTAGTCTTT GGACCTTCTA AAGCAGCAGC GCGTCTTGAA TCTTCAAAAG GATTTACAAA AGAATTATGC ATGAGGTATG GAATTCCTAC TGCAAAATAC GGGTACTTTG TTGATACAAA TTCAGCTTAC AAATTCATTG ATAAACACAA ATTACCTTTA GTAGTTAAGG CTGATGGGTT AGCCCAGGGA AAAGGAACAG TGATATGTCA CACACACGAA GAAGCATACA ATGCTGTAGA TGCCATGTTA GTGCACCACA AATTTGGAGA AGCTGGTTGT GCAATAATTA TTGAAGAATT CCTTGAAGGC AAGGAAATTA GCTTCTTTAC ATTGGTTGAC GGATCCAACC CAGTTATACT TGGCGTAGCA CAAGATTATA AAACTATAGG AGATAATAAT AAAGGTCCTA ATACTGGAGG GATGGGATCA TACTCTAAAC CAAATATCAT TACACAAGAA ATGGAGCATA TAATAATTCA GAAGATAATA TATCCAACTA TTAAAGCAAT GTTCAACATG AATATACAGT TTAGAGGTCT GTTATTCGCT GGTATTATAA TCAAAAAAAA TGAACCAAAA TTACTTGAAT ATAATGTACG GTTTGGAGAT CCTGAAACAC AATCAATATT ACCAAGATTA AATTCCGATT TCTTAAAACT TTTATCACTA ACAGCTAAAG GTAAACTAGG AAATGAATCA GTAGAATTAA GTAAAAAAGC TGCTTTATGT GTTGTGGTAG CTAGTCGTGG ATATCCAGGT GAGTATAAGA AAAATTCTAT AATTAATGGA ATAGAAAATA TTGAAAAGCT ACCTAATGTT CAGCTCTTAC ATGCAGGCAC AAGAAGAGAA GGAAATAACT GGGTATCAGA TTCTGGAAGA GTAATAAATG TTGTAGCACA AGGTGAAAAT TTAGCTAGTG CGAAACACCA AGCCTACGCT GCATTGGACT TATTAGATTG GCCAGATGGA ATTTACAGAT ATGATATAGG ATCATGTGCT CTTTAA
|
Protein sequence | MNVLVIGSGG REHSMLHHIR KSTLLNKLFI APGREGMSGL ADIIDIDINS TIEVIQVCKK EKIELVVIGP ETPLMNGLSD ALTEEGILVF GPSKAAARLE SSKGFTKELC MRYGIPTAKY GYFVDTNSAY KFIDKHKLPL VVKADGLAQG KGTVICHTHE EAYNAVDAML VHHKFGEAGC AIIIEEFLEG KEISFFTLVD GSNPVILGVA QDYKTIGDNN KGPNTGGMGS YSKPNIITQE MEHIIIQKII YPTIKAMFNM NIQFRGLLFA GIIIKKNEPK LLEYNVRFGD PETQSILPRL NSDFLKLLSL TAKGKLGNES VELSKKAALC VVVASRGYPG EYKKNSIING IENIEKLPNV QLLHAGTRRE GNNWVSDSGR VINVVAQGEN LASAKHQAYA ALDLLDWPDG IYRYDIGSCA L
|
| |