Gene Information |
Locus tag | ECH_0123 |
Symbol | guaA |
ID | 3928057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 109470 |
End bp | 111050 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901247 |
Product | GMP synthase |
Protein accession | YP_506951 |
Protein GI | 88658530 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain; [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit; [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
|
|
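The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and an unspliced CDS whose length includes the terminal stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(109470, 111050)
print(length, protein_length(length))  # prints: 1581 526
```

Both values agree with the Gene Length (1581 bp) and Protein Length (526 aa) fields of this record.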
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGGG TGGCAATTAT TGATTTTGGT TCGCAATTTA CACAATTGCT TGCTAGAAGG ATAAGAGAAC TAAATGTTTA TTCTGAAATT TTTCCGCATG ATATAGCATT TGATTACATA AAAGATAGTA AAGCATTTAT TTTATCTGGA GGTCCTAAGT CGGTTCTGGA TTTTACAGGA ATGCCTCCAA TAGTTCACGA TATTATTGAA TTGAATAAAA AAACTTCCGT TCCAGTATTG GGAGTGTGTT ATGGATTACA ACTTTTAAGT AATTATTTTA ACTCGACAAT TGTACATGGG TGTGGTCAGG AATTTGGGAA AGCTATTCTA AATGTTGTCA AAAAATCAGA AATGATAAAA GATGTGTGGA AAGTAGGAGA CCAACCCTAT GTGTGGATGA GTCATGCAGA TAGTGTATAT GATACTCCAT GCGGATTTGA AGTCATAGCA TGTAGTGTAG TGAATAATGC AATTGCGATG ATCAGCAATG AGGAAAGAAG AATTTATGGT GTACAATTTC ATCCTGAAGT ATATCACACT CCGGATGGGG TAAAATTACT TGCTAATTTT GTCAGAATAG CAGGTTGTGA CAATAACTGG ACAGTGGAAT CCTTTTTAGA TGAGCAAGAG AATCTTATAA AGAAACAAGT GGGAGATAAA AAAGTAATTG CTGCTCTTAG TGGTGGTGTT GATTCTAGTG TTGCTGCAGC TTTGACTTAT CGAGCAATAG GTGATCAATT ACACTGTATC TTTATTGATA ATGGCTTACT GCGCTATAAT GAAGCAGAAA AAGTAAGACA ATCGTTTGTT GATCAGTTTC AAATGCCTGT AACTATTGTT GATAGATCAT CAGTATTTTT AGATAAACTT CAATTTGTTA CTGATCCAGA GCAAAAGCGA AAAATAATAG GAAAAACGTT TATTGAAGTA TTTGAAGAAG AAGCGAATAA GATAGGGAAC GTAGAATTTT TAATGCAAGG TACTATATAC CCTGATGTTA TAGAATCTGG TGGCTCTGTT GGGAAAGAAA GTGTTACTAT CAAATCACAT CATAATGTTG GTGGACTACC TGATATAATG AAGTTACAGC TTGTTGAACC ACTAAAGCTT TTGTTTAAAG ATGAAGTGAG ATTGTTGGGG AAAAAACTTG GTATATCAGA TGAAATATTA ATGCGTCATC CATTCCCAGG ACCTGGTTTA GCAATTAGAA TAATAGGTGA AATTACTCAA GAGAAGGTTA ATATGTTGCA GGCTGCCGAT GAGATATATA TTAATCTTAT CAAAAAATAT AATTTGTACG ATGTTATATG GCAAGCGTTT GCAGTGCTAT TGCCAGTGAA AACAGTGGGT GTAATGGGTG ATAGTAGAAC TTATGGTTAT ACTTGTGCTT TAAGAGCAGT TACTTCTAGT GATGGTATGA CTGCAGAATG TTTCCCATTT GGTGTAGACT TGGAAACTAA GATAATCTTT TATGAGTTTC TACAGGATGT TAGTAATACT ATTGTGAATA ATGTTCAAGG AATTAATAGA GTGGTATATG ATACTACATC TAAACCTCCA GCTACAATTG AATGGGAATA G
|
Protein sequence | MSRVAIIDFG SQFTQLLARR IRELNVYSEI FPHDIAFDYI KDSKAFILSG GPKSVLDFTG MPPIVHDIIE LNKKTSVPVL GVCYGLQLLS NYFNSTIVHG CGQEFGKAIL NVVKKSEMIK DVWKVGDQPY VWMSHADSVY DTPCGFEVIA CSVVNNAIAM ISNEERRIYG VQFHPEVYHT PDGVKLLANF VRIAGCDNNW TVESFLDEQE NLIKKQVGDK KVIAALSGGV DSSVAAALTY RAIGDQLHCI FIDNGLLRYN EAEKVRQSFV DQFQMPVTIV DRSSVFLDKL QFVTDPEQKR KIIGKTFIEV FEEEANKIGN VEFLMQGTIY PDVIESGGSV GKESVTIKSH HNVGGLPDIM KLQLVEPLKL LFKDEVRLLG KKLGISDEIL MRHPFPGPGL AIRIIGEITQ EKVNMLQAAD EIYINLIKKY NLYDVIWQAF AVLLPVKTVG VMGDSRTYGY TCALRAVTSS DGMTAECFPF GVDLETKIIF YEFLQDVSNT IVNNVQGINR VVYDTTSKPP ATIEWE
|
| |
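The gene and protein sequences above are displayed in blocks of ten characters. As a rough sketch of how the two relate, the snippet below strips the display whitespace, translates the DNA codon by codon, and computes a GC fraction. It assumes the standard sense-codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not functions from an existing library, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace between ten-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence in this record.
print(translate("ATGTCAAGGG TGGCAATTAT TGATTTTGGT"))  # prints: MSRVAIIDFG
```

The printed residues match the first block of the Protein sequence field. Applying `gc_fraction` to the full Gene sequence field gives a value that can be compared with the reported GC content.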