Gene ECH_0123 details


Gene Information

Locus tag: ECH_0123
Symbol: guaA
ID: 3928057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 109470
End bp: 111050
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 34%
IMG OID: 637901247
Product: GMP synthase
Protein accession: YP_506951
Protein GI: 88658530
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0518] GMP synthase - Glutamine amidotransferase domain
        [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
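
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for ECH_0123.
length = gene_length(109470, 111050)
print(length, protein_length(length))  # prints "1581 526"
```

Both results agree with the Gene Length and Protein Length fields of the record.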


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAGGG TGGCAATTAT TGATTTTGGT TCGCAATTTA CACAATTGCT TGCTAGAAGG 
ATAAGAGAAC TAAATGTTTA TTCTGAAATT TTTCCGCATG ATATAGCATT TGATTACATA
AAAGATAGTA AAGCATTTAT TTTATCTGGA GGTCCTAAGT CGGTTCTGGA TTTTACAGGA
ATGCCTCCAA TAGTTCACGA TATTATTGAA TTGAATAAAA AAACTTCCGT TCCAGTATTG
GGAGTGTGTT ATGGATTACA ACTTTTAAGT AATTATTTTA ACTCGACAAT TGTACATGGG
TGTGGTCAGG AATTTGGGAA AGCTATTCTA AATGTTGTCA AAAAATCAGA AATGATAAAA
GATGTGTGGA AAGTAGGAGA CCAACCCTAT GTGTGGATGA GTCATGCAGA TAGTGTATAT
GATACTCCAT GCGGATTTGA AGTCATAGCA TGTAGTGTAG TGAATAATGC AATTGCGATG
ATCAGCAATG AGGAAAGAAG AATTTATGGT GTACAATTTC ATCCTGAAGT ATATCACACT
CCGGATGGGG TAAAATTACT TGCTAATTTT GTCAGAATAG CAGGTTGTGA CAATAACTGG
ACAGTGGAAT CCTTTTTAGA TGAGCAAGAG AATCTTATAA AGAAACAAGT GGGAGATAAA
AAAGTAATTG CTGCTCTTAG TGGTGGTGTT GATTCTAGTG TTGCTGCAGC TTTGACTTAT
CGAGCAATAG GTGATCAATT ACACTGTATC TTTATTGATA ATGGCTTACT GCGCTATAAT
GAAGCAGAAA AAGTAAGACA ATCGTTTGTT GATCAGTTTC AAATGCCTGT AACTATTGTT
GATAGATCAT CAGTATTTTT AGATAAACTT CAATTTGTTA CTGATCCAGA GCAAAAGCGA
AAAATAATAG GAAAAACGTT TATTGAAGTA TTTGAAGAAG AAGCGAATAA GATAGGGAAC
GTAGAATTTT TAATGCAAGG TACTATATAC CCTGATGTTA TAGAATCTGG TGGCTCTGTT
GGGAAAGAAA GTGTTACTAT CAAATCACAT CATAATGTTG GTGGACTACC TGATATAATG
AAGTTACAGC TTGTTGAACC ACTAAAGCTT TTGTTTAAAG ATGAAGTGAG ATTGTTGGGG
AAAAAACTTG GTATATCAGA TGAAATATTA ATGCGTCATC CATTCCCAGG ACCTGGTTTA
GCAATTAGAA TAATAGGTGA AATTACTCAA GAGAAGGTTA ATATGTTGCA GGCTGCCGAT
GAGATATATA TTAATCTTAT CAAAAAATAT AATTTGTACG ATGTTATATG GCAAGCGTTT
GCAGTGCTAT TGCCAGTGAA AACAGTGGGT GTAATGGGTG ATAGTAGAAC TTATGGTTAT
ACTTGTGCTT TAAGAGCAGT TACTTCTAGT GATGGTATGA CTGCAGAATG TTTCCCATTT
GGTGTAGACT TGGAAACTAA GATAATCTTT TATGAGTTTC TACAGGATGT TAGTAATACT
ATTGTGAATA ATGTTCAAGG AATTAATAGA GTGGTATATG ATACTACATC TAAACCTCCA
GCTACAATTG AATGGGAATA G
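
The GC content field in the record is a simple ratio over the gene sequence. Below is a minimal Python sketch of how such a value could be computed from the space-separated blocks shown above. The helper name `gc_content` is hypothetical, and it is an assumption that the database counts only G and C over the full gene length.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 10-base block of the gene sequence: 4 G + 1 C out of 10 bases.
print(gc_content("ATGTCAAGGG"))  # prints 0.5
```

Passing the whole gene sequence, with its spaces and line breaks left in, returns the fraction for the full gene, which can then be compared with the reported percentage.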
 
Protein sequence
MSRVAIIDFG SQFTQLLARR IRELNVYSEI FPHDIAFDYI KDSKAFILSG GPKSVLDFTG 
MPPIVHDIIE LNKKTSVPVL GVCYGLQLLS NYFNSTIVHG CGQEFGKAIL NVVKKSEMIK
DVWKVGDQPY VWMSHADSVY DTPCGFEVIA CSVVNNAIAM ISNEERRIYG VQFHPEVYHT
PDGVKLLANF VRIAGCDNNW TVESFLDEQE NLIKKQVGDK KVIAALSGGV DSSVAAALTY
RAIGDQLHCI FIDNGLLRYN EAEKVRQSFV DQFQMPVTIV DRSSVFLDKL QFVTDPEQKR
KIIGKTFIEV FEEEANKIGN VEFLMQGTIY PDVIESGGSV GKESVTIKSH HNVGGLPDIM
KLQLVEPLKL LFKDEVRLLG KKLGISDEIL MRHPFPGPGL AIRIIGEITQ EKVNMLQAAD
EIYINLIKKY NLYDVIWQAF AVLLPVKTVG VMGDSRTYGY TCALRAVTSS DGMTAECFPF
GVDLETKIIF YEFLQDVSNT IVNNVQGINR VVYDTTSKPP ATIEWE
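
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal Python sketch of that step, not the database's own pipeline. It builds the codon table from the NCBI amino-acid string for table 11 (bases ordered T, C, A, G), and the function name `translate` is illustrative. Alternative start codons are not handled, which is safe here only because this gene begins with ATG.

```python
from itertools import product

# NCBI genetic code table 11 (bacterial, archaeal and plant plastid),
# listed in TCAG order for the first, second and third codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a CDS up to the first stop codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks unrecognized codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence give the first six residues.
print(translate("ATGTCAAGGG TGGCAATT"))  # prints "MSRVAI"
```

Applied to the full gene sequence, the output can be compared with the protein sequence above after removing the spaces between its 10-residue blocks.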