Gene ECH_1006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_1006 
SymbolpurD 
ID3927978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1032412 
End bp1033677 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content35% 
IMG OID637902122 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_507793 
Protein GI88657834 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.283424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTAT TGGTGATTGG TTCTGGCGGC CGTGAGCACT CAATGTTGCA TCACATTCGT 
AAATCTACAT TACTAAACAA GCTATTTATC GCCCCAGGGC GTGAAGGAAT GTCTGGGTTA
GCAGATATAA TAGATATAGA TATCAATAGC ACAATAGAAG TAATTCAAGT ATGTAAGAAA
GAAAAAATTG AATTAGTAGT CATCGGACCA GAAACTCCAT TAATGAATGG ATTATCAGAC
GCATTAACAG AAGAAGGCAT ATTAGTCTTT GGACCTTCTA AAGCAGCAGC GCGTCTTGAA
TCTTCAAAAG GATTTACAAA AGAATTATGC ATGAGGTATG GAATTCCTAC TGCAAAATAC
GGGTACTTTG TTGATACAAA TTCAGCTTAC AAATTCATTG ATAAACACAA ATTACCTTTA
GTAGTTAAGG CTGATGGGTT AGCCCAGGGA AAAGGAACAG TGATATGTCA CACACACGAA
GAAGCATACA ATGCTGTAGA TGCCATGTTA GTGCACCACA AATTTGGAGA AGCTGGTTGT
GCAATAATTA TTGAAGAATT CCTTGAAGGC AAGGAAATTA GCTTCTTTAC ATTGGTTGAC
GGATCCAACC CAGTTATACT TGGCGTAGCA CAAGATTATA AAACTATAGG AGATAATAAT
AAAGGTCCTA ATACTGGAGG GATGGGATCA TACTCTAAAC CAAATATCAT TACACAAGAA
ATGGAGCATA TAATAATTCA GAAGATAATA TATCCAACTA TTAAAGCAAT GTTCAACATG
AATATACAGT TTAGAGGTCT GTTATTCGCT GGTATTATAA TCAAAAAAAA TGAACCAAAA
TTACTTGAAT ATAATGTACG GTTTGGAGAT CCTGAAACAC AATCAATATT ACCAAGATTA
AATTCCGATT TCTTAAAACT TTTATCACTA ACAGCTAAAG GTAAACTAGG AAATGAATCA
GTAGAATTAA GTAAAAAAGC TGCTTTATGT GTTGTGGTAG CTAGTCGTGG ATATCCAGGT
GAGTATAAGA AAAATTCTAT AATTAATGGA ATAGAAAATA TTGAAAAGCT ACCTAATGTT
CAGCTCTTAC ATGCAGGCAC AAGAAGAGAA GGAAATAACT GGGTATCAGA TTCTGGAAGA
GTAATAAATG TTGTAGCACA AGGTGAAAAT TTAGCTAGTG CGAAACACCA AGCCTACGCT
GCATTGGACT TATTAGATTG GCCAGATGGA ATTTACAGAT ATGATATAGG ATCATGTGCT
CTTTAA
 
Protein sequence
MNVLVIGSGG REHSMLHHIR KSTLLNKLFI APGREGMSGL ADIIDIDINS TIEVIQVCKK 
EKIELVVIGP ETPLMNGLSD ALTEEGILVF GPSKAAARLE SSKGFTKELC MRYGIPTAKY
GYFVDTNSAY KFIDKHKLPL VVKADGLAQG KGTVICHTHE EAYNAVDAML VHHKFGEAGC
AIIIEEFLEG KEISFFTLVD GSNPVILGVA QDYKTIGDNN KGPNTGGMGS YSKPNIITQE
MEHIIIQKII YPTIKAMFNM NIQFRGLLFA GIIIKKNEPK LLEYNVRFGD PETQSILPRL
NSDFLKLLSL TAKGKLGNES VELSKKAALC VVVASRGYPG EYKKNSIING IENIEKLPNV
QLLHAGTRRE GNNWVSDSGR VINVVAQGEN LASAKHQAYA ALDLLDWPDG IYRYDIGSCA
L