Gene ECH_0937 details


Gene Information

Locus tag: ECH_0937
Symbol: argH
ID: 3927223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 957508
End bp: 958920
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 33%
IMG OID: 637902053
Product: argininosuccinate lyase
Protein accession: YP_507726
Protein GI: 88657716
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
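
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length; the function names are illustrative only and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Amino acid count, assuming one trailing stop codon that is not translated."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record: Start bp 957508, End bp 958920.
length_bp = gene_length(957508, 958920)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the result matches the record: 1413 bp and 470 aa.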


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.941991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATC CTTTATGGGG AGGAAGGTTT ACTGTATCCC CCAGTGACAT TATGAAAAAG 
ATTAATGAAT CAATATCGTT TGACAAAATA CTATATGAAG AAGATATATC TGGGTCAATA
GCACACTGTA AAATGTTAGT TAACCAAAAA ATCATTAGCA AATATGAAGG TCAACTTATT
ATTCATGGAC TAGAAGTTAT ACAAAACCAA ATTTCATCTG GCACTTTTGA ATTCAGCACA
GACCTAGAAG ACATACACAT GAACATAGAA CACCACTTAA AGAAAATGAT AGGTAACATT
GCAGGAAAGT TGCATACTGC AAGATCTCGT AATGATCAAG TTGCAACAGA TTTTAAACTT
TGGATACGGA AATCAATAGT AAAATTAGAA ACGCTATTAC ATGAATTACA ACAGACTATA
CTTAATATAG CTGAAGCTAA TTACGATACT ATCATGCCAG GATTTACACA CTTACAAATT
GCTCAACCTG TAACATTAGG TCATCATTTA ATGGCATATT TTGAAATGTT AAAAAGAGAC
TGTTCACGCT GGCAAGATTT ACACAAACGC ATGAATCAAT GTCCTGCAGG ATCTGCAGCA
TTAGCAGGAA CATCTTTTCC AATAGACAGA CATTTCATCG CACAAGAACT AAAATTTGAC
AGCCCAACAG AAAATTCTAT AGATGCAGTA TCAGACAGAG ACTATGTTAT TGAATTTTTA
TCAAATGCTT CAATATGCAT AATGCATTTA TCAAGGTTAG CAGAAGAAAT TATACTTTGG
TGCAGCTACA ATTTTAAGTT TATAACACTT TCCGATAATA TCACAACCGG AAGTTCAATA
ATGCCACAAA AGAAAAACCC AGATGCAGCA GAACTTATCA GAGGAAAAAC TGGAAGGATT
TTTGCATCAT TAAACCAAAT ATTAGTCGTC ATGAAAGGAC TACCACTAGC ATATAGCAAA
GATATGCAAG AAGACAAAGA ACCTGTCTTT GATGCAGCAA ACAACTTAAT GTTATGTATA
GAAGCAATGA ACAGCATGTT AAACAATATT ACCATTAACA AAAGTAATAT GCTAAAAGCA
GCAGAGCATG ACTATTCAAC AGCAACAGAT CTTGCAGACT GGCTAGTCAA AAATCTTAAT
CTTTCATTTA GAGAATCTCA TGAAACTACT GGACAAATAG TCAAGTTAGC AGAGCAAAAC
CACTGTAAAC TACATGAATT AACTCTAGAA CAAATGAAAA CGATCATCCC TTCTATAACT
GAAGACGTCT TTTCAATATT ATCGGTAAAA AACTCAGTAG ACAGTAGAAC GAGCTATGGA
GGAACTGCTC CTGCAAATGT AATCGAAGCA ATAAAAAGAG GAAAGTTATA TCTCAGCAAT
ATTACTACTT TACATTCAGA AAACAATATG TAA
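
The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text with the spaces and line breaks used above; `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the full sequence to compare against the reported GC content.
print(gc_content("ATGAAAAATC CTTTATGGGG"))
```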
 
Protein sequence
MKNPLWGGRF TVSPSDIMKK INESISFDKI LYEEDISGSI AHCKMLVNQK IISKYEGQLI 
IHGLEVIQNQ ISSGTFEFST DLEDIHMNIE HHLKKMIGNI AGKLHTARSR NDQVATDFKL
WIRKSIVKLE TLLHELQQTI LNIAEANYDT IMPGFTHLQI AQPVTLGHHL MAYFEMLKRD
CSRWQDLHKR MNQCPAGSAA LAGTSFPIDR HFIAQELKFD SPTENSIDAV SDRDYVIEFL
SNASICIMHL SRLAEEIILW CSYNFKFITL SDNITTGSSI MPQKKNPDAA ELIRGKTGRI
FASLNQILVV MKGLPLAYSK DMQEDKEPVF DAANNLMLCI EAMNSMLNNI TINKSNMLKA
AEHDYSTATD LADWLVKNLN LSFRESHETT GQIVKLAEQN HCKLHELTLE QMKTIIPSIT
EDVFSILSVK NSVDSRTSYG GTAPANVIEA IKRGKLYLSN ITTLHSENNM
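
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may initiate translation), and it stops at the first stop codon. The function name `translate` and the table construction are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments are
# identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence (ten codons).
print(translate("ATGAAAAATC CTTTATGGGG AGGAAGGTTT"))  # MKNPLWGGRF
```

Run on the full gene sequence, the same function should give a string that can be compared with the protein sequence after its spaces and line breaks are removed.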