Gene ECH_1001 details

Gene Information

Locus tag: ECH_1001
Symbol:
ID: 3927752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1026178
End bp: 1027431
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 32%
IMG OID: 637902117
Product: aspartate kinase
Protein accession: YP_507788
Protein GI: 88657747
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00656] aspartate kinase, monofunctional class
            [TIGR00657] aspartate kinase
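
The start and end coordinates, the gene length, and the protein length listed above agree with each other, and a little arithmetic shows how. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in this record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp):
    """Number of residues, assuming one stop codon that is not part of the protein."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(1026178, 1027431)
print(length)                     # 1254
print(protein_length_aa(length))  # 417
```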


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAGGA TTTTAGTAAA AAAATTTGGA GGAACTTCTT TACAAGACAT AGAATGCATT 
AATAGAGTTG CAGAAATAAT AAAACAAGAT GTTAACAATA ATTATAAAGT AGTTGTAGTA
GTATCAGCTA TGGGAAAATT CACTGATAAC ATCATTTCAC AAATTAAACA AATTTCCGAT
GTCAAATCTC AATCTGAACG CTCCGAATAT GATCTAATCA TTTCTTCAGG AGAACAAATA
TCATGCGGAC TATTATCATT AGCTCTACAA AAAATAGGAA TCAATGCTCA ATCATGGTTA
GGATGGCAAC TACCAATAGT AACAACTGAA GACCATACTA AAGCAAGAAT CATAGATATT
AACACATGTT CACTACAGGA TTCACTAGCT AATAATGATG TTGCTATTGT GGCTGGATTT
CAGGGAATGC ATAAAAACAA TAGAGTAACA ACCTTAGGCA GAGGAGGTTC TGACACTTCA
GCTGTAGCAA TTGCAGCAGC ACTAAAAGTA GATTTATGCT ACATTTACAC AGATGTAGAC
GGAATATATA CAGCAGATCC TAATGTGGTA CCAAAAGCAC GCAAATTAGA TTACATTACA
TATGATGAAA TGATAGAGAT GTCTTCTCTT GGCGCTAAAG TATTACAAGT ACGTTCAGTA
GAAATAGCAA TGAAATATAA CATAAAATTG TGTATATTAT CTACTTTTAA TCCTGGAAAA
GGGACAATCT TACGCAAAAA AGGAGAATCA GATATGGAAA GTCAATTAAT TACTGGGGTT
ACATGTAATA ACAAAACAGC AAGTATTACA CTAAAAGAGG TAAAAGCAAT ATCTGGCGTT
ACCACAGTAT TTAATGCAAT AGCAGAAAAA AACATTAACG TCGATATGAT CATTCAAAGT
GTGAATGATA ACAATGCAAA TGATATCACT TTTACAATTT CAGAAGAAGA TTTGCCAACA
ACAACAAAGT TTTTAACAGA AATTCAAACT GAACTTATGT ATCAGGATTT AATAATCAAT
TCCGAAGTTG CAAAAGTTTC CATTATTGGA GTAGGCATGA TTTCTCATTC TGGAGTAGCT
TACAAAATGT TTGATACTTT AACATCTAAT AATATAAAAA TATTAGCAGT TACTACTTCA
GAGATAAAAA TCAGCGTTCT AATATCGAGA AAAGACAGCC AACTTGCAAC AATAGCATTG
CACTCTACTT TTGGACTTGA TAACACAGAA TCAGATTTAC ACATAATAAG TTAA
 
Protein sequence
MKRILVKKFG GTSLQDIECI NRVAEIIKQD VNNNYKVVVV VSAMGKFTDN IISQIKQISD 
VKSQSERSEY DLIISSGEQI SCGLLSLALQ KIGINAQSWL GWQLPIVTTE DHTKARIIDI
NTCSLQDSLA NNDVAIVAGF QGMHKNNRVT TLGRGGSDTS AVAIAAALKV DLCYIYTDVD
GIYTADPNVV PKARKLDYIT YDEMIEMSSL GAKVLQVRSV EIAMKYNIKL CILSTFNPGK
GTILRKKGES DMESQLITGV TCNNKTASIT LKEVKAISGV TTVFNAIAEK NINVDMIIQS
VNDNNANDIT FTISEEDLPT TTKFLTEIQT ELMYQDLIIN SEVAKVSIIG VGMISHSGVA
YKMFDTLTSN NIKILAVTTS EIKISVLISR KDSQLATIAL HSTFGLDNTE SDLHIIS
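
The protein sequence can be reproduced from the gene sequence by translating it codon by codon, and the GC content can be recomputed from the same string. The following is a minimal sketch, not the method used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). It also assumes the sequence is pasted with the space-separated 10-base blocks shown above. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above give the first 10 residues.
print(translate("ATGAAGAGGA TTTTAGTAAA AAAATTTGGA"))  # MKRILVKKFG
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the protein sequence and the GC content reported in this record.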