Gene ECH_0523 details

Gene Information

Locus tag: ECH_0523
Symbol:
ID: 3927260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 522829
End bp: 524298
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 30%
IMG OID: 637901646
Product: hypothetical protein
Protein accession: YP_507338
Protein GI: 88658285
COG category:
COG ID:
TIGRFAM ID:
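
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive and the final codon is taken to be a stop codon. Both of those are assumptions about this record's conventions, and the function names below are illustrative only. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(522829, 524298))  # 1470
print(protein_length(1470))         # 489
```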


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0189949
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATC TTTTTGTGAT TTCTGCTTTT ACTTCTCTGT TAATGATGTC TTCCTATAAT 
GCTTTTTCTG ATGAAATATT AGATGGTATT TTTGGAAGTA ATAATAAATT TATAAATAAT
ACTAAGAATT CATTTTCAGG AATTAATAAT AAGGCGTTGG TAAAAGGTGG ACGTATTAAA
TTTGCTGGTG ATATGATATC TTATACTTGG TATTCAAGTG ATGATACTAG GAATTCGAAT
AAATTTAGTA GGGTTAGTAA GCGTTTTGAT ATAGATACGG GAGGGAATAT TAATAATGTT
GGTGCTAAGC ATGATGGTAT GTTTAGTATT GAAATTGATT CAAATCCGGA TAAACATGGA
ATTGTATATG GAGCATATTC TCAAATAAAT ATCCCACATG TCGCAGGAAA AAGTTTTGGT
AATAATGCTG CATTTAATAG AGGATCAAAA ATATTTGCTA AAACTCCTTA TGGTAACTTT
TCAGTTGGAT ATCAGGAAGG TGTAGAGTCC ATGATGAAAT TAAATGCTTT TAGTATAGTT
GCTGGTGATG ATTCTAATAT ATGGACAAAG CATTTAAGGA ATATTCTGCA TGAAAAAAAA
GATGGTCAAA GTGGTTATTC AGTATATTAT TTCAATTTTA ACTCTGGGTT ATATAGTGAA
AGTTTATTTC GTAATAGTGA CAATATTGTT TTTGATGACA TAGACTATTA TTTAGGAACT
GGTATTATTT CTAGGAGTTT TATTAATAAT TTGCCATTTA GGTTATCTTA TCAATCACAG
AATTTTATGG GATTAAGGTT TGGAGTAAGT TACTCTCCTT TTGGGTATGA TCAGAGATTG
TTTGAATTAC AAAAAGATCG AAACAGTGAT ACTTTAATTT TGGTTGGGCC AAGGTATAGA
CATATTGTTA GTGGAGGTAT TTCTTATACT TATAATATTA AAAATTTAAA ATTTAGTGCT
TCCGTGATAG GTGAATATGG TGATGAAGAG CATGATTATA AAGCGCACTA TAATAGATAT
TATAGGCATA ATACATTAAA GGCAGTATCC ATTGGTTGGA ATGTTGGTTA CGATAAAATA
GAATTAGCAG GGTCTTATGG AAAATTGAAT AGTGCTGGAA TTCCTTATGA TAAGTGTGTT
ATACATGGAG TTCCTTATGA GTATGTATAT AGGAGTGTTA TTCACTGGCT GTATTTGAAA
GACATGGATT ACTATTGGGA TATAGGTATT GCTTATAAGT ATGCACCTTT AAGTCTAAGT
GTTATTTACT TTATGAGTAA TAGGGTTGGT AATGAATTAA GTGATGTAAA TGTAGGCATT
GAATATGATA TTTTAAAATA TAGCGGTTTT AAGAGTAGTT TGTTTGCTAA TTACAATTAT
TATACTTTTC GCCAATTTAG TGACACTTAT AGGATTCATG TGAATGGTAA GGGCAGTATA
TTGTTAGTGG GTGCAAAGTT AAGTTTTTAA
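
The gene sequence is listed in blocks of ten bases, so the spaces and line breaks have to be removed before the sequence is measured or analyzed. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the listing is used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied with its block spacing.
first_line = "ATGAAAAATC TTTTTGTGAT TTCTGCTTTT ACTTCTCTGT TAATGATGTC TTCCTATAAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_fraction(seq))
```

Passing the full listing to clean_sequence should give a string whose length matches the Gene Length field.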
 
Protein sequence
MKNLFVISAF TSLLMMSSYN AFSDEILDGI FGSNNKFINN TKNSFSGINN KALVKGGRIK 
FAGDMISYTW YSSDDTRNSN KFSRVSKRFD IDTGGNINNV GAKHDGMFSI EIDSNPDKHG
IVYGAYSQIN IPHVAGKSFG NNAAFNRGSK IFAKTPYGNF SVGYQEGVES MMKLNAFSIV
AGDDSNIWTK HLRNILHEKK DGQSGYSVYY FNFNSGLYSE SLFRNSDNIV FDDIDYYLGT
GIISRSFINN LPFRLSYQSQ NFMGLRFGVS YSPFGYDQRL FELQKDRNSD TLILVGPRYR
HIVSGGISYT YNIKNLKFSA SVIGEYGDEE HDYKAHYNRY YRHNTLKAVS IGWNVGYDKI
ELAGSYGKLN SAGIPYDKCV IHGVPYEYVY RSVIHWLYLK DMDYYWDIGI AYKYAPLSLS
VIYFMSNRVG NELSDVNVGI EYDILKYSGF KSSLFANYNY YTFRQFSDTY RIHVNGKGSI
LLVGAKLSF
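
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons, so a plain standard-code lookup is enough for a gene that begins with ATG, as this one does. The sketch below is a minimal illustration with hypothetical names; it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop block spacing and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listing.
print(translate("ATGAAAAATC TTTTTGTGAT TTCTGCTTTT"))  # MKNLFVISAF
```

Applied to the last 30 bases of the gene (TTGTTAGTGG GTGCAAAGTT AAGTTTTTAA), the same function yields LLVGAKLSF, the final nine residues of the protein listing, with TAA read as the stop codon.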