Gene ECH_0189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0189 
Symbol 
ID3927670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp181597 
End bp182637 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content34% 
IMG OID637901313 
Productputative iron-binding protein 
Protein accessionYP_507013 
Protein GI88658650 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.120226 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTAA TTGCTTGTCT TGGTATTATA GCTGTTGTTA TCCTAGCCTT TAGTTTTTTT 
ACTAAAAAGC AGCAGGTTCA AGATTTAACA CAAGAAGTAC GAGTATATTC ATCTCGCAAG
GAAGAATTAT TACATAGTTT GTTTAAACAA TTTACTAAAG AAACTGGTAT AAATGTTAAA
TACATCAATG ACGAAGCCGC TCAACTTATT AATAGAATGG AAAATGAGGG TACTGCTACT
TCAGCTGATG TATTTTTAAC TGCAGATGCT GTTAATCTTA TTCTTGCTAA AAAGAAAGGA
TTGTTGCAAC CTGTTCAATC TGAAGTGTTG AATCAAGCAA TTCCTAGTAA GTATAGAGAT
AGTGAGGGGT TTTGGTTTGG GTTAACTAAG CGTGCAAGGG TGATAGTATA TAACAAAGAT
TTAGTTGAAA AGAGTGACTT AAGTACATAT GAGCACCTTG CAAATACAAA ATGGAAAGAT
AAAATTTTAG TAAGATCTTC TAGCAGTCCA TATAACCAGT CTTTAATTGC TTTTATGATA
GCAAATAATG GTATAGAAAA TACTAAGATT TGGGTTAAAG GTTTAGTTTC AAATATGGCT
AGGAAGCCTA GTGGTGGGGA TATAGATCAA ATTTATGCTG TTGCAGCAGA TGAAGGTAGT
ATAGCTATAG TTAATAGTTA TTATTTTGGT AGGATTGCAG CTTCTGATAA GAAGAGTGAT
CAGATTGCAG TTAAAAAACT TGGTATCTTT TTCCCTAATC AGGAAACCAC AGGTACTATG
ATTAACATTA GTGGTGGTGC TGTAACAAAG AATGCAAAGA ATAAGCAGAA TGCTATAAGA
TTGTTAGAGT TTTTAACTAG CGTGAAAGCA CAAAAGGTCT ATGCTCAAGT TAATCAAGAA
TATCCTGTTG TAGAAGGGGT AGAGCTCTCA GAGATTTTAG GGACTTTTGG TTCATTTAAG
GAGAGCAATT TGCCTTTACA AGAATTAGAG AAACATTTGA CTGAAGCTGT TAAAATGGCA
GATGAGTGTG GGTGGAGATA G
 
Protein sequence
MRLIACLGII AVVILAFSFF TKKQQVQDLT QEVRVYSSRK EELLHSLFKQ FTKETGINVK 
YINDEAAQLI NRMENEGTAT SADVFLTADA VNLILAKKKG LLQPVQSEVL NQAIPSKYRD
SEGFWFGLTK RARVIVYNKD LVEKSDLSTY EHLANTKWKD KILVRSSSSP YNQSLIAFMI
ANNGIENTKI WVKGLVSNMA RKPSGGDIDQ IYAVAADEGS IAIVNSYYFG RIAASDKKSD
QIAVKKLGIF FPNQETTGTM INISGGAVTK NAKNKQNAIR LLEFLTSVKA QKVYAQVNQE
YPVVEGVELS EILGTFGSFK ESNLPLQELE KHLTEAVKMA DECGWR