Gene ECH_0188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0188 
Symbol 
ID3927614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp178181 
End bp181222 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content32% 
IMG OID637901312 
Productputative surface protein 
Protein accessionYP_507012 
Protein GI88658025 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGTTT TTTTCGAATA TAATGGCAGT AAAGTTAACA CTAAGGAAGT AGTGAAACGC 
AATACTGCAG GTAGAACAGA TCAAGATAAG GACTCTTCTA CCTATGTATA TAATACTTAT
AACGCTGGTG ATCTACAAGG GCATGTACAA CAGTCCTTTA TGAGAAATAA TGCTGAGCAC
AATAGAACAG ATACTGTTCT TCCTTATAAT GATAGTTATG AGACTCTACC TGTTGTGCCT
GCTGTAGAAA CTGGTGCTGA TGGTAATTCA GGTAAGGGTA ATTCTGTAGA TTTTAAATCT
AATGTTTTTT GTAATACGCA TACTATAAGA GACGATCAGT TGAAATTAAA TCTAGAAATA
CACACTTTAG GAAATTTATC TGATAGAAGA TTACAGGAGA TTAAGAAAGG ATTTGATGAT
TCAATAATAA GATTTAAAGA TAATTTTGGG TTGGAACCCA ATGAAAAAGA TACTACTTTT
GAATTATATC TTTTTGATGA TAAAGAACAA TATGAGCATT ATGGACGGCT TTATAATCTA
GGAATAAATG GAGCTGGTGG TATGACTTTT TATGGAGATG CTGATGTTCC TTATAAAATT
TATGTATATC AGTTTGGTGA AATATTAAAT TTAAAACATG AGTTAACTCA TGCTTTAGAG
AATTATGCGT CTGGACACTC ATTGTCTAAG CTTAAAATAA ATCATGATAT ATTTACAGAA
GGGTTGGCTG ACTATATACA AAATGATGGT ACTTTTATTA TGAGAGAATT AAGAGATAAA
GAAGTAACAT CAAGTGTATT GAAGGAGGGT TCTTCTAAAG ATATAGATCA AGTCAGTGAT
GCTGCAGTAG CTAAGGATCA GCATTTGAGT TATAGTATAG GACACGCATT TGTAACATTT
TTACAGGAGA ATTATCCTGG TGTGATTTCA GAGTACTTTG CAGCATTGAG GGAAGGTAAT
GCTGTGCATG CTAGAGAAAT AATTAGCATG AATAAGTATG CAGATTTTGA ACCGTGGGTA
CAGTCTAAAG ACATTTCTTT GTATCTAGAA GGCATGAATG TATTAAAGAT AGATTTAGGA
GAAAAAATGT TTTCTGCTAA AAACGCTGTT TCTTTTGAAA ATAAGAATGT AAGAAATGAA
TATTACTGTG AAAACATTTG TACTATGAAC GGTGAAGTAG TAGGGAAAAT ATCTCCTGTG
GTGCATTATG CCGATAAGGA TACTATTCGT ACTTGGAATA TTGCGAGTAC TGATATGATA
GAAGTAAAAC CAGAGTATAG TTTTCTAAAA TTGGTTAGTA CTCCATCTGG TAAGTCTGCA
TATGTATATT GTGATAAGGA TGGTAATGAG TATTTTAATA CTCAATCTTA TGTGGAATAT
GCGTTTAATA TCTTGAAAAA ATATGATGAG AATCTTCGTA TCAATGGTGA TTTTTTAGAT
ATTAGAGGTC GTTACTCAGA TGCTGATAAA GTGTTTGATA AAATTCCTAA CGCAGATTTG
TTGTTGGATC AGTTCTTAGA TAAAATTGGT TATGGAAATT ATAAGCAAGT AATAATGAGC
GACCCAGAGC AGGTTAGTGT TATAAAAATG CATATAGTAA AAAAAGCTTT TGAAGATTTT
AGAGAATCTG AAGTCAAAAA AGTGTTTACT GGTGAATCAG GTGTTGATTC TACAATAAAA
AATCTATTGA TGGATTTAAC TTATATTAAT TTAAGTGATG TGATAGGAGT AAATGGTTCT
AATATTGAAA GCATTGTATC TGATCCAAAT GTAATGTTGC GTACTGCTAT ATTAGGTAAG
GGAGATGCAA GTGGAATATC TCTGTATGTA GGTGATCAAA AAGTCGCTGA ATTATCTACT
GAAGGAGGTT ATTGTGTGAA GGATCTTGAT ACTAATAATG TGAATTTTGT ATTCCGCAAT
GCTGTTGGAA ACATAGCAAG TAGTTATCAA GATAGAGCAT ACATGGTTGT ATGTGAAAAA
GATGGTGAAT TTACTACTGC TCTAATTGAT GATATACAAA AGACAGAGCA TGGAAATGTT
ATATGGGATA ATCAGTTTAA TCATCCTGGT GTTAATCATT TATATCCAAA TTATCAAAAG
GTATTATTAA ATGATGCTTC ACTTAAGGAT TATTCCCATC TTGCTAATAC AAGGTTTCAT
CATGATGATA CAGTAATTGT ACGGGGAGAT CTGTTGGATG ACAAAGGAAC TGTTACAACG
AGTGATGACA TTCATCAAGC AGTGATTAAA CATGATGATC AAGTACTACA TCAATTTAAA
AGTATTTCTT TTTACATAAG TGAGCCATCA ACAGATAGTG CTGGAACTTA TGGTAGTGAT
TTCTTTATTG CTGATGAAGG GAAAAATCTT AGATTTCAAC TTCCTAAAAC AATTACTCAT
TTAAAGTTAG TAAATGTTGA TGGGAATCAA AAATTAGTAC CATGTACTGC AGATGGTAAT
GAGCACCCTG ATGGTATGCC ATCTAATTTA ACAGATGAGT ATCGATATAT TGATCCTATT
TTTGCTCACA CATTTGAGAA ACAAAGCTAT TCTAAAAATA GTGTTAGCAT TGGTTTGGTA
GATTTTGATA AATATCAAGA AGGAACTATG TTTAAATTAC AGTATTATTC TGATGATTAT
CATATTAATA AGGATGAACA TGGTAATATA ATTAGACCTA ATAATGTGTC TTATAAAACA
AAAGTTGACC TGGTATATGA TGATAAAGTT ATTGGAATGT TATCTGATAA TGTAAATAAA
TTTCAGGGAG ATGTTTTTGT TGCTGCAAGC CTTAATTATA GCCACAGTGA TTTTCTTTCG
TCTAAATATT TTCAGAAGGT CAATATTGAA GCGTTAGAAA ATGGGGTATA TAGTGGAAGA
TATGATGTAG GAAGTGGTGA TGAAATAGCC AATCTTGATA CTGATGTAGG TTATAGTGAC
AAAACTGTTT TTTATTTTAA AGGAAGTAAT TCACCTGTTG ATGTATTAGA TAATGTTGAT
ACTGTGTCTA CTATTTCACC TTATATTAAT GAGTTTCAAT AG
 
Protein sequence
MTVFFEYNGS KVNTKEVVKR NTAGRTDQDK DSSTYVYNTY NAGDLQGHVQ QSFMRNNAEH 
NRTDTVLPYN DSYETLPVVP AVETGADGNS GKGNSVDFKS NVFCNTHTIR DDQLKLNLEI
HTLGNLSDRR LQEIKKGFDD SIIRFKDNFG LEPNEKDTTF ELYLFDDKEQ YEHYGRLYNL
GINGAGGMTF YGDADVPYKI YVYQFGEILN LKHELTHALE NYASGHSLSK LKINHDIFTE
GLADYIQNDG TFIMRELRDK EVTSSVLKEG SSKDIDQVSD AAVAKDQHLS YSIGHAFVTF
LQENYPGVIS EYFAALREGN AVHAREIISM NKYADFEPWV QSKDISLYLE GMNVLKIDLG
EKMFSAKNAV SFENKNVRNE YYCENICTMN GEVVGKISPV VHYADKDTIR TWNIASTDMI
EVKPEYSFLK LVSTPSGKSA YVYCDKDGNE YFNTQSYVEY AFNILKKYDE NLRINGDFLD
IRGRYSDADK VFDKIPNADL LLDQFLDKIG YGNYKQVIMS DPEQVSVIKM HIVKKAFEDF
RESEVKKVFT GESGVDSTIK NLLMDLTYIN LSDVIGVNGS NIESIVSDPN VMLRTAILGK
GDASGISLYV GDQKVAELST EGGYCVKDLD TNNVNFVFRN AVGNIASSYQ DRAYMVVCEK
DGEFTTALID DIQKTEHGNV IWDNQFNHPG VNHLYPNYQK VLLNDASLKD YSHLANTRFH
HDDTVIVRGD LLDDKGTVTT SDDIHQAVIK HDDQVLHQFK SISFYISEPS TDSAGTYGSD
FFIADEGKNL RFQLPKTITH LKLVNVDGNQ KLVPCTADGN EHPDGMPSNL TDEYRYIDPI
FAHTFEKQSY SKNSVSIGLV DFDKYQEGTM FKLQYYSDDY HINKDEHGNI IRPNNVSYKT
KVDLVYDDKV IGMLSDNVNK FQGDVFVAAS LNYSHSDFLS SKYFQKVNIE ALENGVYSGR
YDVGSGDEIA NLDTDVGYSD KTVFYFKGSN SPVDVLDNVD TVSTISPYIN EFQ