Gene ECH_0888 details

Gene Information

Locus tag: ECH_0888
Symbol:
ID: 3927343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 912875
End bp: 914020
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 35%
IMG OID: 637902005
Product: hypothetical protein
Protein accession: YP_507683
Protein GI: 88658233
COG category:
COG ID:
TIGRFAM ID:
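
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(912875, 914020)
print(length)                  # 1146
print(protein_length(length))  # 381
```

Both values match the Gene Length and Protein Length fields reported for ECH_0888.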


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.125506
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTACTAT TAAAGCACAC TTTTGTGTTG TACACCACTC AGTGGCTATT TCATCTACTT 
TTTAAGTTAA CATGGTACAG GGCTATGATA GTAGGACAAA ACAATCAAGG GGGTTCTTGC
GCAGGATCAG ATGAATATCA ACCACTAAAC ACAGATCCAC TTCCAAATGA CGATACATCA
ACAGTAGAAT ATAATGAGTT TTCCCCTCTA TTAAGGTCAG AAGAAGATGA AACACCAGAT
AAGGCAAATG ATGAAATACT GAATAAGATA GATTTTGATA GATATTTTGT AATTTTTTCT
TTCATTGGGT TATTAGCAGA AGCAGCGTCT TCTATATTCA ATTTAGTATC AACCCAAGTT
TTTATTCCTA CTAGCACTAA ACATGCAGTA GCTACTGCTT TTTATGCTCT CTGTATACTA
ATTGCAATTT CCATGATTGT AAGTTCAATA CTTGCAATAA AGAAATCACT CAACCAAAAA
AAGCATCTTG ACGATATGCC AACAGATGCA TCAAATGAAG AATGTGTAGA AGAGAACGCC
AAATATAAAA AATTAAAAAA AATACAGGCT CATGCTCAAG TTTCTGAAAA TGCTCTTACT
ATCATTTCAC AGGTGATGTG GCTTATTGTT TATATTGCAT CACTAGTAAT GATATCTATG
GGTGACAACC AAATACTTGA AAACATGAGC CTGTTTTTAT CGATTACTGC ATCTCTTTTA
GGTATTATAT CTTGTGTTAT AAGGTTAATA GATGCAAATA TATCACGTAA GACATCTGGT
TCTGAGGAAG AAAAAAAACA ACACCTTAGT TTCACAATTT TTTGCGGTAT CATCTTAGCT
TTTGAGATAA TTCATTGTGC ATGCCATATA TCAGAAGCAA TATCTCTTGG TGGAAAAATG
CACAATCTTT ATGACTTTCA GAATATTCCT ATACTCTGTT TCGAACTGAT AACAGTAGCT
ATGTTTATTG CATCATTCTT CATAGAACAG TGCATTAAAA GTAAAGGAGG AAAGCACCAG
ACCAATGATG ATGGTGTTGC CGCTGCCGCT TGTTGCGGTG ATAATCTCCA TCCTAGTAGC
TTATTAGCTG ATGATAGTGG TGGTAATATT GCCAGACTTA TAGTAGCACA AGAACTATCA
GCTTAA
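
The gene sequence is displayed in space-separated blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first displayed line is used here for brevity. The 35% figure in the record refers to the full gene, not to this fragment.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGTACTAT TAAAGCACAC TTTTGTGTTG TACACCACTC AGTGGCTATT TCATCTACTT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_content(seq):.0%}")   # GC fraction of this fragment only
```

Passing the whole gene sequence through `clean_sequence` first should give a string of 1146 bases, matching the Gene Length field.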
 
Protein sequence
MVLLKHTFVL YTTQWLFHLL FKLTWYRAMI VGQNNQGGSC AGSDEYQPLN TDPLPNDDTS 
TVEYNEFSPL LRSEEDETPD KANDEILNKI DFDRYFVIFS FIGLLAEAAS SIFNLVSTQV
FIPTSTKHAV ATAFYALCIL IAISMIVSSI LAIKKSLNQK KHLDDMPTDA SNEECVEENA
KYKKLKKIQA HAQVSENALT IISQVMWLIV YIASLVMISM GDNQILENMS LFLSITASLL
GIISCVIRLI DANISRKTSG SEEEKKQHLS FTIFCGIILA FEIIHCACHI SEAISLGGKM
HNLYDFQNIP ILCFELITVA MFIASFFIEQ CIKSKGGKHQ TNDDGVAAAA CCGDNLHPSS
LLADDSGGNI ARLIVAQELS A
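
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand, assuming the amino acid assignments of translation table 11, which are the same as those of the standard code (the two differ only in their permitted start codons). It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Codons containing characters other than
    A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGTACTAT TAAAGCACAC TTTTGTGTTG"))  # MVLLKHTFVL
```

The result matches the first 10 residues of the protein sequence above. The final six bases of the gene, GCTTAA, translate to a single A followed by the stop codon, which agrees with the last residue shown.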