Gene ECH_0121 details


Gene Information

Locus tag: ECH_0121
Symbol:
ID: 3927246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 107054
End bp: 108160
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 34%
IMG OID: 637901245
Product: hypothetical protein
Protein accession: YP_506949
Protein GI: 88658059
COG category:
COG ID:
TIGRFAM ID:
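
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(107054, 108160)
print(length)                  # 1107
print(protein_length(length))  # 368
```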


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACATA CTGCAGTACC TGGAATGGTA GCTCCAACAA GCGTTATTTC TGCTAAGCAT 
GTTGTAATTA AAGGACTTGT TTACAAGCAT GTGAAGCATT ATTCGATAGA GGAATATAAA
TCTCAAATAA AAGAGTTTAG GGAATCTATA ACGTGTTTTG CAAGAATGCA TATGTCCTAT
ATGTATCATA TGCTGCATAA TACGTTCGTT GTAAGGAATG GAAGGATTAT GTTGAAGTCT
GAAATTGAAC AGTGTCTATC AAAAATAACC AGTAATATAA GGCTGTGTGC CTTTGTGATT
AAGATAGGAA TAGTAGACCA CGTTATGAGT AGGCTTTGCA GGTTTTATGG TTCTGACAGC
ATAAAGTATT GTGCAAGTCA TTACCATGAT CCAAGGGTTA TAGATTCGAT ACTTATTGGG
TTATATGGTG CGTCATATTC TGATTTTTCA AGGATGTCAT ATCAAGTACG TAGTAATATA
GTTTATTGTG TTGGAAAACA TGGTATTGCA GGTGTTTTTA AGCTACATAA TAGTGGTTTT
TACTCAGAAT TATTAGGTAT GTGTTATGAT TTTGTTCATG CAAGGGGTAA GGGTGTAAAA
TTGCAAGAAT TATGTGATTT TATGAAGTTG TCTTGTAGTA TACAACTTGG GCAAATGTAT
CACATGATGG TAAAAGTCAA ATGTTCTATT GGAGATGAGC AAAGTGATAT ACGGAAACTT
GTATCCCAAG AATGTAGTGT AGGGTATCTA GTATATCGTT CTTTACTTTT TGGTAGGTAT
GCTTATCATG TAAGAAAAGC GTTTAGGCAT TTATATGCTC CAAGTGATAA AAACCCTGTA
CGTACAGTAT CTGGGTTAAA CATTCCGCAT AGTCTAATTC GACTAAATCA TAGAGGAATT
TTTACAAAAA TTGAACATTG TATAAACGCA GAAAAAATGA GTTTTAATGT TTTTGTTGTT
GATATAGTGC GTCATATTGA CAAGCTATTA TTGCATCCGC GTGAAGAAGT TTATATAAGA
GAAGATATAA GTACATATTG CGCTATAGTG AGTAGTAGAT ATAGTACTAT GGGGCCTGAC
ATAGATTCTT CTTATCATAT ATTGTAG
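
The GC content field can be recomputed from a sequence formatted like the one above. The following is a minimal sketch, assuming the sequence is given as plain text in space-separated blocks of 10 bases and contains only A, C, G and T. The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: one G and two C out of 10 bases.
print(round(gc_content("ATGCAACATA"), 2))  # 0.3
```

Applying the same function to the full gene sequence and multiplying by 100 gives a percentage comparable to the GC content field.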
 
Protein sequence
MQHTAVPGMV APTSVISAKH VVIKGLVYKH VKHYSIEEYK SQIKEFRESI TCFARMHMSY 
MYHMLHNTFV VRNGRIMLKS EIEQCLSKIT SNIRLCAFVI KIGIVDHVMS RLCRFYGSDS
IKYCASHYHD PRVIDSILIG LYGASYSDFS RMSYQVRSNI VYCVGKHGIA GVFKLHNSGF
YSELLGMCYD FVHARGKGVK LQELCDFMKL SCSIQLGQMY HMMVKVKCSI GDEQSDIRKL
VSQECSVGYL VYRSLLFGRY AYHVRKAFRH LYAPSDKNPV RTVSGLNIPH SLIRLNHRGI
FTKIEHCINA EKMSFNVFVV DIVRHIDKLL LHPREEVYIR EDISTYCAIV SSRYSTMGPD
IDSSYHIL
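
The protein sequence follows from the gene sequence by codon translation. The sketch below is one way to reproduce it, assuming the gene sequence is in frame from its first base and that translation stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons), so a standard codon table is used here. The function name `translate` is illustrative.

```python
from itertools import product

# Codon table built in TCAG order; the amino acid assignments are those
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCAACATA CTGCAGTACC TGGAATGGTA"))  # MQHTAVPGMV
```

The output matches the first 10 residues of the protein sequence listed above.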