Gene ECH_0543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0543 
SymbolobgE 
ID3926949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp545444 
End bp546466 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content30% 
IMG OID637901665 
ProductGTPase ObgE 
Protein accessionYP_507356 
Protein GI88658589 
COG category[R] General function prediction only 
COG ID[COG0536] Predicted GTPase 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02729] Obg family GTPase CgtA 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTTA TCGATGAAGC AAAAGTCTAC TTAAAAGCAG GAAATGGTGG AAACGGCTGT 
AGTAGTTTCC GTCGTGAAAA ATTTATTGAA TTCGGAGGAC CAGATGGCGG TAATGGCGGT
AATGGTGGCA ATATTGTTTT TGCTACAAGC AATCACATTA ACACATTACT ATATTTTAGA
TATAAACAGC ATATAAAAGC AGAAAATGGA AATCCAGGTT CTGGCAAAAA GAAATCTGGA
TCATCTGGAA AAGATATTAT TATAAAAGTT CCTATAGGCA CTCAATTATA TGATGAAGAT
GGAATATTAA TTGCTGACCT TAGTTCAGAA AATCAAAAGG TTATAGTAGC ACAAGGGGGT
AAAGGTGGAA CAGGTAATGC TAACTATAAA ACATCTACTA ATAGAGCACC TCGGTATTTT
ACCCTCGGAG AAGCAGGAGA AGAAAAATAT ATCACATTAA AATTAAAAAT TATTTCAGAT
ATAGGAATCA TAGGATTGCC TAATGCAGGA AAATCCAGTT TTTTAGCTTC ATGTACTGAC
TCAAAAACAA AAATTGCTGA TTATCCTTTT ACAACCTTAG AACCACATTT AGGAGTAGCT
TTTATTGACA ATAGAGAATT AGTATTAGCT GATATACCAG GATTAATTGC TGGAGCACAT
TTAGGATATG GAATAGGTGA CAAATTTCTA AAACATATTG AAAGATGTTC TACATTACTA
CACATCATAG ACTGCACTCT AGATGATATT ATTGATTCAT ATGAATGTAT TAGAAAAGAA
TTATTACTTT ATAATAAAGA GCTAATTAAT AAACCAGAGT TCATAGTATT AAATAAAAGC
GATTTACTAG AGAAAAAAGA AATTACTAAA AAAAAGCAAT TATTATCACA GTATACAAAA
AAAGAGATAT TCGTATCATC GATAAAAGAT AATCGATATG CTATTTTATC TACTTTAATT
CAATATATAC ATAAAAAAAA TGCCAATGCC GAACCATATA TATATGATCC ATTTAATATA
TAA
 
Protein sequence
MSFIDEAKVY LKAGNGGNGC SSFRREKFIE FGGPDGGNGG NGGNIVFATS NHINTLLYFR 
YKQHIKAENG NPGSGKKKSG SSGKDIIIKV PIGTQLYDED GILIADLSSE NQKVIVAQGG
KGGTGNANYK TSTNRAPRYF TLGEAGEEKY ITLKLKIISD IGIIGLPNAG KSSFLASCTD
SKTKIADYPF TTLEPHLGVA FIDNRELVLA DIPGLIAGAH LGYGIGDKFL KHIERCSTLL
HIIDCTLDDI IDSYECIRKE LLLYNKELIN KPEFIVLNKS DLLEKKEITK KKQLLSQYTK
KEIFVSSIKD NRYAILSTLI QYIHKKNANA EPYIYDPFNI