Gene ECH_0581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0581 
Symbol 
ID3927325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp589788 
End bp590924 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content31% 
IMG OID637901703 
Productsodium:dicarboxylate symporter family protein 
Protein accessionYP_507392 
Protein GI88658620 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGAGATA TTTTTATTAA CTTACTCAAA CTAATTAGCT TACCAGTAGT ATTTTTTTCT 
ATTACATCGA CAATTTCTGG ATTGGCAAAT TTGACAGAAA TTAAAACTTT AATAAGAAAA
ACCATATTTT ATACTATATC TACTACAGTG ATTGCAGCTA CTGTAGGTCT TATCACATAC
CTATTAATTG ATCCATCAAA AAAAGAACTT ATATATAACA TATTGAGTAC CAATAAGCAT
ATTAGCAATA CTCCAGATTA TTTATCGTAT TTAATGTCTA TATTACCTTA TAACTTCATT
AAGGTATTTC TAGATAATAA TGTAATTGGG TGTGTCATAC TTGCATTCCT TATAGGAGGA
GCACTGCTAT TATTGCCTGA TAAAAATAAA CGCGAATTGC TTCATAAAGT TTTTGATGCC
CTCTTCGATA CATTTTTAGA AATAGCAAAG CTTATATTAA AGCTAATGCC CATAGCATTA
TGGTCATTTA TTACTGTACT GTTATATAAC ATGAAGGAAG GATATAACAT ATCAAATGTT
TTAAAATATC TGTTATGTAT CATGATAGCA AATTTCATAC AAGCATGTAT AATACTGCCT
CTGTTACTAA AATTAAAGAA GATTCCTGTT ATAAAAACAT TTCGCGGTGT TTTGCCAGCT
TTAACCATAG CTTTCTTTTC AAAATCTTCT ACAGCTACTC TACCAACAAC TCTTCGTTGC
ACACAAGATT ACTTAAATAT TCCAAAAAAG ATATCATCTT TTATACTGCC AGTTTGTACA
ACTATTAATA TGAATGCATG TGCTGCTTTC ATATTAATCA CAGTATTTTT TGTATCAGAA
GTTAACGGAT ACACATTTTC TATTGGTGAG ATGTTTTTGT GGGTGTTCCT AGCTACTGGA
GCAGCTATTG GTAATGCAGG GGTTCCAATG GGGTGCTACT TTATGGCTAT GAGTTATCTC
ATGTCAATGA AAGTGCCTTT GAGTATCATG GGAGTTATAT TGCCTGTATA TACAATAATA
GATATGTTTG AAACTGCAAT TAATGTATGG TCAGATGTAT GCATTACTCA GATAGTACAT
AAAGAATATG ATGCGTTGAT AAAAAAAGGT AAGAAAATTG GAATAAATGA TCAATGA
 
Protein sequence
MGDIFINLLK LISLPVVFFS ITSTISGLAN LTEIKTLIRK TIFYTISTTV IAATVGLITY 
LLIDPSKKEL IYNILSTNKH ISNTPDYLSY LMSILPYNFI KVFLDNNVIG CVILAFLIGG
ALLLLPDKNK RELLHKVFDA LFDTFLEIAK LILKLMPIAL WSFITVLLYN MKEGYNISNV
LKYLLCIMIA NFIQACIILP LLLKLKKIPV IKTFRGVLPA LTIAFFSKSS TATLPTTLRC
TQDYLNIPKK ISSFILPVCT TINMNACAAF ILITVFFVSE VNGYTFSIGE MFLWVFLATG
AAIGNAGVPM GCYFMAMSYL MSMKVPLSIM GVILPVYTII DMFETAINVW SDVCITQIVH
KEYDALIKKG KKIGINDQ