Gene ECH_0910 details

Gene Information

Locus tag: ECH_0910
Symbol:
ID: 3926966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 935108
End bp: 936247
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 30%
IMG OID: 637902027
Product: hypothetical protein
Protein accession: YP_507702
Protein GI: 88657814
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
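
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 935108
end_bp = 936247
gene_length_bp = 1140
protein_length_aa = 379

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under these assumptions the span equals the listed gene length, and the codon count minus one equals the listed protein length.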


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCATGTGC ATAGACTAAT GAGCGATCAA TTTTGCAAGC TAATTAATAG GTATGTAACA 
CGTAGTGTTA TTGTACTAGC AATTGTTTTG GTGATGTTTG TTATTAAACC TGTACTTGCT
CCGTGTTGTA CTGCTATGAT AATGGCTTAT CTTCTTAATC CATTGGTGGA TAAGTTACAA
AGATTTAAAC TATCAAGGCA ACTATCTGTT GCTATAATTT TGCTATCTTC GTTGTGTGTA
ATTATAGCAT TTTTGGTCAG TTTTATTCCC CTTGCTTATT CTCAGTTGTT ATCACTTATA
AAGTTTCTTA TGGAAAAAGT GCCATTGATT CATAAAGACA GTATCATTTC TCTCCTTCAG
AAATATAATA TGATTGATTA TGAAGAAGTA TCGGATGCAA TAAAGTTGCC GCAAGCGTCT
TTAAAAAGTT TGTTGCATTA CGAAAACATT AAACCTCTTG TAAGTATTTT TGGAAATTTT
CTAAAAAATT TAGATGGTAT ATTGTTTAGT GCAATAAATT CTAGTATAAG CATTAGTTAT
ACAATTTCTA TAATATTAAT AACTCCTCTA TTATTATTTT ATATATTATG TAACTGGCCA
TCGATTGTTG AATCTGCTGA TGCACTAGTT CCTGTAAAAT ATCAAAGTAT TGCTAGATTA
TATACAAAAA AAATAGACCA AGTAATTTCA GCTTATATTA GAGGCCAATT AAGTGTATGT
TTTATCATGG CTGTATACTA CATTATATGT TTTAGTTTGG TGAAGTTAAA GTATTTTTTA
ATTATAGGTT TTGTATCGGG AATCATGACT TTTATTCCAT ATATAGGACC TATTTCATGT
GCAATATTGA GTTCCATTAC AACAATGTTA CAGTTTAATG ATTGGACGAT GTGTGGAGTG
GTAGTGACAA TGTTCATTGT TGGACAGTTA GTTGAGTCGA ATATTATTAC TCCATTATTA
ATAGGAAAAC GGGTAGATAT ACATCCTATA TGGATAATTA TTGGAATGAT AACATGTGGA
TCACAAATTG GATTTACAGG GGTATTATTG TCAATTCCTA TAACAGCAAT AGTTGGTGTA
TTTGTAAGAG CACTTATAGC CCACTATATG GGTAGTAAAT TTTATAATAA TGCTGATTGA
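
The GC content reported for this gene can, in principle, be recomputed from the gene sequence. The following is a minimal sketch; the function name `gc_fraction` is hypothetical, and it assumes the sequence is pasted as shown here, with bases grouped in blocks of ten separated by spaces and line breaks. Only the first line of the sequence is used as sample input.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence; paste all lines to cover the whole gene.
first_line = "ATGCATGTGC ATAGACTAAT GAGCGATCAA TTTTGCAAGC TAATTAATAG GTATGTAACA"
print(f"GC content of the first 60 bases: {gc_fraction(first_line):.0%}")
```

A single 60-base line need not match the whole-gene figure, since GC content varies along the sequence.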
 
Protein sequence
MHVHRLMSDQ FCKLINRYVT RSVIVLAIVL VMFVIKPVLA PCCTAMIMAY LLNPLVDKLQ 
RFKLSRQLSV AIILLSSLCV IIAFLVSFIP LAYSQLLSLI KFLMEKVPLI HKDSIISLLQ
KYNMIDYEEV SDAIKLPQAS LKSLLHYENI KPLVSIFGNF LKNLDGILFS AINSSISISY
TISIILITPL LLFYILCNWP SIVESADALV PVKYQSIARL YTKKIDQVIS AYIRGQLSVC
FIMAVYYIIC FSLVKLKYFL IIGFVSGIMT FIPYIGPISC AILSSITTML QFNDWTMCGV
VVTMFIVGQL VESNIITPLL IGKRVDIHPI WIIIGMITCG SQIGFTGVLL SIPITAIVGV
FVRALIAHYM GSKFYNNAD
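
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the method used to generate this record: it assumes that translation table 11 assigns the same amino acids to sense codons as the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical. It translates the first 30 bases of the gene sequence.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG order.
# Assumption: table 11 matches the standard code for sense codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
gene_start = "ATGCATGTGC ATAGACTAAT GAGCGATCAA"
print(translate(gene_start))  # prints MHVHRLMSDQ
```

The output matches the first ten residues of the protein sequence. Pasting the full gene sequence into `translate` extends the same comparison to the whole protein.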