Gene ECH_0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0438 
Symbol 
ID3927239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp417083 
End bp418387 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content30% 
IMG OID637901562 
Productsodium:alanine symporter family protein 
Protein accessionYP_507256 
Protein GI88658052 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1115] Na+/alanine symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.127152 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATAT TAAATTTAGT ATTATCTTTT CCAGCAGTTT TTCTGTTGCT GTTTACTGGT 
ATTTACTTAT CAATAAAATC TAAATTCTTA CAAATTACAA AACTTCCAAG TGCGATTTCA
TTAATCACAA TGAAGAAGTA TCGTGACAAG TCTTTTTCTA TTGGTGCATT ATGCACAATA
ATAGGGGGGA ATTTAGGTGT TGGTAATATT TCAGGCACAG CAGTTGCATT AAAATCAGGA
GGCCCAGGAT TTGCTTTATG GATGATAATC ATAGTAACTC TCTGTTCAAT TATTAAATAT
GTAACTTGTT ATCTAAGTAT AGAGACACGG GTTAAAGTCA ATAAACAATA TATTGGAGGG
CCCGCTATAT ATTTTAAAAA TGCATTTAAG TCAAAAAAAG CTGTCATAGT TTTCACCATA
CTCATGCTCA TATGTTCTAT TGCAATAGGT AATTTTGTTC AAGTTAACTC TCTATCAATA
CCAATGGAAT TAATTAACAA ACCACCAATA ATAGCAGGAT TATTCATGTG TGCAATGTTC
TTTGCCGTTA CAATATTAAG GTTGGAAATT ATTACAGCAT TAATTTCAAG ACTTGTACCA
GGAATGGCTA TAGCATATAT AATACTAGCA TCTTTTGTAC TTTATAAATT TAATGACAAT
ATTATACCTT CTATAAAGTT AATGTTTGCT AATTTCCTAA CTTTTGATAG CTTTAAATCA
GGAATGATAG GGGCATTCAT ACTAGAAACA TTTCATATAA TTCAAGTAGG CACTTTTAGA
GGTATATTTG CAACAGATAT AGGATTAGGG TTAGAGGGTA TGGTTCATTC CTCTATCAAT
AGTAATAGTA AAAACTTCAA TACACATCAA AGTATGATTT CTTTAATTTC ACCATTTATA
GTAGTGATTG TCACATTAGT TACAACCCTA GTTATACTGG TAACTAATGC ATGGTCTGGC
CCTCTTGAAA GTACAAACAT GTGTATTGCT GCATTTAAAT CTGCATTCGA TTCAGAATAC
ATAAACTATG TTATCATACT AATCATGTTT TGCTTCTCAT TCACAACTGT TTTCACATGG
TTTGTTTGTG CTAAAAGCAC TTTATATTGT CTTACAGATG GAAAAGATAG TTTACTTATC
AAAATGTGGA AAATTTTATA CACTATAATA ATACCAATAG GAGCATTAGG AAAAGTGCAG
CTTCTATGGG ACATAGCTGA CATTTCCATA TCACTAATAT TAATTAGTAA CACTATTGCT
ATACTAATAT TACTACCAAA ATACAAAAAC ATATTTAAAG TGTAG
 
Protein sequence
MSILNLVLSF PAVFLLLFTG IYLSIKSKFL QITKLPSAIS LITMKKYRDK SFSIGALCTI 
IGGNLGVGNI SGTAVALKSG GPGFALWMII IVTLCSIIKY VTCYLSIETR VKVNKQYIGG
PAIYFKNAFK SKKAVIVFTI LMLICSIAIG NFVQVNSLSI PMELINKPPI IAGLFMCAMF
FAVTILRLEI ITALISRLVP GMAIAYIILA SFVLYKFNDN IIPSIKLMFA NFLTFDSFKS
GMIGAFILET FHIIQVGTFR GIFATDIGLG LEGMVHSSIN SNSKNFNTHQ SMISLISPFI
VVIVTLVTTL VILVTNAWSG PLESTNMCIA AFKSAFDSEY INYVIILIMF CFSFTTVFTW
FVCAKSTLYC LTDGKDSLLI KMWKILYTII IPIGALGKVQ LLWDIADISI SLILISNTIA
ILILLPKYKN IFKV