Gene Rleg_5944 details


Gene Information

Locus tag: Rleg_5944
Symbol: nhaA
ID: 8016364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012853
Strand:
Start bp: 488221
End bp: 489411
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 61%
IMG OID: 644828057
Product: pH-dependent sodium/proton antiporter
Protein accession: YP_002979257
Protein GI: 241518629
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3004] Na+/H+ antiporter
TIGRFAM ID: [TIGR00773] Na+/H+ antiporter NhaA
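
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts one terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 488221
END_BP = 489411
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```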


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.195827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTTGC CGATCGCGAC TGGCCGCATT CAATCCACTC TCCGCAGATT TCTTGACAGC 
GAGGCCTCCG GCGGCATCGT TTTGATGGCT GCCGCGGCAA TGGCCCTTGC GGTCGCAAAC
TCGCCGCTGG CGGGCGCATA TTTCCATGCC CTCCATCTTT ACCTCGGGCC ACTGAGCTTG
CAGCACTGGA TCAATGACGC CCTGATGGCC GTGTTCTTCC TGCTCGTCGG TCTGGAGATC
AAGCGCGAAA TGCTCGATGG CCAGCTCTCG ACCTGGAGCA GGCGAATTCT TCCCGGTGCC
GCCGCGGCCG GAGGAATGCT CGCCCCAGCC CTCGTCTATC TGGCTTTCAA TGCCGGGACG
CCAGCTAGCC TCCGCGGTTG GGCGATCCCG ACGGCCACAG ACATTGCCTT CGCGCTCGGA
GTGCTGTCGC TCTTTGGCAA CCGGGTTCCG GCGTCCCTGA AAATCTTTCT GGCGGCGCTT
GCCATTATCG ACGACCTCGG CGCCGTCCTC GTCATCGCAC TCTTCTATAC AAACGGTCTC
AACCTCCTGG CCCTTGCCGG AGCCGCCGCA GTTCTTGCCC TGCTGTTTTT CATGAACCGA
GCCGGCGTGA AGACGCTGAC GCTTTATCTC GGCCTTGGCG TCGCCTTGTG GGTGCTGATG
TTCACCTCCG GGATCCACGC CACGCTTGCG GGCGTGCTGC TGGCCTTGAC GATACCGATT
AAGCTTTCTC CCGGCGCTCC GGAGGCTAGC GACGAAGAAT CTCCGCTGCA CCGGCTCGAA
CACCTGCTTC ATCGACCAGT GGCATTCATC ATCGTCCCAC TCTTTGGCTT GGCCAACGCC
GGCGTTTCGC TTCGCGGCAC GTCCATCTCG AGTTTGGGAG ACCCTCATAC AATTGGCGTC
GCCGCCGGAC TGTTTGCTGG CAAGTTGCTT GGGGTTCTCT CCGTCGTCGG CCTCCTGGTG
AAGCTACGTT TTGCGCAGCT TCCAGCAATG GCGAACTGGA CGCAGATGAC CGGTGTCGCG
CTTCTTTGCG GCATCGGCTT CACGATGAGT CTCTTTATCG GTCTTCTCGC TTTCGATGAC
CCAGCCGTAC AGGACAAGGT CAAGATCGGC ATTCTGCTCG GTTCGGCGAT CTCTGGCGTG
GCCGGATCCG CGGTTCTGAT GGCGAGCCGG CGGAAAAGCA GCCGGTCGTA G
 
Protein sequence
MSLPIATGRI QSTLRRFLDS EASGGIVLMA AAAMALAVAN SPLAGAYFHA LHLYLGPLSL 
QHWINDALMA VFFLLVGLEI KREMLDGQLS TWSRRILPGA AAAGGMLAPA LVYLAFNAGT
PASLRGWAIP TATDIAFALG VLSLFGNRVP ASLKIFLAAL AIIDDLGAVL VIALFYTNGL
NLLALAGAAA VLALLFFMNR AGVKTLTLYL GLGVALWVLM FTSGIHATLA GVLLALTIPI
KLSPGAPEAS DEESPLHRLE HLLHRPVAFI IVPLFGLANA GVSLRGTSIS SLGDPHTIGV
AAGLFAGKLL GVLSVVGLLV KLRFAQLPAM ANWTQMTGVA LLCGIGFTMS LFIGLLAFDD
PAVQDKVKIG ILLGSAISGV AGSAVLMASR RKSSRS
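
The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of that translation in plain Python, not the tool the database itself uses. It relies on table 11 sharing the standard code's codon-to-amino-acid assignments (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGTCCTTGC CGATCGCGAC TGGCCGCATT"))
# MSLPIATGRI
print(translate("GCCGGATCCG CGGTTCTGAT GGCGAGCCGG CGGAAAAGCA GCCGGTCGTA G"))
# AGSAVLMASRRKSSRS*
```

Both fragments match the start and end of the protein sequence listed above, with the trailing `*` marking the TAG stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.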