Gene Sare_4411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4411 
SymbolhppA 
ID5705513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4985862 
End bp4988210 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content70% 
IMG OID641273830 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_001539179 
Protein GI159039926 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.917351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0418433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACA CCTTGGCCGC CGAGGGCGGC GGGATCTCCC TCACCGGGAA CAATGTCACG 
TACGTGGTTC TCGCCGCCGT GATCGCGGCC GTGGCGCTGG CGTTCGCCGC CGCGCTCACG
CGGACCGTGC TGGCAGCCGG CAGGGGCACC ACCAACATGC AGGAAATCTC GGGGGCGGTG
CAGGAGGGTG CCTCGGCGTA CCTGTTCCGC CAGTTCCGTA CCCTGGCGAT CTTCGTGGTC
GTCGCCGTGC TGCTGCTCTT CCTGCTGCCG GTGCACGACA CCGATGGCAG CGAGACCGCG
GTGAAGATCG GCCGATCCGC CTTCTTCGTC GTCGGCGCAC TGTTCAGCGC GTTCATCGGG
GGCGCCGGCA TGTGGCTGGC CACCCGGGCC AACCTGCGGG TCGCCGCCGC CGCTCGGGAG
CGTGCGGGTG GGCGGGAAGC AGCCATGCGG ATCGCGTTCC GCACCGGCGG TGTGGTCGGA
TTCCTGACCG TCGGCCTCGG TCTTCTCGGC GCCGCGCTGG TGGTTCTCTT CTACCGCGGT
GACGCCCCGA CGGTGCTCGA GGGCTTCGGC TTCGGCGCCG CGCTGCTCGC CATGTTCATG
CGGGTCGGTG GTGGCATCTT CACCAAGGCC GCCGACGTCG GCGCCGACCT GGTCGGCAAG
GTCGAGCAGG GCATTCCCGA GGATGATCCG CGCAACGCCG CCACCATCGC CGACAACGTG
GGTGACAACG TGGGCGACTG CGCCGGTATG GCCGCCGACC TGTTCGAGTC GTACGCGGTC
ACCCTGGTCG CCGCGCTGAT TCTCGGCCGT GCCGCCTTCG GCGAGGAGGG TCTGGTCTTC
CCGCTGATCG TCTCCGGCAT CGGCGCGATC GTCGCGATCA TCGGAGTCTT CATCACCCGG
CTGCGCGCCT CGGATCGTTC CGGCCTCACC GCCATCAACC GGGCCTTCTA CGCCTCCGCG
CTCATCTCCG CGGTGCTGGT GGCCATCGCG ACCTGGGCAT ACCTGCCGGC GACCTTCGGC
GAGCTGGCGG GGGGACTCAC CGGCGTCAAC GAGAACCCGC GCGTGGTGGC CCTCGGCGCG
GTCGTGATCG GTATCGTGCT GGCCGCCGCT ATCCAGGCGT TGACCGGCTA CTTCACCGAG
ACCAACCGTC GCCCGGTGCA GGACATCGGC CGCAGCTCGC AGACCGGCCC GGCCACCGTC
ATCCTCGCCG GCATCGGTGT CGGTCTGGAG TCCGCGGTCT ACTCGGCACT GCTGATCGGG
GCCGGAGTCT TCGGCGCGTT CCTGCTCGGC GGCAGCTCCA TCACCCTGTC GCTGTTCGCG
GTCGCGCTGG CGGGTACCGG TCTGCTCACC ACGGTCGGCG TCATCGTCGC GATGGACACC
TTCGGCCCGA TCTCCGACAA CGCCCAGGGC GTGGCGGAGA TGTCCGGCGA CATCGAGGCG
GACGGTGCCC GCACGTTGAC CGAGCTGGAC GCGGTCGGCA ACACCACCAA GGCGATCACC
AAGGGCATCG CGATCGCCAC CGCTGTGCTG GCCGCGACGG CGCTGTTCGG CTCGTACACC
GACACGCTGC GCACCGCGTA CGCGGACGCG GGCATCGCCG ACGTCGGCGG CGAGATCCTC
AACTCGCTGA ACGTGGCGAA CCCGCGGAAC CTGGTGGGTC TCATCATCGG CGCCGCGGTG
GTCTTCCTCT TCTCCGGGCT GGCCATCAAC GCGGTGTCCC GCTCGGCGGG GGCGGTCGTC
ATGGAGGTCC GCCGGCAGTT CCGCGAGCTG CCCGGGATCA TGGATCGTAC CCAGCGCCCC
GAGTACGGCA AGGTCGTCGA CATCTGCACC CGGGACGCGC AGCGTGAGCT GATGACCCCT
GGCCTGCTGG CGATCCTGGC GCCGATCGCG GTCGGCTTCG GGCTCGGGCC GGGGGCGCTC
GCGTCGTACC TGGCCGGGGC GATCGGTGCC GGCACACTGA TGGCCGTCTT CCTGGCCAAC
TCCGGCGGCG CCTGGGACAA CGGCAAGAAG ATGGTCGAGG ATGGCGCGTT CGGTGGTAAG
GGGTCCGAGG CGCACGCCGC GACCGTCATC GGCGACACCG TCGGCGACCC GTTCAAGGAC
ACCGCCGGTC CGGCGATCAA CCCGTTGATC AAGGTGATGA ACCTGGTTTC GTTGCTGATC
GCGCCGGCGG TGGTGGCCTG GAGCGTGGGT GACGACCGCA ACACCGGGCT GCGGATCTCG
ATCGCGGTGG TGGCGACGCT GATCATCGTC GCGTCCGTCG TGTTCAGCAA GCGTAAGGGC
GTGGCGATGT CCGACCCCGG TACGGGCACG GGCAGCGCCG ACCAACACCC GCAGGAGGTT
CGTGCCTGA
 
Protein sequence
MSDTLAAEGG GISLTGNNVT YVVLAAVIAA VALAFAAALT RTVLAAGRGT TNMQEISGAV 
QEGASAYLFR QFRTLAIFVV VAVLLLFLLP VHDTDGSETA VKIGRSAFFV VGALFSAFIG
GAGMWLATRA NLRVAAAARE RAGGREAAMR IAFRTGGVVG FLTVGLGLLG AALVVLFYRG
DAPTVLEGFG FGAALLAMFM RVGGGIFTKA ADVGADLVGK VEQGIPEDDP RNAATIADNV
GDNVGDCAGM AADLFESYAV TLVAALILGR AAFGEEGLVF PLIVSGIGAI VAIIGVFITR
LRASDRSGLT AINRAFYASA LISAVLVAIA TWAYLPATFG ELAGGLTGVN ENPRVVALGA
VVIGIVLAAA IQALTGYFTE TNRRPVQDIG RSSQTGPATV ILAGIGVGLE SAVYSALLIG
AGVFGAFLLG GSSITLSLFA VALAGTGLLT TVGVIVAMDT FGPISDNAQG VAEMSGDIEA
DGARTLTELD AVGNTTKAIT KGIAIATAVL AATALFGSYT DTLRTAYADA GIADVGGEIL
NSLNVANPRN LVGLIIGAAV VFLFSGLAIN AVSRSAGAVV MEVRRQFREL PGIMDRTQRP
EYGKVVDICT RDAQRELMTP GLLAILAPIA VGFGLGPGAL ASYLAGAIGA GTLMAVFLAN
SGGAWDNGKK MVEDGAFGGK GSEAHAATVI GDTVGDPFKD TAGPAINPLI KVMNLVSLLI
APAVVAWSVG DDRNTGLRIS IAVVATLIIV ASVVFSKRKG VAMSDPGTGT GSADQHPQEV
RA