Gene Rcas_3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3559 
SymbolhppA 
ID5541060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4646434 
End bp4648773 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content61% 
IMG OID640895678 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_001433626 
Protein GI156743497 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0306706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.266916 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAAC TCAGCGCTTT CGAACAGGCA GCCGTATGGG CCGTGCTCGG CGTTGCGATT 
TTCGGCATCG CATATGCCTT CGTCCTTCGG GCGCAGATTC TGGCGCAGGA CAAAGGCACC
GCCCGCATGC AAGAGGTCTG GGGATTCATC AAAGACGGCG CGAATGCCTA CCTCAGTCGT
CAGTTCCGCA CCATCGCCAT TCTGATCGTC GTTCTGACAT TTCTCCTCGC CGCCAGTGTG
TTCATCATTC CGCCGACGCG CGAAGCGGTC GAACACTTCG GCAGTGAGGA AGCAGCGACT
CTGTGGGTAG CACTCGGGCG TGCCGTCGCC TTCTTGATGG GGTCGTTGTT CTCGTATGCG
GTCGGCTTCG TCGGCATGAA TGTGGCAGTC GAAGGGAATG TGCGTGTGGC TGCCGCATCG
CGCAAAGGGT ACAACCCGGC GTTGCAAGTC GCCTACAAGT CGGGGTCGGT GACTGGCATG
CTGACAGTGG GGCTTGGACT GCTGGGTGGG ACGTTGATCT TTATGGTGTT TGGCATCGCT
GCCCCGGACG CGCTCCTCGG CTTTGGCTTT GGCGGCTCGC TGATCGCGCT CTTCATGCGC
GTCGGCGGCG GTATCTACAC CAAGGCAGCC GACGTTGGCG CCGACCTGGT GGGAAAGGTG
GAAGCCGGCA TCCCTGAAGA CGACCCGCGC AATGCGGCGG TGATCGCCGA CCTGGTGGGC
GACAATGTCG GCGACTGCGC CGGTATGGCG GCAGACGTGT TCGAAAGCTT CGAGGTAACG
CTGGTATCGG CGCTCATCCT GGGACTGGTG CTGGGCGATG CCGTCGTCGG CACGCTGGGC
GATGGTCAGT ACGACCTGCG CTTCATTGTC TTCCCGCTCG TGTTGCGCGC AATCGGCGTG
ATTGCATCGG TCATCGGCAA CTCTATCGTC AGCACCGACG AGAAGCGCCG CAATGCGATG
GCAGCCATGA ACCGCGGCTT TTATGTCGCT GCGCTCGTCT GCTTCATTGG ATTTGCGGGG
TTCACGGCAG TCTATATGGT CGATCCGACA ACCGGCGCAA TCGACTGGCG CCCCTTCCTG
GCGACGATCG CCGGTCTGGT GCTGGCAGTG GCGCTCGATA AACTGACGGA GTATTTCACA
TCGACACACT TCAACCCGGT GAAGGAAACC AGTAAAGCAT CGAAGACTGG CGCAGCCACC
AATATTCTTT CCGGTCTGGC GCTTGGCATG GAGTCGAGCG TGTGGGCGAT CCTGGTGATC
TGTGCGTCGA TCCTGACGTC GATTGCCATC TACAGCGGGT ACTCGACCGA TCCCACGGTA
ACGCTCACCG CTGTGCTGTA CGGCGTGTCG CTCACCGGCA TCGGGATGCT GACGCTCACC
GGCAACACGA TCTCGATGGA CTCGTTCGGC CCGATTTCAG ACAATGCCAA CGGCATCGGT
GAGATGGCAG GGCTGGATAA GAATGCACGC AATGTCATGG ACGATCTGGA CGCGGTCGGC
AACACGACAA AGGCGGTCAC CAAAGGGATC GCCATCGGTT CCGCCGTGAT TGCAGCCGTT
GCGCTCTACG GCTCGTACCT GGCGGATGTG TCGAAGGTGC AGGAGCAGAT CGGCGTGCCG
CTGGCAGAGC AGCTGCGCAC CATCGGCATC AATGTCGCTA TGCCGACCGT GTTTATTGGG
TTGTTGATCG GTGGCGCTGT GCCGTTCTTG TTCTCGTCGC TGACGATCCG CGCAGTGCAG
CGCGCGGCGT CGCAGATCGT GAACGAAGTG CGCCGTCAGT TCAAGATTCC AGGAGTCATG
GAAGGCACGG TGACGCCTGA TTATGCACAG GCCGTCAGCA TCTCCACCGT AGCAGCGCAG
AAGGAACTGA TCAGCTTGGG CTTGATCGCG GTGATGGTGC CGATCCTGGT CGGCTTCCTA
CTCGGTGTCG AGGCGCTCGG CGGATTCCTG GCGGGCATCA TCCTCTCCGG TCAGTTGATG
GCGGTCTTTC AGGCGAACGC TGGCGGCGCC TGGGATAATG CGAAGAAGTA CATCGAAGAA
GGGAACTTCG GCGGCAAGCA CTCAGAACCG CACAAAGCAG CCGTGGTGGG CGACACGGTC
GGCGATCCCC TGAAGGACAC GGCGGGACCG GCGTTGAACC CCATGATCAA GGTGATCAAC
CTGGTGGCGC TGATCATCGC CCCGATTGTG GTGACGATCC CAAGCGGCAG CCCCGGCGTC
ATTTTTGCTA TGGTCCTCTG CGCTGCGGCG CTGGTCTGGG CAATCTGGCA GAGCAAGCGC
GAAGCGCCGT CGATGACGAC CGAGACTCCC GCACCCGCAA CGACGGCGAA AGGGGTGTGA
 
Protein sequence
MQELSAFEQA AVWAVLGVAI FGIAYAFVLR AQILAQDKGT ARMQEVWGFI KDGANAYLSR 
QFRTIAILIV VLTFLLAASV FIIPPTREAV EHFGSEEAAT LWVALGRAVA FLMGSLFSYA
VGFVGMNVAV EGNVRVAAAS RKGYNPALQV AYKSGSVTGM LTVGLGLLGG TLIFMVFGIA
APDALLGFGF GGSLIALFMR VGGGIYTKAA DVGADLVGKV EAGIPEDDPR NAAVIADLVG
DNVGDCAGMA ADVFESFEVT LVSALILGLV LGDAVVGTLG DGQYDLRFIV FPLVLRAIGV
IASVIGNSIV STDEKRRNAM AAMNRGFYVA ALVCFIGFAG FTAVYMVDPT TGAIDWRPFL
ATIAGLVLAV ALDKLTEYFT STHFNPVKET SKASKTGAAT NILSGLALGM ESSVWAILVI
CASILTSIAI YSGYSTDPTV TLTAVLYGVS LTGIGMLTLT GNTISMDSFG PISDNANGIG
EMAGLDKNAR NVMDDLDAVG NTTKAVTKGI AIGSAVIAAV ALYGSYLADV SKVQEQIGVP
LAEQLRTIGI NVAMPTVFIG LLIGGAVPFL FSSLTIRAVQ RAASQIVNEV RRQFKIPGVM
EGTVTPDYAQ AVSISTVAAQ KELISLGLIA VMVPILVGFL LGVEALGGFL AGIILSGQLM
AVFQANAGGA WDNAKKYIEE GNFGGKHSEP HKAAVVGDTV GDPLKDTAGP ALNPMIKVIN
LVALIIAPIV VTIPSGSPGV IFAMVLCAAA LVWAIWQSKR EAPSMTTETP APATTAKGV