Gene Caul_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3041 
SymbolhppA 
ID5900496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3304562 
End bp3306700 
Gene Length2139 bp 
Protein Length712 aa 
Translation table11 
GC content67% 
IMG OID641563543 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_001684666 
Protein GI167647003 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.108688 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCTT GGCTATATGC GGCCATTGGG GCCGGGCTGC TGGCGGTGCT TTACGGCGCC 
GTGCAGACGG CGTCTTTGCT ACGGGCCTCG CCCGGCAACG CGAAGATGCA GGAGATCGCC
GCGGCGATCC AGGAAGGGGC CCAGGCCTAT CTGAACCGCC AGTACACGAC GATTGGGATC
GTCGGCGTGG TGGTCATCGC CCTGCTGGGC TTCTTCTTCA AGTCGTGGGA ACAGCCGGTC
GGCTTCGCCC TGGGCGCGAT CCTCTCGGGC GCGGCCGGGT TCGCGGGCAT GCTGATCTCC
GTCCGGGCCA ATGTGCGCAC GGCCCAGGCC TCGTCCGAGA GCCTGGCGGC CGGCCTGAAG
ATGGCCTTCA CCTCCGGCGC CGTGACCGGC ATGCTGGTGG CGGGCTTCGC CCTGATCGGC
GTGGCGGGCT ACTTCGCCCT GCTGCTGAAC ACGGGCCATG CGGGCACCGA CCGGGTGGTG
GTCGATTCGC TGGTGTCGCT GGGCTTCGGC GCCTCGCTGA TCTCGATCTT CGCCCGCCTG
GGCGGGGGCA TCTTCACCAA GGGCGCCGAC GTCGGCGGCG ATCTGGTGGG CAAGGTCGAG
GCCGGCATTC CGGAAGATGA CCCGCGCAAC GCCGCGACCA TCGCCGACAA CGTGGGCGAC
AATGTCGGCG ACTGCGCCGG CATGGCCGCC GACCTGTTCG AGACCTATGC GGTGACCACG
GTCGCCACCA TGGTCCTGGC CGCTATCTTC TTCCGCGGCA CGGACGCCGT GTCGTCGATG
ATGCTGCTGC CGCTGGCCAT CTGCGCCGTC TGCATCGTCA CCTCGATCAT CGGCTCGTTC
GCCGTGCGGC TGGGCAAGAA GCAGAACATC ATGGGCGCCC TCTATCAGGG CCTGATCGTC
ACCGGCGTGC TGTCGATCCC GGCGGTCTAT TGTGTGATCC ACCAGTTGGT GCCGACCGCC
GTCACCGTTG GCGACCGGAC CTTCGACGCC AACGCGCTGT TCTTCTGTGG CCTGGCCGGC
CTGGCGGTCA CCGCGGCCAT CGTGGTGATC ACCGAGTACT ACACCGGCAC CAACTTCCGC
CCAGTGAAGT CGGTGGCCCA GGCCAGCGTC TCGGGTCACG GCACCAACGT GATCCAGGGC
CTGGCCATGT CGCTGGAATC CACGGCCCTG CCGGCCCTGA CCATCATCGT CGGCATCCTG
GTGACCTATA ATCTGGCGGG CCTGTTCGGC ATCGCCATCG CCACCACCAC CATGCTGTCC
CTGGCCGGCT TCATTGTCGC CCTAGACGCC TTCGGCCCCG TCACCGACAA CGCCGGCGGC
ATCGCGGAAA TGGCCGGCCT GCCGCCGGAA GTCCGCGTCA CCACCGACGC CCTGGACGCC
GTGGGCAACA CCACCAAGGC CGTCACCAAG GGCTACGCCA TCGGCTCGGC CGGCCTGGGC
GCCCTGGTGC TGTTCGCCGC CTATACCGAG GACCTGAAGT TCTTCTCGGC CCACGCCGAG
GCGGGCAGCT TCTTCGACGG CATGGGTCCG GTGACCTTCG ACCTGTCCAA CCCCTATGTC
GTGGTCGGGC TGCTGTTCGG GGGGCTGCTG CCCTTCCTGT TCGGCGGCAT GTCGATGACC
GCCGTGGGCC GGGCCGCCGA GGCCGTGGTG GCCGAGGTGC GTCGCCAGTT CCGCGAGAAC
CCGGGCATCA TGACCTATGA GGTCAAGCCC GAATACGGCA AGGCGGTCGA CATCCTGACC
AAGGCCGCGA TCCGCGAGAT GATCGTGCCC AGCCTGCTGC CGGTGGTTTC GCCGGTGGTG
CTGTTCTTCG TGATCAAGGC CATTGCCGGC AAGGTCGACG CGTTCGCCTC GCTCGGCGCC
ATGCTGATGG GCGTGATCGT CACCGGCCTG TTCGTCGCCA TCTCGATGAC CAGCGGCGGC
GGCGCCTGGG ACAACGCCAA GAAGGTGATC GAGGAAGGCT TCACCGACAA GGACGGCGTC
CTGCACTCCA AGGGCGGCGA CACCCACAAG GCCGCCGTCA CCGGCGACAC CGTCGGCGAC
CCCTACAAGG ACACCTCGGG TCCCGCCGTG AACCCGATGA TCAAGATCAC CAACATCGTG
GCCCTGCTGC TTCTGGCGGT TCTGGCCCAC GGCGTCTGA
 
Protein sequence
MSSWLYAAIG AGLLAVLYGA VQTASLLRAS PGNAKMQEIA AAIQEGAQAY LNRQYTTIGI 
VGVVVIALLG FFFKSWEQPV GFALGAILSG AAGFAGMLIS VRANVRTAQA SSESLAAGLK
MAFTSGAVTG MLVAGFALIG VAGYFALLLN TGHAGTDRVV VDSLVSLGFG ASLISIFARL
GGGIFTKGAD VGGDLVGKVE AGIPEDDPRN AATIADNVGD NVGDCAGMAA DLFETYAVTT
VATMVLAAIF FRGTDAVSSM MLLPLAICAV CIVTSIIGSF AVRLGKKQNI MGALYQGLIV
TGVLSIPAVY CVIHQLVPTA VTVGDRTFDA NALFFCGLAG LAVTAAIVVI TEYYTGTNFR
PVKSVAQASV SGHGTNVIQG LAMSLESTAL PALTIIVGIL VTYNLAGLFG IAIATTTMLS
LAGFIVALDA FGPVTDNAGG IAEMAGLPPE VRVTTDALDA VGNTTKAVTK GYAIGSAGLG
ALVLFAAYTE DLKFFSAHAE AGSFFDGMGP VTFDLSNPYV VVGLLFGGLL PFLFGGMSMT
AVGRAAEAVV AEVRRQFREN PGIMTYEVKP EYGKAVDILT KAAIREMIVP SLLPVVSPVV
LFFVIKAIAG KVDAFASLGA MLMGVIVTGL FVAISMTSGG GAWDNAKKVI EEGFTDKDGV
LHSKGGDTHK AAVTGDTVGD PYKDTSGPAV NPMIKITNIV ALLLLAVLAH GV