Gene Franean1_0310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0310 
SymbolhppA 
ID5668734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp370571 
End bp372931 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content70% 
IMG OID641239241 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_001504682 
Protein GI158312174 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.588911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.319436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCGA TCCACACCCT ACAGGCCGAG GACCCCCCTA TCGACCTTAG CGGTGGAGCT 
GTCGGATTGG TCGCCGGTGT GGCGATCGTC GCAGCACTCG CCCTCCTCGT CGCCGCTTAC
CTGGTGCGCG AGGTACTAGC GGCGAACGCG GGCACTCCGA AAATGATCGA GGTGGGCAGA
GCGGTTCAGG AGGGAGCCGC AGCCTACCTC CGCCGGCAGT TCCGGACCCT CGCCGGGTTC
GTCATCGTGA TCCCGTTCGT CCTGTTGCTC CTACCGGCGG ATGACACGGG CGCTCGTGTG
GGCCGTTCCG TGTTCTTCGT CGTGGGCGCC GTCTTCTCCG CGCTGGTCGG CTTCGTCGGC
ATGTCGCTCG CGACCCGGGC CAACACCCGC ACCGCGGCGG CGGCGATCGA GAGCGGTGAA
CGTGCCGCGA TGCGGATCGC GTTCCGCACC GGCGGCGTCG TCGGGATGTT CACCGTCGGA
CTCGGCCTCC TCGGGGCGGC GGTGGTGGTC CTGGCCTTCC GGGACACCGC CCCGCAGGTG
CTCGAGGGGT TCGGGTTCGG CGCCGCCCTT CTCGCGATGT TCATGCGAGT CGGCGGCGGG
ATCTTCACCA AGGCGGCGGA TGTCGGAGCG GACCTGGTCG GCAAGGTCGA GCAGGGCATT
CCCGAGGACG ACCCGCGCAA TGCGGCGACG ATCGCCGACA ACGTCGGCGA CAACGTCGGC
GACTGCGCCG GGATGGCCGC CGACCTGTTC GAGTCATACG CGGTGACCCT GGTCGCGGCG
CTGATTCTCG GGGTCAAGGC GTTCGGCGAG AACGGGCTGG TCTATCCGCT GCTGATCCCG
GCGGTCGGCG TGCTCACGGC GATCATCGGG ATCTTCGCCG TCTCCCCCAG AGACGGCGAC
CGCACCGGCA TGACCGCGAT CAACCGGGGC TTCTTCATCT CCGCCGTGGT CTCCGCGATC
GGCGTGGTGA TCGTCTCCAT GGTCTACCTG CCGGGCAGTT TCGCCGAGTT CCCGGGGCTG
GAAGGCAGCA CCCAGAGCGG CGATCCGCGG GTGATCGCGA TCAGCGCGGT GCTCATCGGC
ATCGTGCTCG CCGCCGCCAT CCAGCTGCTG ACCGGCTACT TCACCGAGAC GAACCGCCGC
CCGGTCCTTG GCGTCGCCGA GGCGTCGACG ACCGGGCCGG CGACGAACAT CCTGGCCGGA
ATCGGGGTAG GTCTCGAGTC GGCCGTGTAC TCCGCGTTGC TCATAGGCGC GGCTGTGTTC
GGCGCGTATC TCCTCGGCTC GGGCAGCGTC ACGATCGCCC TGTTCGCGGT GGCGCTTGCC
GGGACCGGCC TGCTCACCAC GGTCGGTGTC ATCGTCTCGA TGGACACGTT CGGCCCGGTC
AGCGACAACG CCCAGGGCAT CGCCGAGATG TCGGCCGGGC CGGAGGGCAT CGACGAACGC
GCCGGGGCGA TCCTGACCTC GCTGGACGCG GTGGGCAACA CCACCAAGGC GATCACCAAG
GGCATCGCGA TCGCCACGGC GGTGCTCGCC GCGACCGCGT TGTTCGGCTC GTTCACCGAC
ACGGTCACGA CCGCGCTCGC CGATGCGGGG GCGTCCACGG AGGCCGCCCG GGGCACCGTT
GGTGGGCTGA ACATCGCCTA TCCGGACGCC CTGGTCGGCC TCATCATCGG CGCCGCCGTA
GTCTTCCTGT TCTCCGGTTT GGCGATCAAC GCGGTCGGCC GCGCCGCCGG CCGGGTCGTG
ATGGAGGTGC GCAACCAGTT CCGGACCAAG CCGGGAATCA TGGAGGGCAC AGAGAAGCCC
GACTACGGCG CCGTTGTGGA CATCTGTACC AGGGACTCGC TGCGCGAGCT GGTGACGCCA
GGAACACTCG CCGTGATGGC CCCGATCGCG GTCGGGTTCG GCCTCGGCTA CCCGCCGCTG
GGCGCGTTCC TCGGCGGGGC GATCGCCGCC GGGGTGCTGA TGGCGGTCTT CCTCGCGAAC
TCGGGCGGCG CCTGGGACAA CGCGAAGAAA CTCGTGGAGG ACGGCAACTA CGGCGGCAAG
GGCTCCGACG TGCACGCCGC GACGGTGATC GGGGACACCG TCGGCGACCC GTTCAAGGAC
ACCGCCGGCC CGTCGATCAA CCCACTGCTC AAGGTGATGA ACCTCGTCAG CCTGCTGATC
GCTCCGCTGG TCGTGAAGTT CTCGGTGGGC GAGGACGAGA ACACGGCCGC CCGCATCGGC
ATCGCCCTCG CCGCCGTCGT GGTGATCGTC GCGGTCGTCG CGACGTCCAG GCGGCGGGGC
TCGTCGGTGG GGGACGGGCC ACCGACGCCC TCGGCACCGC CGACGCGCAC CCCGCCGTCG
GGCTCGGAGG TGAAGGTCTG A
 
Protein sequence
MSPIHTLQAE DPPIDLSGGA VGLVAGVAIV AALALLVAAY LVREVLAANA GTPKMIEVGR 
AVQEGAAAYL RRQFRTLAGF VIVIPFVLLL LPADDTGARV GRSVFFVVGA VFSALVGFVG
MSLATRANTR TAAAAIESGE RAAMRIAFRT GGVVGMFTVG LGLLGAAVVV LAFRDTAPQV
LEGFGFGAAL LAMFMRVGGG IFTKAADVGA DLVGKVEQGI PEDDPRNAAT IADNVGDNVG
DCAGMAADLF ESYAVTLVAA LILGVKAFGE NGLVYPLLIP AVGVLTAIIG IFAVSPRDGD
RTGMTAINRG FFISAVVSAI GVVIVSMVYL PGSFAEFPGL EGSTQSGDPR VIAISAVLIG
IVLAAAIQLL TGYFTETNRR PVLGVAEAST TGPATNILAG IGVGLESAVY SALLIGAAVF
GAYLLGSGSV TIALFAVALA GTGLLTTVGV IVSMDTFGPV SDNAQGIAEM SAGPEGIDER
AGAILTSLDA VGNTTKAITK GIAIATAVLA ATALFGSFTD TVTTALADAG ASTEAARGTV
GGLNIAYPDA LVGLIIGAAV VFLFSGLAIN AVGRAAGRVV MEVRNQFRTK PGIMEGTEKP
DYGAVVDICT RDSLRELVTP GTLAVMAPIA VGFGLGYPPL GAFLGGAIAA GVLMAVFLAN
SGGAWDNAKK LVEDGNYGGK GSDVHAATVI GDTVGDPFKD TAGPSINPLL KVMNLVSLLI
APLVVKFSVG EDENTAARIG IALAAVVVIV AVVATSRRRG SSVGDGPPTP SAPPTRTPPS
GSEVKV