Gene Francci3_4304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4304 
SymbolhppA 
ID3907272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5136512 
End bp5138851 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content67% 
IMG OID637881631 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_483379 
Protein GI86742979 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCGA TCCAAGCCCT ACAGGCCGAG GGCTCTGGAA TTGACCTTAG CGGCAGAGCC 
CTCGGCCTGG TCGCCGGTGT GGCGATAGTC GCAGCCCTCG CATTGCTTGT TGCCGGTTAT
CTGGTTCGTG AGGTATTGGC GGCGAGTCCA GGCACCCCGA GGATGGTGGA GGTCGGCCGG
GCGGTCCAGG AGGGCGCCGC CGCCTATCTC AGACGGCAGT TCACCACACT CGCGGGATTC
GTCGTCGTCA TCCCCTTCGT ACTGCTGCTC CTGCCAGCCG AGAACACTGG AGCGAAGATC
GGTCGGTCGA TCTTCTTCGT GGTCGGTGCG ATCTTCTCCG CGTTGGTCGG ATTCGTCGGG
ATGTCGCTGG CGACCCGGGC CAACACCCGC ACTGCCGCCG CCGCCATGAC CCGAGGGGAA
CGAGCCGCCG TCCGCATCGC CTTCCGCACC GGGGGGGTGG TGGGGATGTT CACCGTCGGT
CTCGGACTCC TGGGGGCGGC GGCCGTGGTC CTCGTCTTCC GGGACACCGC CCCCCAGGTC
CTGGAGGGAT TCGGATTCGG CGCCGCTCTG CTCGCGATGT TCATGCGAGT AGGCGGAGGG
ATCTTCACCA AGGCCGCCGA TGTCGGGGCG GACCTGGTCG GCAAGGTGGA ACAGGGGATT
CCCGAGGATG ATCCCCGCAA CGCCGCGACG ATCGCCGACA ACGTCGGCGA CAACGTCGGC
GACTGCGCCG GCATGGCCGC CGACCTGTTC GAATCCTATG CGGTGACGCT CGTCGCGGCG
CTGATTCTCG GGGTCGAGGC GTTCGGCGAA CGCGGCCTGG TGTTCCCCCT GCTCATCCCG
GCGGTCGGGG TTGTGACCGC GGTCATCGGA ATCTTCGCGG TGTCGCCGCG GGCCGGTGAC
CGCACCGGAA TGTCCGCCAT CAACCGGGGT TTCTTCATCT CCGCGGTGGT GTCCGCGATC
GGCGTCGTCG CCGTCTCACT GCTCTACCTC CCGACGAGCT TCGCCGATTT CCCCGGGATG
CAGGGAAGCA CCCAGTCCGG TAATCCACGA GTGATCGCGA TCGGCGCGGT CCTGATCGGG
ATCGTGCTCG CCGCCGCCAT TCAGCTGCTA ACCGGGTATT TCACCGAGAC CGGCCGCCGG
CCGGTGCGGG ACATCGCAAA GGCGTCGCTC ACCGGACCAG CCACCAACAT CCTCGCCGGA
ATCGGGGTGG GTCTGGAGTC GGCGGTCTAT TCTTCGCTGC TCATCGGTGC CGCGATCTTT
GGCGCCTACC TGCTCGGCTC CGGCAGCGTC ACGATCGCGC TGTTCGCGGT GGCGCTGGCG
GGGACCGGTC TGCTGACGAC GGTGGGTGTC ATCGTCTCGA TGGACACCTT CGGGCCGGTG
AGCGATAACG CACAGGGCAT CGCCGAGATG TCCGGCGACA TGGACGAGGC GGGTGCGGCC
ATCCTCACCT CGCTCGACGC CGTCGGCAAC ACCACCAAGG CGATCACGAA GGGCATCGCG
ATCGCCACCG CGGTGCTCGC CGCCTCGGCG CTGTTCGGCT CGTTCACCGA CACCGTCACG
ACGGCTCTCC GCGACGCCGG AGCGCTGCCC GAGGCGGGTC GGGGCATCGT TGGCGGCCTG
AACATCGCCT ACCCCGACGC GCTGGTCGGG CTCACCATAG GTGCGTCCGT GGTCTTCCTG
TTCTCCGGGC TCGCGATCAA TGCGGTGGGC CGCGCGGCCG GGCGGGTCGT CCTGGAGGTA
CGCAACCAGT TCCGGTCCAG GCCTGGAATC ATGACCGGCG ACGAGAAGCC GGATTACAGT
GCGGTCGTCG ACATCTGCAC CCGCGATTCG CTCCGCGAAC TGATTACACC GGGAACGCTG
GCGGTTCTCG CCCCAATCGC GGTCGGTTTC GGCCTGGGAT ACCCGCCACT CGGTGCGTTC
CTCGGCGGCG CCATCGCCGG CGGCGTGCTG ATGGCCGTGT TCCTCGCGAA CTCCGGCGGG
GCCTGGGATA ACGCGAAGAA GATGGTCGAG GACGGCCACC ACGGCGGAAA AGGCTCGGAG
GTCCACGCGG CCACGGTGAT CGGCGATACC GTGGGAGATC CATTCAAGGA CACCGCCGGC
CCTTCGATCA ACCCGCTGCT CAAGGTGATG AATCTGGTCA GCCTGCTGAT CGCTCCGACG
GTCGTAAAGT ACTCGGTGGG CGCGGACGAG AACACGGGGC TGCGCATCGG TGTCGCCCTC
GCCGCCCTCG CCGCCATCGC AGCCGTGATC ATCATCTCCA GACGACGGAG CTCAATGATC
AACGATGTGC CCGATATGAC ACAACCGACA CCGGCTCCCA GCGAGAAGAT CAACGCCTGA
 
Protein sequence
MSSIQALQAE GSGIDLSGRA LGLVAGVAIV AALALLVAGY LVREVLAASP GTPRMVEVGR 
AVQEGAAAYL RRQFTTLAGF VVVIPFVLLL LPAENTGAKI GRSIFFVVGA IFSALVGFVG
MSLATRANTR TAAAAMTRGE RAAVRIAFRT GGVVGMFTVG LGLLGAAAVV LVFRDTAPQV
LEGFGFGAAL LAMFMRVGGG IFTKAADVGA DLVGKVEQGI PEDDPRNAAT IADNVGDNVG
DCAGMAADLF ESYAVTLVAA LILGVEAFGE RGLVFPLLIP AVGVVTAVIG IFAVSPRAGD
RTGMSAINRG FFISAVVSAI GVVAVSLLYL PTSFADFPGM QGSTQSGNPR VIAIGAVLIG
IVLAAAIQLL TGYFTETGRR PVRDIAKASL TGPATNILAG IGVGLESAVY SSLLIGAAIF
GAYLLGSGSV TIALFAVALA GTGLLTTVGV IVSMDTFGPV SDNAQGIAEM SGDMDEAGAA
ILTSLDAVGN TTKAITKGIA IATAVLAASA LFGSFTDTVT TALRDAGALP EAGRGIVGGL
NIAYPDALVG LTIGASVVFL FSGLAINAVG RAAGRVVLEV RNQFRSRPGI MTGDEKPDYS
AVVDICTRDS LRELITPGTL AVLAPIAVGF GLGYPPLGAF LGGAIAGGVL MAVFLANSGG
AWDNAKKMVE DGHHGGKGSE VHAATVIGDT VGDPFKDTAG PSINPLLKVM NLVSLLIAPT
VVKYSVGADE NTGLRIGVAL AALAAIAAVI IISRRRSSMI NDVPDMTQPT PAPSEKINA