Gene Information |
Locus tag | Francci3_4304 |
Symbol | hppA |
ID | 3907272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5136512 |
End bp | 5138851 |
Gene Length | 2340 bp |
Protein Length | 779 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637881631 |
Product | membrane-bound proton-translocating pyrophosphatase |
Protein accession | YP_483379 |
Protein GI | 86742979 |
COG category | [C] Energy production and conversion |
COG ID | [COG3808] Inorganic pyrophosphatase |
TIGRFAM ID | [TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase |
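
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the usual convention for replicon coordinates, though the record does not state it) and that the CDS ends in a single stop codon that is not translated. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Francci3_4304.
START_BP = 5136512
END_BP = 5138851

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2340 779
```

Under these assumptions the computed values agree with the "Gene Length" and "Protein Length" fields of the record.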
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCGA TCCAAGCCCT ACAGGCCGAG GGCTCTGGAA TTGACCTTAG CGGCAGAGCC CTCGGCCTGG TCGCCGGTGT GGCGATAGTC GCAGCCCTCG CATTGCTTGT TGCCGGTTAT CTGGTTCGTG AGGTATTGGC GGCGAGTCCA GGCACCCCGA GGATGGTGGA GGTCGGCCGG GCGGTCCAGG AGGGCGCCGC CGCCTATCTC AGACGGCAGT TCACCACACT CGCGGGATTC GTCGTCGTCA TCCCCTTCGT ACTGCTGCTC CTGCCAGCCG AGAACACTGG AGCGAAGATC GGTCGGTCGA TCTTCTTCGT GGTCGGTGCG ATCTTCTCCG CGTTGGTCGG ATTCGTCGGG ATGTCGCTGG CGACCCGGGC CAACACCCGC ACTGCCGCCG CCGCCATGAC CCGAGGGGAA CGAGCCGCCG TCCGCATCGC CTTCCGCACC GGGGGGGTGG TGGGGATGTT CACCGTCGGT CTCGGACTCC TGGGGGCGGC GGCCGTGGTC CTCGTCTTCC GGGACACCGC CCCCCAGGTC CTGGAGGGAT TCGGATTCGG CGCCGCTCTG CTCGCGATGT TCATGCGAGT AGGCGGAGGG ATCTTCACCA AGGCCGCCGA TGTCGGGGCG GACCTGGTCG GCAAGGTGGA ACAGGGGATT CCCGAGGATG ATCCCCGCAA CGCCGCGACG ATCGCCGACA ACGTCGGCGA CAACGTCGGC GACTGCGCCG GCATGGCCGC CGACCTGTTC GAATCCTATG CGGTGACGCT CGTCGCGGCG CTGATTCTCG GGGTCGAGGC GTTCGGCGAA CGCGGCCTGG TGTTCCCCCT GCTCATCCCG GCGGTCGGGG TTGTGACCGC GGTCATCGGA ATCTTCGCGG TGTCGCCGCG GGCCGGTGAC CGCACCGGAA TGTCCGCCAT CAACCGGGGT TTCTTCATCT CCGCGGTGGT GTCCGCGATC GGCGTCGTCG CCGTCTCACT GCTCTACCTC CCGACGAGCT TCGCCGATTT CCCCGGGATG CAGGGAAGCA CCCAGTCCGG TAATCCACGA GTGATCGCGA TCGGCGCGGT CCTGATCGGG ATCGTGCTCG CCGCCGCCAT TCAGCTGCTA ACCGGGTATT TCACCGAGAC CGGCCGCCGG CCGGTGCGGG ACATCGCAAA GGCGTCGCTC ACCGGACCAG CCACCAACAT CCTCGCCGGA ATCGGGGTGG GTCTGGAGTC GGCGGTCTAT TCTTCGCTGC TCATCGGTGC CGCGATCTTT GGCGCCTACC TGCTCGGCTC CGGCAGCGTC ACGATCGCGC TGTTCGCGGT GGCGCTGGCG GGGACCGGTC TGCTGACGAC GGTGGGTGTC ATCGTCTCGA TGGACACCTT CGGGCCGGTG AGCGATAACG CACAGGGCAT CGCCGAGATG TCCGGCGACA TGGACGAGGC GGGTGCGGCC ATCCTCACCT CGCTCGACGC CGTCGGCAAC ACCACCAAGG CGATCACGAA GGGCATCGCG ATCGCCACCG CGGTGCTCGC CGCCTCGGCG CTGTTCGGCT CGTTCACCGA CACCGTCACG ACGGCTCTCC GCGACGCCGG AGCGCTGCCC GAGGCGGGTC GGGGCATCGT TGGCGGCCTG AACATCGCCT ACCCCGACGC GCTGGTCGGG CTCACCATAG GTGCGTCCGT GGTCTTCCTG TTCTCCGGGC TCGCGATCAA TGCGGTGGGC CGCGCGGCCG GGCGGGTCGT CCTGGAGGTA CGCAACCAGT TCCGGTCCAG GCCTGGAATC ATGACCGGCG ACGAGAAGCC GGATTACAGT 
GCGGTCGTCG ACATCTGCAC CCGCGATTCG CTCCGCGAAC TGATTACACC GGGAACGCTG GCGGTTCTCG CCCCAATCGC GGTCGGTTTC GGCCTGGGAT ACCCGCCACT CGGTGCGTTC CTCGGCGGCG CCATCGCCGG CGGCGTGCTG ATGGCCGTGT TCCTCGCGAA CTCCGGCGGG GCCTGGGATA ACGCGAAGAA GATGGTCGAG GACGGCCACC ACGGCGGAAA AGGCTCGGAG GTCCACGCGG CCACGGTGAT CGGCGATACC GTGGGAGATC CATTCAAGGA CACCGCCGGC CCTTCGATCA ACCCGCTGCT CAAGGTGATG AATCTGGTCA GCCTGCTGAT CGCTCCGACG GTCGTAAAGT ACTCGGTGGG CGCGGACGAG AACACGGGGC TGCGCATCGG TGTCGCCCTC GCCGCCCTCG CCGCCATCGC AGCCGTGATC ATCATCTCCA GACGACGGAG CTCAATGATC AACGATGTGC CCGATATGAC ACAACCGACA CCGGCTCCCA GCGAGAAGAT CAACGCCTGA
|
Protein sequence | MSSIQALQAE GSGIDLSGRA LGLVAGVAIV AALALLVAGY LVREVLAASP GTPRMVEVGR AVQEGAAAYL RRQFTTLAGF VVVIPFVLLL LPAENTGAKI GRSIFFVVGA IFSALVGFVG MSLATRANTR TAAAAMTRGE RAAVRIAFRT GGVVGMFTVG LGLLGAAAVV LVFRDTAPQV LEGFGFGAAL LAMFMRVGGG IFTKAADVGA DLVGKVEQGI PEDDPRNAAT IADNVGDNVG DCAGMAADLF ESYAVTLVAA LILGVEAFGE RGLVFPLLIP AVGVVTAVIG IFAVSPRAGD RTGMSAINRG FFISAVVSAI GVVAVSLLYL PTSFADFPGM QGSTQSGNPR VIAIGAVLIG IVLAAAIQLL TGYFTETGRR PVRDIAKASL TGPATNILAG IGVGLESAVY SSLLIGAAIF GAYLLGSGSV TIALFAVALA GTGLLTTVGV IVSMDTFGPV SDNAQGIAEM SGDMDEAGAA ILTSLDAVGN TTKAITKGIA IATAVLAASA LFGSFTDTVT TALRDAGALP EAGRGIVGGL NIAYPDALVG LTIGASVVFL FSGLAINAVG RAAGRVVLEV RNQFRSRPGI MTGDEKPDYS AVVDICTRDS LRELITPGTL AVLAPIAVGF GLGYPPLGAF LGGAIAGGVL MAVFLANSGG AWDNAKKMVE DGHHGGKGSE VHAATVIGDT VGDPFKDTAG PSINPLLKVM NLVSLLIAPT VVKYSVGADE NTGLRIGVAL AALAAIAAVI IISRRRSSMI NDVPDMTQPT PAPSEKINA
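
The sequences above are shown in space-separated blocks of ten characters. As a minimal sketch of how such a field might be prepared for analysis, the helpers below join the blocks, compute the GC fraction, and check that a CDS is framed by a start and a stop codon. All function names are hypothetical. The stop codons are those of translation table 11 (TAA, TAG, TGA); accepting only ATG as a start codon is a simplification, since table 11 also allows alternative starts. The example uses a short toy string, not the full gene sequence.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_start_and_stop(seq: str,
                       start: str = "ATG",
                       stops: tuple = ("TAA", "TAG", "TGA")) -> bool:
    """True if seq is a whole number of codons, begins with `start`,
    and ends with one of `stops` (table 11 stop codons by default)."""
    return len(seq) % 3 == 0 and seq.startswith(start) and seq[-3:] in stops


# Toy input: the first twelve bases of the gene plus its final codon.
toy = clean_sequence("ATGTCCTCGA TC\nTGA")
print(toy)                      # ATGTCCTCGATCTGA
print(has_start_and_stop(toy))  # True
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the "GC content" field; that result is not asserted here.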
|
| |