Gene Information |
Locus tag | Franean1_0568 |
Symbol | |
ID | 5668985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 650584 |
End bp | 655125 |
Gene Length | 4542 bp |
Protein Length | 1513 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239495 |
Product | ATPase, P-type (transporting), HAD superfamily, subfamily IC |
Protein accession | YP_001504933 |
Protein GI | 158312425 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0474] Cation transport ATPase |
TIGRFAM ID | [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC |
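The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
start_bp = 650584
end_bp = 655125
reported_gene_length = 4542      # bp
reported_protein_length = 1513   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the ones reported in the record.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons print `True`: 655125 - 650584 + 1 = 4542 bp, and 4542 / 3 - 1 = 1513 aa.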
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCATTC CGGGCAGTCG ATCGGTATGG CATCTGCCGG CCGCCGTCCT CCACACCCTC ACCCCGCCCC TGTTCGACCG GTTCGAACCG CGCACGGAAC CCGGCCGGCG CGACCGCGTC GCCGGCCGAA TGGCGAGCTT CGAGGTACGC GGAGCCTCCG GGCCGGAGGG CGAGACCCGC TGCCGGGCGC TGGAAGAGGC GCTCGGGCGT CATCCGGGGG TCGCCTGGGC CCGGGTGAAC GCGCCGCTGA AGCGCATCCT GATCGGCCTG GTCGATTCCG CGCCCACCGC GGACGAGCTG GCCGCCGCCG TGAGGCAGGT GGAGGGGGTG ATGACCGAGC CGAGCGTTGA CCTGCTGCCC GGCGGCTTCA CCGGACCGGT CCGCCGGTCG GGGATGGCCC TGGCCGCCGA CGCGGCGGCG CTGGCACTGG CGTCGGCGGG TCGGCTCAGC CGGTGGGCGT CGCTCCCCGC CGAGCTCGCC TCGCTGGTCA CGCTCGTCGA CACCCAGCCC CGGCTGCGCT GTCAGCTGGA ACGGGTCCTT GGCCAGGACA ACGCCGACCT CGCCCTCGCG GTGGCCAACG CGGCCGCCCT CGGCCTGACT CAGGGATATG CCAGCCTCGG TACCGACGTC GCCTACCGAC TCCTGGCCCT CGACGCGGCA CGCGCCCGCC GGGATGCGTT CGCGGCCGTG GCGGACGAAC TGCTGGGCCG CCCCGAATGG GCCGGGGCGC GACCGGTGGT CGTCGAACGG CCGCGACCGC TGCCGGCCGG CCCGATCGAG CACTATGCGG ATGTCGCCGT GCTCGGCGGG CTGGCGGCCG GTGCGGCCGC GGCCGGCCTC ACCCGCGCCC CGCGCCGGGC GGCTGCTCTC GCCCTCACCG CGCTGCCCCG CTCGACCCGG CTCGGCCGCG AGGGATTCGC CCTCCAGCTG ACCCGCGTGC TCGCCCGACG CGGCGCGCAG GTGATGTCCC CAGCCGCACT GCGGTTGCTC GACCGCGTCG ACACCGTCAT CGTCGATCAC GACGTCCTGG TCAGCGGCCA GCACACCATC GGTGACATCG TGTCGTTGCC TGGCGCCGAC CCGAGCGAGG TGGCCGTCCG CTCCCACGCG CTGTTTCGTC CCGCCGACAC CACTGCCCGC GTGAGCCGGG ACGGCTGGAC GCTCGGCCCG GTGGAACGGT TGGCCATCCG GGGCCGCACC GGGGTGCGGG AGCGGCGCCG GCTGGCCGCG GCAGGGGCGG TTACCGTGCT CGGCCTCGCG CACGGCACCA GGTTGATGGC GCTCGTCGGG CTGGTGCCCG AGCGTGTGGA GGCCAGCGAG CAGTTCCTGG CCGCCTGCCG CCGTTCCGGC TGCGCCGTGT TCGTGGCCGG CGGCCCGGAG GGTGAGCCGA GCGGGCGGAC CCCCTGGGAC GGGTGGGCCG GGTTCCAGCA TGCCCCGGGG GGCACCCGGC TGGTGGCGAC CGTCCGTGGT TTGCAGGCGG AGCGCGGCGG TGTACTCGTC GTGTCCGGGC AGCGGGCAGC GGTCGGCTGC GCGGACTGCG GGGTGGGCGT AGCCGGGCCG GACGGATCGC CGGCGTGGGG GGCTCATGTG CTGGTCGGCA ACGACCTCCT CGCCGCGGGC CTCGTGGTCG AGGCTGCCGG TGTGGCGTCC CGGGTCAGCC GACACGCGGT GCGCCTGGCG GAGCTGGGCA CGGGGACCGC CGCCCTGGTC GCGCTGACCG GGGCCGGCGA GGCGCTGGCG GCCCGCGCCC AGCTCATGGT GACGGTTGCG GCCGCGGCCG CTCTGGGGGA AGGCGCCTGG 
GCGGCGCGTG AGCTGGGCCG CAGGCCGGTG CCCCCGACGA TCCCGCATAT CCCATGGCAT GTGCTGCCGC CCGAGGCATG CCTGGACCGG CTCGGCAGCA CCCGGCGAGG GCTCTCCGCC GGCCAGGTCG CCCAGCGGCA GGTCGGCGAG ACGTCCGTGT CCGTGAACGC CCCCGGCCTG GCCCGGGCGT TCGCCGCGGA ACTGGCAAAC CCGCTCACCC CGGTCCTGCT CGGCGGCGCC GCGCTGTCCG CCTCCACCGG ATCGGTCCTC GACTCCGGGC TCGTCGTCAG CGTCGCGGTC GGCTCCGCGC TCATCGGGGC GGTGCAGCGG CTGCGCGCCG ACCGGGCGCT GGCTCGTCTC TTCGCCGTCT CGGCCATCCC AGCTCTGGTG CGGCGGGACG GTGCGGACGT GGAGCTGACC GCGGACGATC TGGTGGCCGG GGATGTGCTC ACGCTGGGGC CGGGCGACGT CATCCCCGCC GACTGCCGCC TGCTGTCCAC CGAGGCACTC GAGGTGGACG AGTCGTCGCT GACCGGCGAA TCGCTCCCGG TGGCGAAGTC GCCGAAGCCG GTCGCCGCCC GGGACCTGGC GGAGCGGACC TGCATGGTCT TCCAGGGCAC GACCGTGGCC GCCGGGCGCG CCACCGCCCT GGTGGTGGCT ACCGGCACCG CCACCGAGGC GGGGCGGGGC CTGGCCGCGG CCGCCGAGCC GGCGAGACCG GTGGGTGTGC AGGCCCGGCT CGCGGCGATC ACGGACCTCA CCATCCCGGT CGCACTCGGC GCGGCCGGGA CGTTGCTGCT GTCCGGGCTG GTACGCGGCC TCCCGATCCG GAACACGCTG AGTGCCGGAG TGGCGCTGGC CGTCGCGGCG GTCCCCGAGG GGCTGCCCTT CCTCGCCACG GCCGCCCAGC TCTCCTCCGC CCGACGGTTG GCCGGGCGGG GCGCGCTCGT ACGCAACCCA CGCACCATCG AGACGCTCGG CCGGGTGGAC GTGCTGTGCT TCGACAAGAC CGGGACCCTC ACCAAGGGAA AGATCCGGCT GACCGCGGTG TCGGACGGCT CCCGTTCCTG TCCGCTGGCG GACCTCGACG AGACCGGGCA TCAGGTGGTG GCCGTCGGCC TGCGCGCCAC CCCGGACGAC CAGCATCGCA AGCTGCCGCA CCCGACCGAC CGGGCGGTCG TGAAGGGCGC CGCCGCCGCG GGCGTGGGCC GGGCACCCGA CGGGCGGGAA TGGTCGCCGT TGACCGCCCT GCCGTTCGAC CCGAGCCGCA GCTACCACGC CTGCCTGGGC CAGCTTGGCG CCGCCACGTT GCTCAGCGTC AAGGGAGCAC CCGAGGTGGT GCTGCCCCGC TGCACCCATC GCCGGACCAG CCGCCGCACC CAGCCGCTCG ACGTCCGTGG CCAGGCACGT CTGGCTCGGG AACACCACCG CCTCGCCGCC GCGGGCTACC GGGTGCTTGC CGTCGCCGAG CGCCGGGTGG CGGCCACGGG CCCCGCGGCG CGCCGGCAGC TGGCCGACGA CGACGTCGCG GGCCTCGCGT TCCTCGGGTT CCTCGCGCTG TCCGACCCGG TCCGGGACAC CGCGACCCCG TCCCTCGACC AGCTGCGCGC CGCGGGCGTG CAGATCATAA TGATCACCGG CGACCATCCG AGCACCGCGC GGACGATCGC CGCGGAGCTC GGGGTGCTTG ACGGCGATGC GCAGGTGGTG ACCGGCGCGG AACTCGATGC CATGGCCGAC GCGGAACTCG ACGCCATGCT GCCGCTGGTG GCCGTCGTCG CCCGCGGCAC ACCGGCGCAC AAGGTCCGGG 
TGATCGAGGC CTTCCAGCGG CTGGGCAGGA CCGTCGCGAT GACAGGTGAC GGTGACAACG ACGCGCCCGC CATCCGGCTC GCGGACGTGG GCATCGCGCT GGGTAAACGG GGGACCCCGG CGGCGCGGGC CGCGGCCGAC GTGATCATCA CGGAGAACCG GCTGGACGCC ATCCTCGCCG TGCTCGTCGA GGGCCGGTCG ATGTGGGCCT CGGTCCGCAA GGCGCTGGGC ATCTTCGTCG GCGGCAACCT CGGCGAGATC GCCTTCACCC TTCTCGGCTC GCTGGCCACG GGACGCTCAC CGCTGTCCGC CCGGCAGTTG CTGCTCGTCA ATCTGCTCAC CGACCTCGCC CCCGGACTGG CGGTGGCGCT GCGCCCACCG GATCCGGAGG CGGCCGGCCG GCTCCTGGGC GAGGGGCCTG AACGGTCCCT GGGCAGCCCC CTCAACGCAG AGATCGCGGT GAGGGCGACC GCGACGACCC TGGCCGCCGC GGGTGCCTGG ATCGCCGCCC GCCTCACCGG GCGGCGCCGG CGCGCCGACA CGGTCGGGCT CGCGGCGCTC GTCGGCTCCC AGCTCGGCCA GACCCTGCTC GTTGGCGGGC GCAGTCGCAT GGTGATCCTC AGCAGCCTGG TCTCGGCGGT CGTCCTGGCC GCCGTCGTGC AGGCGCCCGG GTTGAGCCAC TTCTTCGGTT GCACCCCACT CGGCACGGCC GGATGGACGA TCGCCGTCGC CGCGTCCGCT GTCGCGACGC TGCTCTCGTT CCTGCTACAG CGGCGTACCG AACGCCTGCT ACAGCGGCGT ACCGAACGGC CCGTCCTGGC TCGCGCCGAC TTGACCCAGA TCCTCCACGG CCTCGCCCGT GTTCCCCGGG CGGCAGTCCG GACAAGCGCG GCCGGCCCCG TCCCGGCACC GGCCGGTCGC GGAGCAGGAT GA
|
Protein sequence | MRIPGSRSVW HLPAAVLHTL TPPLFDRFEP RTEPGRRDRV AGRMASFEVR GASGPEGETR CRALEEALGR HPGVAWARVN APLKRILIGL VDSAPTADEL AAAVRQVEGV MTEPSVDLLP GGFTGPVRRS GMALAADAAA LALASAGRLS RWASLPAELA SLVTLVDTQP RLRCQLERVL GQDNADLALA VANAAALGLT QGYASLGTDV AYRLLALDAA RARRDAFAAV ADELLGRPEW AGARPVVVER PRPLPAGPIE HYADVAVLGG LAAGAAAAGL TRAPRRAAAL ALTALPRSTR LGREGFALQL TRVLARRGAQ VMSPAALRLL DRVDTVIVDH DVLVSGQHTI GDIVSLPGAD PSEVAVRSHA LFRPADTTAR VSRDGWTLGP VERLAIRGRT GVRERRRLAA AGAVTVLGLA HGTRLMALVG LVPERVEASE QFLAACRRSG CAVFVAGGPE GEPSGRTPWD GWAGFQHAPG GTRLVATVRG LQAERGGVLV VSGQRAAVGC ADCGVGVAGP DGSPAWGAHV LVGNDLLAAG LVVEAAGVAS RVSRHAVRLA ELGTGTAALV ALTGAGEALA ARAQLMVTVA AAAALGEGAW AARELGRRPV PPTIPHIPWH VLPPEACLDR LGSTRRGLSA GQVAQRQVGE TSVSVNAPGL ARAFAAELAN PLTPVLLGGA ALSASTGSVL DSGLVVSVAV GSALIGAVQR LRADRALARL FAVSAIPALV RRDGADVELT ADDLVAGDVL TLGPGDVIPA DCRLLSTEAL EVDESSLTGE SLPVAKSPKP VAARDLAERT CMVFQGTTVA AGRATALVVA TGTATEAGRG LAAAAEPARP VGVQARLAAI TDLTIPVALG AAGTLLLSGL VRGLPIRNTL SAGVALAVAA VPEGLPFLAT AAQLSSARRL AGRGALVRNP RTIETLGRVD VLCFDKTGTL TKGKIRLTAV SDGSRSCPLA DLDETGHQVV AVGLRATPDD QHRKLPHPTD RAVVKGAAAA GVGRAPDGRE WSPLTALPFD PSRSYHACLG QLGAATLLSV KGAPEVVLPR CTHRRTSRRT QPLDVRGQAR LAREHHRLAA AGYRVLAVAE RRVAATGPAA RRQLADDDVA GLAFLGFLAL SDPVRDTATP SLDQLRAAGV QIIMITGDHP STARTIAAEL GVLDGDAQVV TGAELDAMAD AELDAMLPLV AVVARGTPAH KVRVIEAFQR LGRTVAMTGD GDNDAPAIRL ADVGIALGKR GTPAARAAAD VIITENRLDA ILAVLVEGRS MWASVRKALG IFVGGNLGEI AFTLLGSLAT GRSPLSARQL LLVNLLTDLA PGLAVALRPP DPEAAGRLLG EGPERSLGSP LNAEIAVRAT ATTLAAAGAW IAARLTGRRR RADTVGLAAL VGSQLGQTLL VGGRSRMVIL SSLVSAVVLA AVVQAPGLSH FFGCTPLGTA GWTIAVAASA VATLLSFLLQ RRTERLLQRR TERPVLARAD LTQILHGLAR VPRAAVRTSA AGPVPAPAGR GAG
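The sequences above are displayed in space-separated blocks of ten characters. The following sketch shows one way to strip that formatting, compute GC content, and translate a fragment. It makes several assumptions: the displayed gene sequence is already in coding orientation (it begins with ATG and ends with TGA even though the strand is listed as "-"), and the codon-to-amino-acid assignments of translation table 11 match the standard genetic code. Alternative start codons, which table 11 permits, are not handled. All names are hypothetical, and only the first and last few blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11 too.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three and last two blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGCGCATTC CGGGCAGTCG ATCGGTATGG")
last_blocks = clean_sequence("GGAGCAGGAT GA")

print(translate(first_blocks))    # MRIPGSRSVW
print(translate(last_blocks))     # GAG
print(gc_fraction(first_blocks))  # 0.6
```

The two translated fragments agree with the first ten and last three residues of the protein sequence in the record. The GC fraction of this 30-base fragment (0.6) is lower than the 75% reported for the whole gene, which is expected for such a short sample.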