Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5724 |
Symbol | |
ID | 5674050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6948769 |
End bp | 6952698 |
Gene Length | 3930 bp |
Protein Length | 1309 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641244577 |
Product | ATP/GTP-binding protein |
Protein accession | YP_001509980 |
Protein GI | 158317472 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.308781 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGTGT CTGGTACGGA CTTCGGAACC GTCGTCACCT TCTACTCCTA CAAGGGTGGC ACCGGCCGGT CCATGGCGCT CGCCAACACC GCCTGGCTGC TGGCGAGCAG CGGCTGCCGG GTGCTGGTCG TGGACTGGGA CCTGGAGGCT CCCGGCCTCC ATCGTTACTT CCACCCCTTC CTCCACGATC CCGACCTCCG GTCGACTCCC GGGATCATGG ACATGATCTG GGAGTTCACG GTGACGGTCA TGTCGCCGCA GGGGCCGAAC GACGACGACT GGTTCAGTCG GTCGACCACG ATCACGCCGT ACGCGGCGTC GCTGAACTGG CCCTTTCCCG ACGGTGGGAC CGTGGACTTC GTCTGTTCCG GTCGGCACGA CGCAGCCTAC GCCCAGCGGG TCGGCACGTT CGACTGGAAC GCGTTCTACG CGGACAAGGG CGGAGGGGAA TTCGTCGACG CGCTTCGCCA GGACATGAGG TCCTCCTATG AGTGGGTCCT GATCGACAGC CGCACCGGGT TGAGCGACAC GGCCGGGATC TGCACCGTCC AGCTGCCCGA CATCGTGGTC AACTGTTTCA CCCTCAGCAC GCAGAGTATC GAGGGTGCGG TCGCCGTCTC CCGCTCCATC GCGGCACACA AGGAGGGGCG CGAGATTCTA CAGCTGCCCG TCCCGATGCG GGTCGAGGAC GGCGAGCAGC GAAAACTCGA GCTCGGTCGG GATGTCGCAC GGGTGGCATT CGACCCGTTC CTGCATGGCT TCGATGAACG CGCGAAGGAG CGTTACTGGG GTGACGTCGA GATCCCCTAC AAGCCCTATT ACGCGTATGA GGAGATCCTC GCCGTCTTTG GGGACCGCCC GCTCCAGGAG GGCACTCTGC TGGCCGCATA CGAGAGGCTG GCGTCCTACC TGACTGACGG TATGGTCGCC GCGCTTCCTC CGCAGGACGA GTCCCGGCGC CGGAGCACTC TCGCGGTTTT CGAGCGGTCG CGAGTATCGA TCCCGACCGA CATGGTCGTC AGCCACGCCT CGACCGACCG CCCCTGGGCG GAGTGGGTCG CCGCCGAGCT CGAGGCGGCC GGGTTCCAGG TGATCCTCGA ACAGATCAGT CAGTCCGGGG CCGGCTGGAA GGACGACACG GAGCCGGAGC CGATCCGGCG AATGGTGGCT ATCGTGTCGG CCGCCTACCT CCGTTCCGCC GAGGCGATGG AACGGTGGCG GCGGAGCCTG ACCTTCGATC CGGACGGGGC CCAGCGCCTC CTCGTCCCGG TCCGGGTCGA GGAGGTTCGC CTGCCGCCGG AGGGCGCCGC GCGTGACCAC GCGGATCTCG TCGGGCTTGC CGAAGCACCG GCCCGCCGAG CACTGGTGGC CGCGGCCGGC CGGCCGTCCC GACTGGCGGT CACCACCCGC ACGCGGCCGG CGGACGCGAC CGATGTCCGC TATCCCGGCT CCCCCCCGGC GATCTGGGAG GGGGTGCCTG CGCGGAACCC GATGTTCGTC GGCCGGGACG CGCTGCTGAT GGACATGCGG AACCGTTTCA TGGGTGCCGA CAAGCCGGCG GCCCACACCC TGGTCCTGCA GGGGCTCGCC GGTGTGGGCA AGACCCAGGT CGCCGCCGAG TATGTCTTCC GTTTCGGTTC GACCTACGAC CTCGTCTGTT GGATCGGCTG CGATCAGGCG GCGCTGGCGC GCAGTGACAT GACCCGCCTC GCCCACACCC TCGGCCTGCC CGTCCGGCAG GGGCGGGACC CGGTGGACAG CCTGGTCGAC GCCCTGCGCA AGGGAGTGCC GTACCGGCGC TGGCTCCTCG TGTTCGACAA CGCAGACACC CCGGACGAGA TCCTTCCCCT CATTCCGAAC GGCTCGGGGC ATGTGGTCAT CACCTCCCGT AACCAGCGCT GGCGGGGCCG GCAGTCCCCG GTGGAGATCG ACGTGTTCAA CCGCGACGAG AGTGTCGAGC TGCTCCAGCG GAGCTCGCCG GCGCTGACGA CCGAGGTCGC GACCCGGCTC GCCGAGGCGC TGGGCGACCT TCCCCTCGCG CTTGAGCACG CCGGCGCCTG GCATGCGGAG ACCGGGATGC CGGCGGAGCG CTACCTGCAG CTGCTCGAGA GCAGCCCCGG CCCGCTGCTG CTCGAGGGTG AGGTCCCGAC CTACCCGCGG CCGGTCGCCA TGACCTGGCT GCTGTCGATG GAGCGGCTGC GTTCCCGCGC CCCCACGTCC GCCCGGCTGG CGCAGCTCTG CGCCTTCTTC GGGCCGGAAC CGATCAGCCT CGAGCTGTTC GCCGGGGACG GCCTGGCCGT TCTTGCCGAC GTCACGGACC AGACGCTGCG GGACGACCTC ACGCTGGCCG AGGCCGTGCG GCAGATCAGC GAGCACGCGC TCGCGCGGGT GAACTCCACC GACCGAAGCC TGATGATGCA TCGCCTCGTG CAGACCGCGA TACGAGACGA GCTGACCCCC ACCGAACGGG CCGGCGTCCG GGCCCGGGTG TACGCGATCC TCGCGGCCGC CGATCCGGGA AATCCCGACG ATCCCGAGAA CTGGAAGCGC TACGCCCTGA TCCGCCCGCA CCTGGGACCC ACCCGCGCGC CGACCGCTCG CGGCGAGGAC GTCTCCCGCC TCGTCCGTCA TCAGGTGCGA TGTCTGTATT TGCAGCGTGA TCACATCGGC TGCCGTGACC TCGCCAGCAA GACGCTCGTC CTGTGGCGGG AGCGCTTCGG AGACGACGAC GAGCGTACGC TGCAACTCGC CTTGGACCTG GCGGACAGTC ACCGGGCGCT CGGCGACGTC GAACAGGCCC GGGTGCTCGA TCTCCGCGCG CGGGAACGGC TGGTTCTCCT TCTCGGCCCG AACCATCCGC TCAGCCTGCG CGCCGCGATG GCGCTGGGCG GTGACTACCG CGGCGTCGGT GACTACGAGG CGGCCTGGCT GCTCGATGAG GATACGTACG CCCGCTACCG GGAGACGGCG GGCCTCGACA ATCCGGAGAC GCTCAAGGCG GCGAACAACC TGGCCGTCTC CCTGCGTTTC CTGGGCGATT TCAAACTGGC ACTCGAGATG AGTGAGGAGG TCTTCGAGCG CCACCGCAGG CTCACGGGGG ACGCCGATGT TTCGGCGCTC ATGTACCTGG AGAGTTATGC CCGGGACCTT CGAGAGAACG GGAGGTACCC GGCCTCGCTC ACTCTTCTCG AAGGCGCGCT CGAATGGTCT CGCCAGGCGC TGGGTGACGA TCACCTGGAC CGGCTGCGGA CAATGACTAA CTACGCCGTG TCGCTCCGTT GGGCCGGTCA TTCGGGCCGC GCCCGCGAGA TCGCGGAGGA TGCCCACGCC AGGTTCCGCG CCCACGCCGA TCCCGGGCAG TCAGAGGCGC TCGCGGTCGC AGTCTGCCTG GTCGGCGTCC TGCTGGAGGT GGGTGAGACC GCGACGGCGC GGAGGCTGGC GGACGAGACC CACCGGCGGG CCCGGTCCAG ACTGGGTGAG CGGAACATCT ACACGCTTGC CGCCGCCAAC AGCCTGGTCA TTGCCACGCG CCAGTGCGGC GAGCACATCG CCGCCGAGGT GCTGGGCCAG CAGACGCTGG ACGGCCTGCG GTCGGCGCTC TCCGCCATGC ACCCGTTCAC GGTCTTCTGC TCGATGAACG TGGCGAACTG CCACGCTGAT GCCGGGCGGA TCGCCGCGGC CCGCGAACTC GACCAGGGGG CGTGGAACGT CCTGCAGGCC AAGCTGGGGG AGGACCACCC GGTGACGCTG ATCGCCGCGG TGAACCTGGC GGCGGACGAG CAGCGAAGCG GGGACCTGGA AAACGCGCGG GCGCAGCGAT CTGGCCTGCT CCGTCAGCTC GGGGAGCGAC TGGGCGTGGA ACATCCCGTC GTCGTGGCGC TGGGCCGGGG CCGGCGAGTC GACCTGGACA TCGACCCTCC TCCCATCTGA
|
Protein sequence | MSVSGTDFGT VVTFYSYKGG TGRSMALANT AWLLASSGCR VLVVDWDLEA PGLHRYFHPF LHDPDLRSTP GIMDMIWEFT VTVMSPQGPN DDDWFSRSTT ITPYAASLNW PFPDGGTVDF VCSGRHDAAY AQRVGTFDWN AFYADKGGGE FVDALRQDMR SSYEWVLIDS RTGLSDTAGI CTVQLPDIVV NCFTLSTQSI EGAVAVSRSI AAHKEGREIL QLPVPMRVED GEQRKLELGR DVARVAFDPF LHGFDERAKE RYWGDVEIPY KPYYAYEEIL AVFGDRPLQE GTLLAAYERL ASYLTDGMVA ALPPQDESRR RSTLAVFERS RVSIPTDMVV SHASTDRPWA EWVAAELEAA GFQVILEQIS QSGAGWKDDT EPEPIRRMVA IVSAAYLRSA EAMERWRRSL TFDPDGAQRL LVPVRVEEVR LPPEGAARDH ADLVGLAEAP ARRALVAAAG RPSRLAVTTR TRPADATDVR YPGSPPAIWE GVPARNPMFV GRDALLMDMR NRFMGADKPA AHTLVLQGLA GVGKTQVAAE YVFRFGSTYD LVCWIGCDQA ALARSDMTRL AHTLGLPVRQ GRDPVDSLVD ALRKGVPYRR WLLVFDNADT PDEILPLIPN GSGHVVITSR NQRWRGRQSP VEIDVFNRDE SVELLQRSSP ALTTEVATRL AEALGDLPLA LEHAGAWHAE TGMPAERYLQ LLESSPGPLL LEGEVPTYPR PVAMTWLLSM ERLRSRAPTS ARLAQLCAFF GPEPISLELF AGDGLAVLAD VTDQTLRDDL TLAEAVRQIS EHALARVNST DRSLMMHRLV QTAIRDELTP TERAGVRARV YAILAAADPG NPDDPENWKR YALIRPHLGP TRAPTARGED VSRLVRHQVR CLYLQRDHIG CRDLASKTLV LWRERFGDDD ERTLQLALDL ADSHRALGDV EQARVLDLRA RERLVLLLGP NHPLSLRAAM ALGGDYRGVG DYEAAWLLDE DTYARYRETA GLDNPETLKA ANNLAVSLRF LGDFKLALEM SEEVFERHRR LTGDADVSAL MYLESYARDL RENGRYPASL TLLEGALEWS RQALGDDHLD RLRTMTNYAV SLRWAGHSGR AREIAEDAHA RFRAHADPGQ SEALAVAVCL VGVLLEVGET ATARRLADET HRRARSRLGE RNIYTLAAAN SLVIATRQCG EHIAAEVLGQ QTLDGLRSAL SAMHPFTVFC SMNVANCHAD AGRIAAAREL DQGAWNVLQA KLGEDHPVTL IAAVNLAADE QRSGDLENAR AQRSGLLRQL GERLGVEHPV VVALGRGRRV DLDIDPPPI
|
| |