Gene Information |
Locus tag | Franean1_5417 |
Symbol | |
ID | 5673748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6534137 |
End bp | 6539233 |
Gene Length | 5097 bp |
Protein Length | 1698 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244272 |
Product | hypothetical protein |
Protein accession | YP_001509678 |
Protein GI | 158317170 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
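The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in Protein Length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 6534137
end_bp = 6539233

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the Gene Length (5097 bp) and Protein Length (1698 aa) fields.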
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.265481 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.989313 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TCAGCCCACT GCTGGCCACC CGGGTCGCCA ACCGGCTGGT GACGGACTTC CTGCGCGCAG CTCCCCCCGG CCGCTGCATG CGGCTCGACC ACCTGCGCGC CGACGACTGC CACGCGGTAC GCGACGCGGT GACGAACGCG CTTACCGAAA CCGCGGTGGG CGGCGGATGC GTGGTCGCCG TCCTCGGCGC CGCGGTCAGC GACGACGACG CGGTCATCTC CCCAGAGCGC GCGATCGAGC TGCGTAACCG CAAGTCGGCC GCGTTGCTGC TGCTCGTGCC GGCAGGCACC GACAGCCCGG CGGCCAGCAG CCTGGAGAAC TCGTTTGAGT CGATCGACAT CGAGGATCTG TTCGCCAAGA TCGTCCGGGA CGCGCTGCAC CGGCTGGACC GTCCCCTGCG GGACCTGTTC GGCCAGGTCC AGGCGACGGT GCGCCGCGGC GCGTTCCCGC GCTCGCGCCA GGCCCGCGCC GAGTACCTGC TCGCGGTCGG CGCCGCCGCG GCCGACCCGG TCGCCGGCGG CGCCGGGGCC GCGCAGTACG TCGCCGGAAC GCACCTGCAC CTGCTCGGCC TGATCCCGGA CCAGGGCGGC GCGTCGTTCG TCTCCCGGCT CGACGCGAAC GCCCAGTGCG TGACCGCTCT GGTCAAGCCG GGCCGCTCGC AGAAGGACGC GCGCACCCGG CTCGCCGACT GCCCGCTGCG CCAGGACGAC GAGCGTTACC GGGCGATCGA GCGGTTCCTC GTCGCCGTGC CGCGGCTGTC CGAGCGTGCC TGGCTGCGCG ACCTGGCCGC CGACGACAAC GCGGACCTGG CCTTCAACCG CTGGCCGCTG GACCTTCCCG TCGACAGCGA CCTGGTGGCC CTGCAGATCG AGCCGTTCGC CAACACCGAG GGGATCATCG CGGCAGGCAC CGGCCTGCGC GCGGGCGCGG ACGGCGCGGT CCCGACCTGC TCTGCCCGCG GCGGCTTCGT GAAAGTCACC TGGACGACGG AGCCCACCAG GCCGAAGGAC GTGGCCCGCT GGCAGGTGGA GATCGTCCCG TCAGAGGAGT TCTACGGCAA CGAGACCGAC TTCGACGTCA CCCTGCCCGC GGCGAAGGTC GCCGGCAGCA AGCACAGCCA CCGGCTGACG GTGAATGTCG ACACCGAGGG CGCCGAGCGG GTGCCGGCGG TGGTGACCGT CCGGGTCAGC GCGCTGGACC GCAACAACCA GGTGCTACGG CTGCGCGACG GCGCGCTCGC CCAGGCCACC GGCCAGAAGC TCGCCCAGGC CACCAGCCAG GAGTTCGCGC TCGACGACGC GCCGCCGCCG GAAAGCACCG CCGTCCGGCG TGACACCGCC ATCTCCCTGC CGGTGGCGCG GCTTCGGGCC GAGCAGGCGG GCGCGGATTC GGACGTCGAG ACGTCCGAGG GCTGGCAGGT GGCCGACCTG GCCTATCTGC AGTTGCGGTT CGGTCGGGGC GGCAGGACGC ACGTCGCCCG GATCGGGGTC AGCGCGCCGC TGCGCGAGCT TTCCGCCCGC GCACTCGCCG AGCTGGACAA CGTCGGCCGG TACGAGGCGG AGGTGGGCGC CGGGCAGCCG TTCGCGGCGG CGGCCGCCGC CGCGGTGGGC CCCGGCTGGC CGTCGGGCGC GGGCAGGGCG TTCCTCGCCG CCCGGCGGGA GTTCTTCGAG GCGGTCCGCA ACCAGCCCGG TCGCGGCCTG CTCGAGGTAG CCGCCCTCTC CGAGGAACTG GCCGCGCTGG GGGCGAAGTA CACCGCCGCC TACGGGCGGA TCCTGCGCCG GGACTCCGGC 
GCCGCCGACC TGCGCGCGCT CACCTCGATC GACACTCTTC GGCTGCGGAT CCGCCTCGGA CGGCGCCACC CGGTGGACGC GCTGGTCATG CTGCCCACGC ATCCGCTGCG GGTCGCCTGG TACCTCGGCT ACCACGCCGA ACTGGAGGCG TGGCGTACCA GGCTGCGGGT GCTGGCGCCG AAGGACCGCC GCGACGAGGT CGACGTCGAC CTGCTGGGCA GGCTGTCGCC CGCCCGGGTG CCGTTCCTGC TCGCCGGCCC GGACGGCGCC CCGTACGTGT TCGCGCAGAA CCTGCGGCTG CTCTACGGGC TGTACCTGCC GGTCGACGAG CCCGACCCGG CGACGGCCGT CAGCGAGACG GTGACCGCGC TCGGGCTGCC GAGTGCCGAT GTCAGCGTAG GCGAGGTGCC GCCCGCTCGG CTGGCCGACC GCATCAGGCT CTACCGGGAC ACCCACCGCG GTCCCGAGCG GCTGCGGATC CTGGCGACCA ATCCGGGCAG TGGCGCGTTT CTCGGCGAGG CGCTGCGAGC CGTCACCGCC GAACCCGGCG ACAACGCCGC CGGCCGCGAG CCGGGCGAGG ACGAGGCCGC CACCGCGCCG CACATCTGGA TCGAGGCGTT CGGCGAGGGC GCGTCGGAGA CGAACCCGTT GCCCGGCCTG CGCGAGCTGC AGCGCGACAT CGCCGACAAC CGGGCCCGTC CGGGCCGCGG CTTCCTCGAC CCGGCGCTGG AGATCTCCGC CGCCCCGCTG GACGCGATCG AGGCGGCACG TGACGCGCAC CTGGCGGTGC TCGCCGACGT CAGTCGTCCC GAACTGCACC TGGGCGTCGC CGAGTCCAGC GGGGGCAGCG TCTCGTTCCG TGGGCTCATC ACCCAGCTGG TCACCAGACG GGACCCGTCC GAGCTGGTCT GGTACGTGGG GATCGAGTTC CCCACGGCGC GCGGTGTGGA CTCCGCGCCG GTCACCGACC TGCACCGCCG GTTCGCCGAG GCTGTCTCAA GTCTGTTCTC GGCCGGCGCG GACGGCGCCG GTCAGGTCGA CCCGGCGGAT CCCGGGTCGG ACTCAACGGA CCATGATGCG GACCCGGCGG ACCTCGAGAC GACGGTACTG GTGCGTCATA CCAGGACGGT GCCGCCCTCC CTGCCGCAGG CGTTCGCGGT GGACCCGGAG CGGACCATGC CGGCGGTGCG GGGCGACCGG GCCGCGCCGA CGGCCCGGAT CCGGATCCAG GTCGACGGCG ACACCCGGCG GGTCCTTGAC CTCGCGCATG ACCGGTGCGA CTGGGTTGTC GTCCTGGACC GGTTCCTCGG CCTGGATCTG TTCGACGACC CGTCGCGGCG GGCGTTCGGC GGGCGGCGCT ACATCCTGGA CTACGCGCCC GAGTTCCTCG ACGGCCTCGG CCACCAGATG GCCATCACGA CGGCGCACCG GGCCGACGTC GAGAAAATGT TCCTGCATGC GATGACCGAG CTTGGTTTCG AACAGCCCGG CGAGTCGGTG TCGTCGGTCG TCGACGAGCT GCTGCTCGTC TCCGGTCGGC TGATTCTCGC CGCGACCGGC GACGACAAGC GCGCCAAGGA GGCGGTCGCG CTCGCCGCCG TCGTCTCCCA TCTGCGCCGG CGCGGCGAGC TCGCCGACAC GATCGTCATC CCGGTCGACG CGCATCTCGA CCTGTTCGGG CCGCGTGCCC ACCGTGGCGG TGCCAATGGC GAGAAGGCGC GTCGGTGCGA CCTGCTGCTG GTGCGGTTCC CGGGGCGGCG GCTGCACATC GAGGCCGTCG AGGTGAAGTC GCGCGGAATG CTCGACAGCG 
AGGACCTCGC CCGCGGGATC GACGCCCAGG TCAAGGCGAC CGTCGACGTC GTCCAGCGGC TTTTCTTCGC CGACCCGGCG CGGATCGACC GGCCGCTGCA GCGGACCCGG CTCGCCACCC TGTTGCGGTA CTACCTGCGG CGCGCGGCGC GGCGCGGCCT GGTCACCGAC GCGGTCGCGT TCGGCCGGAT GCAGGAAGGC ATCGACCGGC TCGACACCGC CGACCCGGCG GTCTCCTACC AGCACAGCGG CTACATCGTC GTCCAGCGGG GCGACGGCGT GGACGAGTTC ACCATGGGCG AGACGCGCAT CCGCACGCTG ACCGCCGCGA CCCTCGGCAC AGACACCCCC GATCCGGAGA TCCTTGTTCT GGGCCCGGCC GAGACGCCTG GCGTGTCGGT CGAGAACGGG CCCGACGCGC CACGTCAGCC CGCGGGCCAG CCGGAGGTCC TCCGGGTGCG GGTCGGGAAG ACGCTGCCCC CGGAGGAGGA GGTGGTCTGG GAGGCCGGCA CGATCGGCAG CCCACACCTG TTCATTCTCG GCATCCCCGG GCAGGGGAAG TCGGAGACGA CGATCCGGCT GCTGCAGGGC GCCGCCGACG GTGGCCTGCC CGCGCTGGTC ATCGACTTCC ACGGCCAGTT CAGCTCCGAC CCGCGCCGCC CGTCGTCGCT GCGGGTGCAC GACGCGGCGG CCGGGCTGCC GTTCTCGCCG TTCGAGCTGA CCGAGGCCGG CGGGCGGCAC GCGTACAAAA TGAACGCGCT GTCGATCTCG GAGATCTTCG CCTACGTCTG CGGGCTGGGC GACATCCAGC GCGACGTCGT CTACCAGGCG CTGATCAGCG GCTACGAGGC GCACGGCCAC GGCCAGCTCA TCCCGCCGAG TGGTATCCCG ACACTTGACG AGGTGCGCGG CTCCATCGCG GCGCTGGAGA AGGAGCGCGG TGTGGCGAAC GTGCTCGCCC GCTGCCGCCC GCTGCTCGAA TACGGCCTGT TCACCGACAA CACCGGGGTG AAGGTCCAGG ACCTGATCCG GGATGGCCTG GTCGTCGACC TGCACGGCTT CGCCGAGGTG GAGCAGGCAC AGGTCGCCGC TGGCGCGTTC CTGCTCCGCA AGATCTACAA GGACATGTTC TCCTGGGGCC AGACCGGGGA ACTGCGGCTC GCGATCGTTC TCGACGAGGC GCACCGCCTC GCCAAGGACG CGACCCTGCC CCGGCTGATG AAGGAGGGCC GCAAGTTCGG CGTCGCCGTC ATCGTCGCCA GCCAGGGCAT CGACGATTTC CACCCCGATG TCCTCGCCAA CGCCGGCACC AAAATCATCT ACCGGGTCAA CTACCCCCAG TCCCGCAAGG CCGCCGGCTT CCTGCGCACC CGCACCGGCA AGGACCTCTC CGAGGAGCTC GAACAGCTCC CCGTCGGCAA CGCCTACATC CAGACCCCTC ACATGCCCGT CGCCCGCCGC ACCCGCATGC TCCGCCCCGA GGCCTGA
|
Protein sequence | MTDLSPLLAT RVANRLVTDF LRAAPPGRCM RLDHLRADDC HAVRDAVTNA LTETAVGGGC VVAVLGAAVS DDDAVISPER AIELRNRKSA ALLLLVPAGT DSPAASSLEN SFESIDIEDL FAKIVRDALH RLDRPLRDLF GQVQATVRRG AFPRSRQARA EYLLAVGAAA ADPVAGGAGA AQYVAGTHLH LLGLIPDQGG ASFVSRLDAN AQCVTALVKP GRSQKDARTR LADCPLRQDD ERYRAIERFL VAVPRLSERA WLRDLAADDN ADLAFNRWPL DLPVDSDLVA LQIEPFANTE GIIAAGTGLR AGADGAVPTC SARGGFVKVT WTTEPTRPKD VARWQVEIVP SEEFYGNETD FDVTLPAAKV AGSKHSHRLT VNVDTEGAER VPAVVTVRVS ALDRNNQVLR LRDGALAQAT GQKLAQATSQ EFALDDAPPP ESTAVRRDTA ISLPVARLRA EQAGADSDVE TSEGWQVADL AYLQLRFGRG GRTHVARIGV SAPLRELSAR ALAELDNVGR YEAEVGAGQP FAAAAAAAVG PGWPSGAGRA FLAARREFFE AVRNQPGRGL LEVAALSEEL AALGAKYTAA YGRILRRDSG AADLRALTSI DTLRLRIRLG RRHPVDALVM LPTHPLRVAW YLGYHAELEA WRTRLRVLAP KDRRDEVDVD LLGRLSPARV PFLLAGPDGA PYVFAQNLRL LYGLYLPVDE PDPATAVSET VTALGLPSAD VSVGEVPPAR LADRIRLYRD THRGPERLRI LATNPGSGAF LGEALRAVTA EPGDNAAGRE PGEDEAATAP HIWIEAFGEG ASETNPLPGL RELQRDIADN RARPGRGFLD PALEISAAPL DAIEAARDAH LAVLADVSRP ELHLGVAESS GGSVSFRGLI TQLVTRRDPS ELVWYVGIEF PTARGVDSAP VTDLHRRFAE AVSSLFSAGA DGAGQVDPAD PGSDSTDHDA DPADLETTVL VRHTRTVPPS LPQAFAVDPE RTMPAVRGDR AAPTARIRIQ VDGDTRRVLD LAHDRCDWVV VLDRFLGLDL FDDPSRRAFG GRRYILDYAP EFLDGLGHQM AITTAHRADV EKMFLHAMTE LGFEQPGESV SSVVDELLLV SGRLILAATG DDKRAKEAVA LAAVVSHLRR RGELADTIVI PVDAHLDLFG PRAHRGGANG EKARRCDLLL VRFPGRRLHI EAVEVKSRGM LDSEDLARGI DAQVKATVDV VQRLFFADPA RIDRPLQRTR LATLLRYYLR RAARRGLVTD AVAFGRMQEG IDRLDTADPA VSYQHSGYIV VQRGDGVDEF TMGETRIRTL TAATLGTDTP DPEILVLGPA ETPGVSVENG PDAPRQPAGQ PEVLRVRVGK TLPPEEEVVW EAGTIGSPHL FILGIPGQGK SETTIRLLQG AADGGLPALV IDFHGQFSSD PRRPSSLRVH DAAAGLPFSP FELTEAGGRH AYKMNALSIS EIFAYVCGLG DIQRDVVYQA LISGYEAHGH GQLIPPSGIP TLDEVRGSIA ALEKERGVAN VLARCRPLLE YGLFTDNTGV KVQDLIRDGL VVDLHGFAEV EQAQVAAGAF LLRKIYKDMF SWGQTGELRL AIVLDEAHRL AKDATLPRLM KEGRKFGVAV IVASQGIDDF HPDVLANAGT KIIYRVNYPQ SRKAAGFLRT RTGKDLSEEL EQLPVGNAYI QTPHMPVARR TRMLRPEA
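The sequences above are printed in space-separated blocks of ten. As a minimal sketch, the following Python shows one way to strip that formatting and compute a GC fraction. The helpers clean_sequence and gc_fraction are hypothetical names, and the sample string is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the 73% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record (illustrative only).
sample = "ATGACCGATC TCAGCCCACT GCTGGCCACC"
seq = clean_sequence(sample)

# Report the cleaned length, whether it starts with ATG, and its GC fraction.
print(len(seq), seq.startswith("ATG"), round(gc_fraction(seq), 3))
```

Applying the same two helpers to the full Gene sequence field would give values comparable to the Gene Length and GC content fields.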