Gene Franean1_5417 details


Gene Information

Locus tag: Franean1_5417
Symbol:
ID: 5673748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6534137
End bp: 6539233
Gene Length: 5097 bp
Protein Length: 1698 aa
Translation table: 11
GC content: 73%
IMG OID: 641244272
Product: hypothetical protein
Protein accession: YP_001509678
Protein GI: 158317170
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
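
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6534137
end_bp = 6539233
gene_length_bp = 5097
protein_length_aa = 1698

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print("span (bp):", span_bp)
print("codons:", codon_count)
print("expected protein length (aa):", expected_protein_length)
```

Under these assumptions, the span computed from the coordinates agrees with the listed gene length, and the codon count minus one stop codon agrees with the listed protein length.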


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.265481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.989313
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC TCAGCCCACT GCTGGCCACC CGGGTCGCCA ACCGGCTGGT GACGGACTTC 
CTGCGCGCAG CTCCCCCCGG CCGCTGCATG CGGCTCGACC ACCTGCGCGC CGACGACTGC
CACGCGGTAC GCGACGCGGT GACGAACGCG CTTACCGAAA CCGCGGTGGG CGGCGGATGC
GTGGTCGCCG TCCTCGGCGC CGCGGTCAGC GACGACGACG CGGTCATCTC CCCAGAGCGC
GCGATCGAGC TGCGTAACCG CAAGTCGGCC GCGTTGCTGC TGCTCGTGCC GGCAGGCACC
GACAGCCCGG CGGCCAGCAG CCTGGAGAAC TCGTTTGAGT CGATCGACAT CGAGGATCTG
TTCGCCAAGA TCGTCCGGGA CGCGCTGCAC CGGCTGGACC GTCCCCTGCG GGACCTGTTC
GGCCAGGTCC AGGCGACGGT GCGCCGCGGC GCGTTCCCGC GCTCGCGCCA GGCCCGCGCC
GAGTACCTGC TCGCGGTCGG CGCCGCCGCG GCCGACCCGG TCGCCGGCGG CGCCGGGGCC
GCGCAGTACG TCGCCGGAAC GCACCTGCAC CTGCTCGGCC TGATCCCGGA CCAGGGCGGC
GCGTCGTTCG TCTCCCGGCT CGACGCGAAC GCCCAGTGCG TGACCGCTCT GGTCAAGCCG
GGCCGCTCGC AGAAGGACGC GCGCACCCGG CTCGCCGACT GCCCGCTGCG CCAGGACGAC
GAGCGTTACC GGGCGATCGA GCGGTTCCTC GTCGCCGTGC CGCGGCTGTC CGAGCGTGCC
TGGCTGCGCG ACCTGGCCGC CGACGACAAC GCGGACCTGG CCTTCAACCG CTGGCCGCTG
GACCTTCCCG TCGACAGCGA CCTGGTGGCC CTGCAGATCG AGCCGTTCGC CAACACCGAG
GGGATCATCG CGGCAGGCAC CGGCCTGCGC GCGGGCGCGG ACGGCGCGGT CCCGACCTGC
TCTGCCCGCG GCGGCTTCGT GAAAGTCACC TGGACGACGG AGCCCACCAG GCCGAAGGAC
GTGGCCCGCT GGCAGGTGGA GATCGTCCCG TCAGAGGAGT TCTACGGCAA CGAGACCGAC
TTCGACGTCA CCCTGCCCGC GGCGAAGGTC GCCGGCAGCA AGCACAGCCA CCGGCTGACG
GTGAATGTCG ACACCGAGGG CGCCGAGCGG GTGCCGGCGG TGGTGACCGT CCGGGTCAGC
GCGCTGGACC GCAACAACCA GGTGCTACGG CTGCGCGACG GCGCGCTCGC CCAGGCCACC
GGCCAGAAGC TCGCCCAGGC CACCAGCCAG GAGTTCGCGC TCGACGACGC GCCGCCGCCG
GAAAGCACCG CCGTCCGGCG TGACACCGCC ATCTCCCTGC CGGTGGCGCG GCTTCGGGCC
GAGCAGGCGG GCGCGGATTC GGACGTCGAG ACGTCCGAGG GCTGGCAGGT GGCCGACCTG
GCCTATCTGC AGTTGCGGTT CGGTCGGGGC GGCAGGACGC ACGTCGCCCG GATCGGGGTC
AGCGCGCCGC TGCGCGAGCT TTCCGCCCGC GCACTCGCCG AGCTGGACAA CGTCGGCCGG
TACGAGGCGG AGGTGGGCGC CGGGCAGCCG TTCGCGGCGG CGGCCGCCGC CGCGGTGGGC
CCCGGCTGGC CGTCGGGCGC GGGCAGGGCG TTCCTCGCCG CCCGGCGGGA GTTCTTCGAG
GCGGTCCGCA ACCAGCCCGG TCGCGGCCTG CTCGAGGTAG CCGCCCTCTC CGAGGAACTG
GCCGCGCTGG GGGCGAAGTA CACCGCCGCC TACGGGCGGA TCCTGCGCCG GGACTCCGGC
GCCGCCGACC TGCGCGCGCT CACCTCGATC GACACTCTTC GGCTGCGGAT CCGCCTCGGA
CGGCGCCACC CGGTGGACGC GCTGGTCATG CTGCCCACGC ATCCGCTGCG GGTCGCCTGG
TACCTCGGCT ACCACGCCGA ACTGGAGGCG TGGCGTACCA GGCTGCGGGT GCTGGCGCCG
AAGGACCGCC GCGACGAGGT CGACGTCGAC CTGCTGGGCA GGCTGTCGCC CGCCCGGGTG
CCGTTCCTGC TCGCCGGCCC GGACGGCGCC CCGTACGTGT TCGCGCAGAA CCTGCGGCTG
CTCTACGGGC TGTACCTGCC GGTCGACGAG CCCGACCCGG CGACGGCCGT CAGCGAGACG
GTGACCGCGC TCGGGCTGCC GAGTGCCGAT GTCAGCGTAG GCGAGGTGCC GCCCGCTCGG
CTGGCCGACC GCATCAGGCT CTACCGGGAC ACCCACCGCG GTCCCGAGCG GCTGCGGATC
CTGGCGACCA ATCCGGGCAG TGGCGCGTTT CTCGGCGAGG CGCTGCGAGC CGTCACCGCC
GAACCCGGCG ACAACGCCGC CGGCCGCGAG CCGGGCGAGG ACGAGGCCGC CACCGCGCCG
CACATCTGGA TCGAGGCGTT CGGCGAGGGC GCGTCGGAGA CGAACCCGTT GCCCGGCCTG
CGCGAGCTGC AGCGCGACAT CGCCGACAAC CGGGCCCGTC CGGGCCGCGG CTTCCTCGAC
CCGGCGCTGG AGATCTCCGC CGCCCCGCTG GACGCGATCG AGGCGGCACG TGACGCGCAC
CTGGCGGTGC TCGCCGACGT CAGTCGTCCC GAACTGCACC TGGGCGTCGC CGAGTCCAGC
GGGGGCAGCG TCTCGTTCCG TGGGCTCATC ACCCAGCTGG TCACCAGACG GGACCCGTCC
GAGCTGGTCT GGTACGTGGG GATCGAGTTC CCCACGGCGC GCGGTGTGGA CTCCGCGCCG
GTCACCGACC TGCACCGCCG GTTCGCCGAG GCTGTCTCAA GTCTGTTCTC GGCCGGCGCG
GACGGCGCCG GTCAGGTCGA CCCGGCGGAT CCCGGGTCGG ACTCAACGGA CCATGATGCG
GACCCGGCGG ACCTCGAGAC GACGGTACTG GTGCGTCATA CCAGGACGGT GCCGCCCTCC
CTGCCGCAGG CGTTCGCGGT GGACCCGGAG CGGACCATGC CGGCGGTGCG GGGCGACCGG
GCCGCGCCGA CGGCCCGGAT CCGGATCCAG GTCGACGGCG ACACCCGGCG GGTCCTTGAC
CTCGCGCATG ACCGGTGCGA CTGGGTTGTC GTCCTGGACC GGTTCCTCGG CCTGGATCTG
TTCGACGACC CGTCGCGGCG GGCGTTCGGC GGGCGGCGCT ACATCCTGGA CTACGCGCCC
GAGTTCCTCG ACGGCCTCGG CCACCAGATG GCCATCACGA CGGCGCACCG GGCCGACGTC
GAGAAAATGT TCCTGCATGC GATGACCGAG CTTGGTTTCG AACAGCCCGG CGAGTCGGTG
TCGTCGGTCG TCGACGAGCT GCTGCTCGTC TCCGGTCGGC TGATTCTCGC CGCGACCGGC
GACGACAAGC GCGCCAAGGA GGCGGTCGCG CTCGCCGCCG TCGTCTCCCA TCTGCGCCGG
CGCGGCGAGC TCGCCGACAC GATCGTCATC CCGGTCGACG CGCATCTCGA CCTGTTCGGG
CCGCGTGCCC ACCGTGGCGG TGCCAATGGC GAGAAGGCGC GTCGGTGCGA CCTGCTGCTG
GTGCGGTTCC CGGGGCGGCG GCTGCACATC GAGGCCGTCG AGGTGAAGTC GCGCGGAATG
CTCGACAGCG AGGACCTCGC CCGCGGGATC GACGCCCAGG TCAAGGCGAC CGTCGACGTC
GTCCAGCGGC TTTTCTTCGC CGACCCGGCG CGGATCGACC GGCCGCTGCA GCGGACCCGG
CTCGCCACCC TGTTGCGGTA CTACCTGCGG CGCGCGGCGC GGCGCGGCCT GGTCACCGAC
GCGGTCGCGT TCGGCCGGAT GCAGGAAGGC ATCGACCGGC TCGACACCGC CGACCCGGCG
GTCTCCTACC AGCACAGCGG CTACATCGTC GTCCAGCGGG GCGACGGCGT GGACGAGTTC
ACCATGGGCG AGACGCGCAT CCGCACGCTG ACCGCCGCGA CCCTCGGCAC AGACACCCCC
GATCCGGAGA TCCTTGTTCT GGGCCCGGCC GAGACGCCTG GCGTGTCGGT CGAGAACGGG
CCCGACGCGC CACGTCAGCC CGCGGGCCAG CCGGAGGTCC TCCGGGTGCG GGTCGGGAAG
ACGCTGCCCC CGGAGGAGGA GGTGGTCTGG GAGGCCGGCA CGATCGGCAG CCCACACCTG
TTCATTCTCG GCATCCCCGG GCAGGGGAAG TCGGAGACGA CGATCCGGCT GCTGCAGGGC
GCCGCCGACG GTGGCCTGCC CGCGCTGGTC ATCGACTTCC ACGGCCAGTT CAGCTCCGAC
CCGCGCCGCC CGTCGTCGCT GCGGGTGCAC GACGCGGCGG CCGGGCTGCC GTTCTCGCCG
TTCGAGCTGA CCGAGGCCGG CGGGCGGCAC GCGTACAAAA TGAACGCGCT GTCGATCTCG
GAGATCTTCG CCTACGTCTG CGGGCTGGGC GACATCCAGC GCGACGTCGT CTACCAGGCG
CTGATCAGCG GCTACGAGGC GCACGGCCAC GGCCAGCTCA TCCCGCCGAG TGGTATCCCG
ACACTTGACG AGGTGCGCGG CTCCATCGCG GCGCTGGAGA AGGAGCGCGG TGTGGCGAAC
GTGCTCGCCC GCTGCCGCCC GCTGCTCGAA TACGGCCTGT TCACCGACAA CACCGGGGTG
AAGGTCCAGG ACCTGATCCG GGATGGCCTG GTCGTCGACC TGCACGGCTT CGCCGAGGTG
GAGCAGGCAC AGGTCGCCGC TGGCGCGTTC CTGCTCCGCA AGATCTACAA GGACATGTTC
TCCTGGGGCC AGACCGGGGA ACTGCGGCTC GCGATCGTTC TCGACGAGGC GCACCGCCTC
GCCAAGGACG CGACCCTGCC CCGGCTGATG AAGGAGGGCC GCAAGTTCGG CGTCGCCGTC
ATCGTCGCCA GCCAGGGCAT CGACGATTTC CACCCCGATG TCCTCGCCAA CGCCGGCACC
AAAATCATCT ACCGGGTCAA CTACCCCCAG TCCCGCAAGG CCGCCGGCTT CCTGCGCACC
CGCACCGGCA AGGACCTCTC CGAGGAGCTC GAACAGCTCC CCGTCGGCAA CGCCTACATC
CAGACCCCTC ACATGCCCGT CGCCCGCCGC ACCCGCATGC TCCGCCCCGA GGCCTGA
 
Protein sequence
MTDLSPLLAT RVANRLVTDF LRAAPPGRCM RLDHLRADDC HAVRDAVTNA LTETAVGGGC 
VVAVLGAAVS DDDAVISPER AIELRNRKSA ALLLLVPAGT DSPAASSLEN SFESIDIEDL
FAKIVRDALH RLDRPLRDLF GQVQATVRRG AFPRSRQARA EYLLAVGAAA ADPVAGGAGA
AQYVAGTHLH LLGLIPDQGG ASFVSRLDAN AQCVTALVKP GRSQKDARTR LADCPLRQDD
ERYRAIERFL VAVPRLSERA WLRDLAADDN ADLAFNRWPL DLPVDSDLVA LQIEPFANTE
GIIAAGTGLR AGADGAVPTC SARGGFVKVT WTTEPTRPKD VARWQVEIVP SEEFYGNETD
FDVTLPAAKV AGSKHSHRLT VNVDTEGAER VPAVVTVRVS ALDRNNQVLR LRDGALAQAT
GQKLAQATSQ EFALDDAPPP ESTAVRRDTA ISLPVARLRA EQAGADSDVE TSEGWQVADL
AYLQLRFGRG GRTHVARIGV SAPLRELSAR ALAELDNVGR YEAEVGAGQP FAAAAAAAVG
PGWPSGAGRA FLAARREFFE AVRNQPGRGL LEVAALSEEL AALGAKYTAA YGRILRRDSG
AADLRALTSI DTLRLRIRLG RRHPVDALVM LPTHPLRVAW YLGYHAELEA WRTRLRVLAP
KDRRDEVDVD LLGRLSPARV PFLLAGPDGA PYVFAQNLRL LYGLYLPVDE PDPATAVSET
VTALGLPSAD VSVGEVPPAR LADRIRLYRD THRGPERLRI LATNPGSGAF LGEALRAVTA
EPGDNAAGRE PGEDEAATAP HIWIEAFGEG ASETNPLPGL RELQRDIADN RARPGRGFLD
PALEISAAPL DAIEAARDAH LAVLADVSRP ELHLGVAESS GGSVSFRGLI TQLVTRRDPS
ELVWYVGIEF PTARGVDSAP VTDLHRRFAE AVSSLFSAGA DGAGQVDPAD PGSDSTDHDA
DPADLETTVL VRHTRTVPPS LPQAFAVDPE RTMPAVRGDR AAPTARIRIQ VDGDTRRVLD
LAHDRCDWVV VLDRFLGLDL FDDPSRRAFG GRRYILDYAP EFLDGLGHQM AITTAHRADV
EKMFLHAMTE LGFEQPGESV SSVVDELLLV SGRLILAATG DDKRAKEAVA LAAVVSHLRR
RGELADTIVI PVDAHLDLFG PRAHRGGANG EKARRCDLLL VRFPGRRLHI EAVEVKSRGM
LDSEDLARGI DAQVKATVDV VQRLFFADPA RIDRPLQRTR LATLLRYYLR RAARRGLVTD
AVAFGRMQEG IDRLDTADPA VSYQHSGYIV VQRGDGVDEF TMGETRIRTL TAATLGTDTP
DPEILVLGPA ETPGVSVENG PDAPRQPAGQ PEVLRVRVGK TLPPEEEVVW EAGTIGSPHL
FILGIPGQGK SETTIRLLQG AADGGLPALV IDFHGQFSSD PRRPSSLRVH DAAAGLPFSP
FELTEAGGRH AYKMNALSIS EIFAYVCGLG DIQRDVVYQA LISGYEAHGH GQLIPPSGIP
TLDEVRGSIA ALEKERGVAN VLARCRPLLE YGLFTDNTGV KVQDLIRDGL VVDLHGFAEV
EQAQVAAGAF LLRKIYKDMF SWGQTGELRL AIVLDEAHRL AKDATLPRLM KEGRKFGVAV
IVASQGIDDF HPDVLANAGT KIIYRVNYPQ SRKAAGFLRT RTGKDLSEEL EQLPVGNAYI
QTPHMPVARR TRMLRPEA
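
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first line of the gene sequence. The sketch below is a minimal illustration, not the method used to produce this record. It builds the codon-to-amino-acid mapping by hand, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code for this stretch of sequence. The helper name `translate` is hypothetical.

```python
# First 60 bases of the gene sequence, with the spacing removed.
first_60_nt = (
    "ATGACCGATC" "TCAGCCCACT" "GCTGGCCACC"
    "CGGGTCGCCA" "ACCGGCTGGT" "GACGGACTTC"
)

# Codon table in the conventional T, C, A, G ordering.
# Assumption: translation table 11 uses these same codon assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(nt_seq):
    """Translate a nucleotide string codon by codon, ignoring a trailing partial codon."""
    usable = len(nt_seq) - len(nt_seq) % 3
    return "".join(CODON_TABLE[nt_seq[i:i + 3]] for i in range(0, usable, 3))


peptide = translate(first_60_nt)

# First 20 residues of the protein sequence, with the spacing removed.
listed_start = "MTDLSPLLAT" "RVANRLVTDF"

print(peptide)
print(peptide == listed_start)
```

The same helper could be applied to the full gene sequence after stripping spaces and line breaks. In that case the final codon would translate to `*`, the stop symbol.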