Gene Franean1_5240 details

Gene Information

Locus tag: Franean1_5240
Symbol:
ID: 5673574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6296933
End bp: 6300094
Gene Length: 3162 bp
Protein Length: 1053 aa
Translation table: 11
GC content: 73%
IMG OID: 641244094
Product: hypothetical protein
Protein accession: YP_001509504
Protein GI: 158316996
COG category:
COG ID:
TIGRFAM ID:
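
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that adds no residue to the protein. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 6296933
end_bp = 6300094
gene_length_bp = 3162
protein_length_aa = 1053

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in triplets. Assumption: the last codon is a stop codon
# and therefore does not contribute a residue to the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 3162 bp is 1054 codons, which is 1053 residues plus one stop codon.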


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.170249
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATTT ACCAGCCCAT GCTGTTCGTC GGACTGGGCG GAACAGGCTG CCTGGTCGGC 
GCCGAGCTCG AGCGCCGGCT GCGGCACGAG CTGTGCGGCC CGGACGGCCG GGCGCTGAAC
CTGCGGGTGC CCGGCAAGAA CTATCTGCCC TACCAGCTGC CGTCCTGCGT GCAGTTCGTC
TACGCCGACC TCAACGAGTC CGAGCTCACC AGGCTGCGCA CCCGGGTCGT CCCCTCGGAG
GAGCACGCGA ACGCGGCGGC GCCCACCCAG CACGTCACGC ACGGCCTCAT CCCGACCGTC
GACACCTACC CGGACGTCGC CCGCAGCCTG CGGGTGAACG CGCGGGAGCT GGTCCGGGGC
TGGCTGCCGC CGGCCGAGGG CGAGCCCCGG GTGGCGCCGC TGATGCGGGG GGCGGGCCAG
CTGCCCACCG TCGGGCGGGC GGCACTGTTC GAGACGTTCC GGTCCGGGGT GGCGCCGGCC
CGCCGTCCAC TGTCCGACGC AGTCGGCAAG ATCGCGACCT CCGGCCAGGA GCTGGCGGCG
CTCGGCGGCC GGCTCGGGCA GACCTGTGAC GTCTTCGTCG CCTTCTCCGT CGCCGGCGGC
ACCGGCGCCG GCATCTTCTA CGACTACCTG CACCTGATCG GCGACGCGTT GCAGCGCTCG
AAGGTCCGGG CGCAGATCTA CCCGCTGGTG CTGATGCCCT CGGCGTTCGA CGAGGGGGTC
GGCGGCGGCC GGCGGGCCCG GCTGAACGCC GGCCGCGCGC TGCTCGACCT GTTCCGCCTC
ATCGACGACC AGAACGGCCA CGACGCGGGG ACGGAGCTGA CCGCGCGCGG CACCTCCGGC
ACGCTCGGCG TCCAGTACCC GGACGGCGAG GGCCAGATCA ACCTGTTGCC CTCGACCATC
CAGACCGGAT TCCTGTTCAA CCGGGCCGAC GGCGTGGAAC GCGACGACCT GCACCGCTCG
GTCGTGTCGC TGATCCTCTC CCTGGTCGGC ACCGGGCAGG AACACGACGA CGGCGACGCC
ATTGCCGACC GGATCTACCA GTCGTTCGCC GACGACTTCA TCAACCGCGG CGTCGAGCGG
GAGGTGCCGG CGTCCTCCGG CATCGGCCGG CGGGGCGTGT CGACGAGCCT GGTCGCGTCC
ATGACGATCC CGGTCGAGGA GCTGGCCGAC CTGGTCGCCT CCCGGCTGCT GGCCCGCGGG
GCGTACGACC TCGCCGCGCC CGCCCCGGGC ACGGCCGCCG ACGACGCCGG CATGATCAGG
CGGTTCGCCT CCGCCGCGAA CATCGGCCCG ATGGTCACCC GGCAGCCCTT CCAGTTCACC
GAGCCGGCAC CGGCCCGCGG CGCGGCGGAG ATCCTCTCCG CGCTGCGGGC CCGGCTGCAG
GCCATGGAGG GCAACCTGGC GGGCCTGGGC ACGATGGTCC GCCAGCAGGT GCCGGCGATC
GCCGGCGCCT TCGACCCGGC GGCCGGCCTC GACGACCTGC TCGGCGAGGT CGACCTGCTG
CACGCCCGCC GGATCGTCGA GGGCCGCGCG GGGGAGAGTG ACCCGGTGAG CCGGGGCGGG
GTGGTCGGCT TCCTGGCCAA CCGCCGCACC GAGCCGCCCG CGCCCGCCGG GATCTCGGTG
AACCCACCGG CGCCGCTGGC GATCCGCGAC CGGCGCCTCG GCGCCAAGGT GCGCTGGGGC
GACCGGGAGG TGGTGGAGAC CATCCACCGC CAGGACACCT GGTACAGCTG GCGTACCCGG
TGCGTCTGGA ACGCCGCCTG GGACGAGCAG CAGGTGCGCT GGGAGAAGGC CCTGCGGCGG
TTCCGCGCGC AGCTGAGGCT CGCCGCCGAG GCCTTCGACG AACACAGCCG CTCCGAGCCC
GGCGAGTTCG CCCGGCGCAC GGCGGACCTG TTCCGGCCGC GGGTCGGGGT GACCTACCTG
CTGCCCCCGC ACGGCGACAT GGACGACTTC TACCAGAGGG CCCTGCAACG GCTCACGTCC
GCGCTGGGCC TGCGGGGAGC CGCCTCCGAG GGCGACGTGA TGCACCGCCT GCTCGGCCCG
GACGGCTGGC GCCGGGTGTG GGAGGCGACC GTCACCAGGG GCGGGGAGGC CGCCGTCGCC
GTCGCCCGGG AGCGGCTGCA GGAGGCGGTG AAGCGGCTGT TCCAGGAGTC CGACGGGCTC
GACGGCGAGC CGCTGCTGCC GACGATGGCG AGCCTGCTCG CCGCGACGGT GCGCCGGGAC
GGCCCCGCCG CGGTCGGGGA GGACGACATC CGCCAGTTCC AGACGAAGAT CCACGGTCTG
GTGCCGGCCA GCTTCTCGCC CCAGGGCTCC GGCAATCTGA AGATCCTCGT CTCCTACCCG
GCCGGGGCCC GGGACACCCA GATCGAGGGC TACCTCGCCC GCGCGATCCG GCTGCCGAAC
GAGAGCGGCA TCTCGATGGA GTTCCGCCCG ATCAACGCGG ACTCGGTCGC GGTCGTCCTG
CTGCGCACCT CGATGAGCAT CACCGAGGTG CCCGAGCTGC GGGAGATCCT GCACCACTGG
GCCGACGCGC TGCGCAACGA GCAGTCGCAG GACTTCCTCA AGTGGCGCCA GCGGCTCGGC
TTCGACTACG GCTGGCTGGC CACCACCGAG GAGGACCGGG TCCGCATCCT GCACCGGCTG
CTCTGCGCGA TGTGGAACGG GCAGGTCCAG GCGCTGGCCG GCGGGACGGA GTCCCCGTAC
TCGATCCGGG TCTCGCTCGG TGACCCCAAC GCCGACACCG ACAGCGACGG CGTGAGCATG
ACCCTGGCGC TCTCCCCGTT CGAGCCGGCC TCGTCGTGGG GCAGCATGCT CCGCGCCTAC
GAGGACTGGT CGCTCGCGGA CGACGAGCGG GTGCGCCGGG ACTTCTCCGA ACAGCTGATG
CGGGTGGTGC CGATCGGGGT GACCGGCCGG GCCGCCAACC CGCACCCGGT GTTCCGGGCG
TTCGTCGACA ACGCGGACAA GCAGGCCGAC CTGCTCGCCG AGATGCTCAT GAAGCTGCCG
CCGGGCAGCC GGGGCTGGGC CGAGCAGCAA CACGCCTTCT GGGCGCACAC CGTGCCGGCC
GCGCTGGACA TGGGGTTCTC CAACGTCTCC ACCCCGGTGC GGGCGAACCT GCGCCAGCTC
TACGAGATGG TCGGCACACG CGGGAAGGAC CTCGGCAGGT GA
 
Protein sequence
MNIYQPMLFV GLGGTGCLVG AELERRLRHE LCGPDGRALN LRVPGKNYLP YQLPSCVQFV 
YADLNESELT RLRTRVVPSE EHANAAAPTQ HVTHGLIPTV DTYPDVARSL RVNARELVRG
WLPPAEGEPR VAPLMRGAGQ LPTVGRAALF ETFRSGVAPA RRPLSDAVGK IATSGQELAA
LGGRLGQTCD VFVAFSVAGG TGAGIFYDYL HLIGDALQRS KVRAQIYPLV LMPSAFDEGV
GGGRRARLNA GRALLDLFRL IDDQNGHDAG TELTARGTSG TLGVQYPDGE GQINLLPSTI
QTGFLFNRAD GVERDDLHRS VVSLILSLVG TGQEHDDGDA IADRIYQSFA DDFINRGVER
EVPASSGIGR RGVSTSLVAS MTIPVEELAD LVASRLLARG AYDLAAPAPG TAADDAGMIR
RFASAANIGP MVTRQPFQFT EPAPARGAAE ILSALRARLQ AMEGNLAGLG TMVRQQVPAI
AGAFDPAAGL DDLLGEVDLL HARRIVEGRA GESDPVSRGG VVGFLANRRT EPPAPAGISV
NPPAPLAIRD RRLGAKVRWG DREVVETIHR QDTWYSWRTR CVWNAAWDEQ QVRWEKALRR
FRAQLRLAAE AFDEHSRSEP GEFARRTADL FRPRVGVTYL LPPHGDMDDF YQRALQRLTS
ALGLRGAASE GDVMHRLLGP DGWRRVWEAT VTRGGEAAVA VARERLQEAV KRLFQESDGL
DGEPLLPTMA SLLAATVRRD GPAAVGEDDI RQFQTKIHGL VPASFSPQGS GNLKILVSYP
AGARDTQIEG YLARAIRLPN ESGISMEFRP INADSVAVVL LRTSMSITEV PELREILHHW
ADALRNEQSQ DFLKWRQRLG FDYGWLATTE EDRVRILHRL LCAMWNGQVQ ALAGGTESPY
SIRVSLGDPN ADTDSDGVSM TLALSPFEPA SSWGSMLRAY EDWSLADDER VRRDFSEQLM
RVVPIGVTGR AANPHPVFRA FVDNADKQAD LLAEMLMKLP PGSRGWAEQQ HAFWAHTVPA
ALDMGFSNVS TPVRANLRQL YEMVGTRGKD LGR
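
Both sequences are printed in space-separated blocks of ten characters, so the spacing has to be removed before they can be compared. The sketch below strips the whitespace and translates the first 60 bases of the gene sequence, then compares the result with the first 20 residues of the protein sequence. The helper names `clean` and `translate` are hypothetical. The codon table is written out by hand and uses the standard amino acid assignments, which translation table 11 shares for internal codons (the tables differ only in which codons may act as start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_prefix = "ATGAACATTT ACCAGCCCAT GCTGTTCGTC GGACTGGGCG GAACAGGCTG CCTGGTCGGC"
protein_prefix = "MNIYQPMLFV GLGGTGCLVG"

print(translate(gene_prefix) == clean(protein_prefix))
```

Passing the full gene sequence to `translate` in place of `gene_prefix` would extend the same comparison to the whole protein sequence.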