Gene Franean1_2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2647 
Symbol 
ID5671040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3129662 
End bp3133573 
Gene Length3912 bp 
Protein Length1303 aa 
Translation table11 
GC content55% 
IMG OID641241562 
Producthypothetical protein 
Protein accessionYP_001506982 
Protein GI158314474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.958765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGGTC GTAGCCTAAT CGAGGGTCGA CTTCCGTCAA GTTCCATCCT GACTCTCGTT 
AAACCGCTCG TAGGTGGCCA TACCGACGCT CTTGTCTTTC TCTGCGACAT AGCAGGAAAT
CAGACTGCGG AAGAACTCAA TGGACAATTC ATCCTGAAGA TCGGGCGTGC GGAGCATAGT
CAGGGTATCG CCCATCAAGA GTTTTGCCAG AATCTGGGGG AGTTTGGGAT TGATCACGTC
CCGCGCCATC TAATGGGCGT GCTAGACATA GGCGTCAGTG TGGACCTGTA CGACGTTGCA
GGGTTCAGCC TGGACAGCGT ACGTACCGCA GAACAGCTAG ACCACGAAGA CCTTGTAAAG
GTGTGCTTCA GAGTCGCCAG CGAACTACTC CCTGCTCAAC TCCCCACAGG TCGTAGTCCA
CAATACACAG AGACAGTGGG CCAGGTGTTT CATCGCTGGC TCGGCTCCAA TTTTCCTGCC
AATAGGCGTG GTAAGTCACT CCGTCAGCTG GCTCGACGTA TAGGGGTGAC TGGACGAACT
TTTCGATTCG ATGGCGAGTT GCTACCAAAC CCACTCTCGG TGCTAGAGCG AAACTCCACA
CTTGAACGAA CAAAAATCCC GTACTTCCGA GGCTATGCGC ATGGAGATCT ACACCTCCGC
AATGTGCTAG TCGGCGGTTC AATCTACACG CGCAACCTAA GTTATTGGTT GATCGACGTG
AGCTGGGATG AACCGTCTCC GCTGCTATTT GATCAAGCAT ATCTTGAGCT TTCCGTTCTC
CTTCACACGC TGCCAAACGC AGGTAGCGGG CGGGTACTCG GCTTGCTGAG CAAGATAGAC
AACGAGCATC TAACCGTGCC CGCCAGGCTT AGCAGTACGG ACAGCGCCAT AGTCGATCTG
ATAGAAGGGA TCCGCTCTAC TACGATTGAG AAGCTGGAGG AACAACAGCC GAGGCGTCGG
GACGTTTGGC GTCAACAGTA CGTTCTTTCA CGTATCGCAG CAGGTCTAAA TTGGGCCGCT
AAGCCGTTGG GTGATATAGC CCTTCAGCGA GCAGCCTTCA TTGCTGCTGC CTGGGCTACC
AGGGGACTAC TACGCGACTT CGATGACTAC AATTCCTTGT GGAATGATCT AGCCAGAGAT
GATCTTCAAA CTCAGACAGG ATTCGCCCCT CTCGGCGAAC CAGTACTTGC TGCAGAGGCC
ATCGAGCGCT GGACTCCTTT TAGTGCGCTA GATACGGGAT CGGATCTCTT CCTTATCGCC
GACCGTACCT CTGCTGACGA GCAGCTAAAC TCCTTCGCTG CATGCCAATG GGCAGCGGTC
ATGGATCTTA ACCCCGAAAG CGATGAAACT GGTCTCGCGC GGGCGATCTT GCCATCCCTG
CGATCAAGAC GCCATGTAAG CATGTTTGGT GAGAACCGGC AGTTGACCTC GCCGGGAGAG
TCGACTAACT GGCTGATGGC CAACGGTTGG CGTAGCCGCA ACGAGCCGAC GGCAAGTTCA
GACGCAGAAT GGAGACGGCG GGGATATCGC CCGCGGGTAA GGCAGCTTGT CGATGATATT
GTGGTCGGTA CGCCGAACCG TGGTGCGGCC GTTCTATGTC TGCGCTCCGG GGATAACGAC
ATGCTGATCG ATTATATTCT TGATTATATT GACGAGAAAT ATGACGGCAT CGCGGCTCGA
TTGGACCTTG CTACGACGCC GGGTGCGGAA GGTCTGGACC TCGATGTCTT TCTTGCTGCT
GTAGCCACGT CGCTGCCGAT CGCGAGTATC GGGCGGGAGG CTTCCATACC GGGTGCCGAT
GGTCCGTTCC GCCTGGATCG TGCAGACTTG CATCGCCTAT CGGTAGATTT GGAGGTGCTC
CATTCGCATG TACTTGCCGA AGGGCAGGCA ACTATTAGGG AAACGGATGC CTTCTGGCGA
GGTCGTCCTC CAACATGGGC TGATCTGGAA GCCTGGATCG ATGTACAGCG CGACGCTTAT
CCGGACCTTC TTGGAGAGCT GAGAGATCGC CTTGAGGATC GCCAGCTTGC ATGTGTCGAA
TTTGAGCACT CACCCGGAGC TGGTGGTACA ACGCTCGCGC GTCGGGTTGC ATGGAACCTT
CACAGAATTC ATCCTGTTGT TCTCCTGCGA AACTACACTC CGACGACCGT TGAACGTGTC
AATGAGCTCT ACCAGGACGC GGGTCGTCCC CCTTTGGTGG TGGCGGAGTC GGCAGACCTG
CCCGAGTCCG ACCGGGATGA GTTGCTCCAT GATCTGCAAC AACGGAACAG TCGTGCGGTG
GTGCTTTGGG TCAACCGAAC GAACGCGCAG CGGAATCTAC GACATCAGCT AATCGATCCA
GTGTCGGGCT CCGAACGGCA GCGATTCATC ACAGAGTATC TTCGCCGTGC TACGACTCCC
AAAGCGCGCA GACTGCTTGG CGAGATCGCG GAAAGTGATT CGGCGTCGCT TCCAGCGCAA
CGCCTATCCC CCTTCTACTT CGGACTCTGT GTTTATGAGA GTCAGTTTGA AGGCGTTGAA
CCATACGTCC AATACCACAT GGCAAAGCTC GCGGGAACGC ACCTAAAAAT AGCAGGATAT
CTTGCTCTGG TGACCCGGTA CGCCCAGGTT GGTATCCCAA TCGACCTCGT CAGGCGGTGG
TTGGCCGAAT CGCCACCCCA GGCCGGGGGC TATGGGGATA AGGAGCTTCG CGCTCTGCTT
GGACCGGACC TACGTAACCT GGTGGTCAGC GAGCGCCATG GTCTGCGTTT GCTACATCCG
CTACTAGCCG AACACGTCCT TGCCGGCGAC CCAAGTAGAC CACGATTCGG ACTTGCCCAG
ATCTCGGTGG AGCTCATAAG AAAAACTACC GAGTACCTTG GCCCTGAGAA CCAGGCGACT
CGCCGTCTAC TTGAAGACCT ATTCGTGCGA CGCAAGGGCT GGAGTGAAGG TAGGCAGCGG
CCAGATCTCT TCTCGGAGCT TGTTCAGGAT ATGCCCACGG ACGCCGCAGA ATGGGTTTTT
GAGGAGCTGA CTACACGTTG TCCGAAGGAG CCCCACTTCT GGAATCACCG CGGTAGGTAT
CATATCTATC GAGTAAGAGG AGATTTCGGG CGGGCCGAAG GCTTCCTTCT GCGCGCCGTG
GAGGAGTCCT ACGGAAGGAA CTCTACCCAC CTTCACACGC TTGGTATGGT TCGACGGATC
TGGATTGAAA ACGAAATGGA GGAGCTAGCT AAATCGAGGA TGCAAGTCAG ACCCGAACAG
ATACTGGAGC ATTTCCGTCC ATTGTTTGAT AGTGCAATGG ATGCATTTGC GCGAGGCCGA
GATGATCCGA ACAGCGCCCA TAGTTGGGTG ACACCTATTC AGCTCATTGC AACTGTAGTG
GAATATCTGG TGAAATTTTC CGGAGCTCGA AACCTAGTAG AGTTCCTCGA AGGTCGCTAC
CAAACTAGCA ATTGGGTGGG GCAGCAGCTA GCCCAGGCCG AAGTGCTTCT AGACGGGCTC
CGGTCGAATT TTGCAGACGA CCGCCGTCAG GCCAAATACT ATGCTGAACT AGCCGAGCGG
TTCGACTTGC TTTACGGTGA CCTCAATGCG CTAGTTGAAC AATGGAGGAA TCTTCGGCAT
GCCATCAACG GGCGAGCCGC CGGACTCGGT GTCGCAATCG CTCGAACTTT GTACGCCCAT
GCAGGCCGAG ATCTTTCCAG GCTTTCCGAG GATGAGACGC GAGAGATTGT TTCTATGGTC
GAAGGGTTGG TGGAATCCGG CGAGGCGACC GATGCCGATC TGCGCCTGTG GTATCAGGCG
TATCGTAGAC TTCCTGAGTA CTCGGAGACG AGGTCTCTGG AGCGGTTCAG TTGGTACGCG
TCAACGCGCG GCAGCTTTGG ATTCAAATTA CTATCTGTAT GTAATGCATT TCTTGAGATG
GTTCCGTGGT GA
 
Protein sequence
MVGRSLIEGR LPSSSILTLV KPLVGGHTDA LVFLCDIAGN QTAEELNGQF ILKIGRAEHS 
QGIAHQEFCQ NLGEFGIDHV PRHLMGVLDI GVSVDLYDVA GFSLDSVRTA EQLDHEDLVK
VCFRVASELL PAQLPTGRSP QYTETVGQVF HRWLGSNFPA NRRGKSLRQL ARRIGVTGRT
FRFDGELLPN PLSVLERNST LERTKIPYFR GYAHGDLHLR NVLVGGSIYT RNLSYWLIDV
SWDEPSPLLF DQAYLELSVL LHTLPNAGSG RVLGLLSKID NEHLTVPARL SSTDSAIVDL
IEGIRSTTIE KLEEQQPRRR DVWRQQYVLS RIAAGLNWAA KPLGDIALQR AAFIAAAWAT
RGLLRDFDDY NSLWNDLARD DLQTQTGFAP LGEPVLAAEA IERWTPFSAL DTGSDLFLIA
DRTSADEQLN SFAACQWAAV MDLNPESDET GLARAILPSL RSRRHVSMFG ENRQLTSPGE
STNWLMANGW RSRNEPTASS DAEWRRRGYR PRVRQLVDDI VVGTPNRGAA VLCLRSGDND
MLIDYILDYI DEKYDGIAAR LDLATTPGAE GLDLDVFLAA VATSLPIASI GREASIPGAD
GPFRLDRADL HRLSVDLEVL HSHVLAEGQA TIRETDAFWR GRPPTWADLE AWIDVQRDAY
PDLLGELRDR LEDRQLACVE FEHSPGAGGT TLARRVAWNL HRIHPVVLLR NYTPTTVERV
NELYQDAGRP PLVVAESADL PESDRDELLH DLQQRNSRAV VLWVNRTNAQ RNLRHQLIDP
VSGSERQRFI TEYLRRATTP KARRLLGEIA ESDSASLPAQ RLSPFYFGLC VYESQFEGVE
PYVQYHMAKL AGTHLKIAGY LALVTRYAQV GIPIDLVRRW LAESPPQAGG YGDKELRALL
GPDLRNLVVS ERHGLRLLHP LLAEHVLAGD PSRPRFGLAQ ISVELIRKTT EYLGPENQAT
RRLLEDLFVR RKGWSEGRQR PDLFSELVQD MPTDAAEWVF EELTTRCPKE PHFWNHRGRY
HIYRVRGDFG RAEGFLLRAV EESYGRNSTH LHTLGMVRRI WIENEMEELA KSRMQVRPEQ
ILEHFRPLFD SAMDAFARGR DDPNSAHSWV TPIQLIATVV EYLVKFSGAR NLVEFLEGRY
QTSNWVGQQL AQAEVLLDGL RSNFADDRRQ AKYYAELAER FDLLYGDLNA LVEQWRNLRH
AINGRAAGLG VAIARTLYAH AGRDLSRLSE DETREIVSMV EGLVESGEAT DADLRLWYQA
YRRLPEYSET RSLERFSWYA STRGSFGFKL LSVCNAFLEM VPW