Gene Franean1_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0043 
Symbol 
ID5668469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp51034 
End bp56460 
Gene Length5427 bp 
Protein Length1808 aa 
Translation table11 
GC content70% 
IMG OID641238972 
Producthypothetical protein 
Protein accessionYP_001504417 
Protein GI158311909 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGCG TCATTGGCAA GGTGCACGGG CCGGACGGAT CGTCGTTCGT CGGGCACAGT 
GTCCGCCTCC GTTACACGGT CGACCTCGGC GACCGCGATG CGACGCCGCC GACGGCACTG
CCCGTGCGAC TGGAGTCGGC CGCCCCGCTC GCGGCCGACG GCGACTTCGC GCTGGCGATC
GCCGACGCGC CGCTCGTCGG GACGGCGACG GTGACCATCG TCTCGCCGGC AGGCGTGACG
ATCAACGAGG GGACCCTGTC CGTCGACGCG CTGGCCAACC CGGTGATCGT CTCGGCGACG
CCGGCGGCGC CACTGCCCGT GCTGCCTAAC CATGAGCCGG GCACCGGGCC GCGGGTGCGT
CTGACCGGGC GGGTGATCGA CGACCGGGGC CACCGCGTTT CCAGCGGCCT GCCGGTCGTG
ATCTCCGGGC GCCGGGTCGA CCAGAACGGC GACCCCGAGA TCGCCGACCG CGTCCTGCTC
GTCACGGACA CCCAGCCGGG CGGGACGTTC GGCGCGGACT GGCCGACCGA CGAGCTGGCC
ACAGCGCACG GTGTGGTCAA GGGCCTGCCG CCGGTGCCGA TCCCATTGGT CGGCGGCCGC
CTGCCGCTCA AGGTCCTCCT CGTCGTCAGC CTCCCCGACG ACGAACGCGA GGACTGCGCT
TGCGACAACA CCGCACCGCG CGCCCCCGAC CCGTCCGATC TGACGAGCAA CCCGGCCGCG
TTCTCGCAGG ACATGGGCGG CACGTGCGTC GACCTGACCA CGCCGAACCG GGTGGTCGAG
GAGTTCGCCT ATTTCTTCGT CGTGCGCACG TCGGAGCCGG AGATCCGCGG GACCACGCTC
GGCACCCGTC GGGTGCTGCC GCCGGCGGTA ACCAGGGAGC TACTCGGCGC CGTGCGCGAG
GCCCTCCCCG CGGCGGAAGG AGCAAGCACT CGCGCGATCC GTGGCCTGTC GCTGGACACC
GCCGCGCTGA AGTCGCTCGT CGCCACAGAC GCGATCCCCT CGGTCGACCA GCTGCAGGTC
GCCGCCTACC ACTCCGAGGT GGCCGCGGTG TCCCGGCTCG TCGACGCGCT GCGCCGCCGC
CAGCCCGGAC GCGTCGCCCT CGACGCCTCC TCCGCCGTCG ACTGGGACGA CACGCCGACG
ATCTACCAGG CCGCCTCGGT CGCCCACGGC CACATCCTCC AGTACCGCGA GGTGTGGCGG
GCCGACGGCT ACTCGCTCGG CGACCTGCTC TACTCGCTGC CGCTCGCGCC CGGCCAGAAG
CGCAAGCTCG CTGTGCTCGA CTGGGAGCGC CAGACCACCT CAGCGCGCAC CGAACGACTC
GACTTCGAGG AGCAGCTCGA CGCATTCGCC GAGCGCGATC GCGACATTAG CGAGATCGTC
GGCTCGCACC TGAACGAGGA GTCGAGCGGC GGCTCTCGCT CGAACACGTG GGGCGTCGCC
GGCGGCATCG GTGCGGGGTT CATCGGCTCC GGTTTCGGCA TCTTCGGCGG CGTCGCCGGT
GGCGCCGGCG GGTCCAGCGC GCGGGCGTGG CAGGACAACG CCCGCGACCT GTCGGCCAAC
AGCCTCCAGC AGCTGCGCGA CCGGATGGTG CAGCGCTCCG CTGCGGTGCG CGATGTGCGC
TCGACCACCG TGCAGACCGT CGGGCAGGGC GAGACCGTGC GCGCCGAGAC GGAGACGGTG
GCCAACTACA ACCACTGCCA CGCGATGACG GTGGAGTACT TCAACGTGCT GCGCCACTTC
CTCGTCACCC ACGAGCTCGC CGATGTGCGC GAGTGCCTGT TCGTCCCGCT GCCGATCTCC
GAGTTCGACC GGGCCAAGGC GATGCGCTGG CGCTCGGCGC TGCAGCGCTT CGTGTCCGAT
CCGACACTGC GCCGCGGCTT CGACGCGATC GAGCGGATCA GCCGCAACTG GGTCGGCTAC
GACTTCCCGG TCGACCGCTA CAGCGAGGAG GCCCCGGAGT CGCTCGAAGG TGAGCTGTAC
GTCAGTTTCC TGCTGCCCCG CCCACGCGAT GCGGCCGACG GCACGTTCCA GGTCGACATG
TGGAAGTCGT GCCAATGGCT GCTGCCGATC GACGCGCTGG AGCTGTTCAC CGCCAAGCTC
GCGGCACGCG CCCAGCGAGA ACGAGACCTC GTGTTCCGCA ACGAGATCGC CCCTGGGATC
GCCGAGCAGC TCGTCCAGCA CCTGCGCTTC GCTTACGTCA CCGCGTCCGG CGCGCAGCTC
GGCGTCCGTC TCGACCCGAC GCTCGTGTCG CGCTACGCCG AGTCGACCCC GCTGTACGTC
ACGCTGCGCC CGTCCGGCGC GGTGCCGGCG ATCCCCCGTG AGGATATCGC CCGCTTCCGC
ATCTGGTACG ACGGCCCCGC GCTGCCGTCG GCGGCGCGTG TCATCGTCCA CCGCGGCAAG
GTGCGCTACC GCACCGAGCA CCTCCAATAC CTCTTGTTCG ACGAGCCCCG CCTGCTCAAC
GACATCTCCT CGACCGACGA CATCTACGTC GCCACGCCGC TCTCGCGCGC CGAGCTGCGC
AGCCCCCGCC GTGAGGACAT CGAGGCAGCC GACCGGTTGG TCAAGCACCT CAACACCAAC
ATCGAGCGCG CCCACCAAGC GATCTGGGCG TCGATGGACG AGAACCGCCG TTACCTGTTG
CTCGACGGCG TGATCGCCCC GAACAGCGGT GGGCGCAGTG TCGCCAGCGT CGTCGAGAAC
CGGTTGATCG GCATCGTCGG CAACTGCCTG GTGATGCCGT GCGGGCCAGG GATCCAGCTC
GATCCGAGCC GAGCCGTGCC GCAGGCGCGG ACCGACGGCG AGCCCGTCGA GCCGCAGCCG
CTGATCGACC TGTACGCCAC GGCGCCGCTG CCGTCGACTC GGATCAGCCT GCCGACGCGT
GGCGTCTACG CCGAGGCGGT GAGCGGGGAG TGTAACGCCT GCGAGCGCAT CGACGATACC
CGCTTCTGGC GCTGGGAGGA GTCTGCGATC ACGGACACCC CGCCGGACAT CGCCGCCCTG
TCGACTGCCA GCCGGGCGAG CGACGAGCCA GACCTCACGC CCACACCGCT GCCGGCGGCG
ATCGTCAACA TCCAGCAGCC AGCCAGCCTG CCCGACCCGC TCGGCCTGTC GGCGGCGATG
AAGGTGCTGG CGCGCAACGA CCTGTTCAAG GACATCACCG GGTTGGAGGG CAGCCAGAAG
AACGCGCTCG CCGGGTTCAA GGCGGCACTG GGCATGGCCC AGACACTGGG TGGCGAGGCG
GCCAACCTCG CCCGACAGAA CGAGTCGGCG CGCAACGCCG ACCGGCTGAT GGACAGCATC
CAGCAAGCCC AGCGCGACAA GCTGATCTCG CCGGATCAGG CCCAGCAGTT GACCCAGTCG
GTGCTGCGCA GCGCGTCGGG TGAGGCATCG GCGAAGGACC GCCCGGCCGC GCCGGCGGAG
GACCCGGGCG TGCAGAAGGC GATCGACTCG GCCGCCGACT CGCCGCGCGC CACAGTGTCG
GTGACCACGC CCACCGAGTC GCTGGACGCG ACGTTCGACG GCGGACCGAA GCCACCGGTG
ATCGGCGGCG CGCCGGCGAT CGGCGGCACC ACGCTGACGT TCGACCTCGC CATCCCGTTC
ATCGACGACT GGACACAGCC GGCACCCGGC ACGCCGCCGG GACCGCTCAC CCGCCAGCGC
TGGACCGAAA CGCGCAAGAT CGCCAAGCTG AGCACCGCGA GCTCGGTCAC CCAGCGGACC
GGTCCGACCA GCTTCAGCCG GTTCGACCTG GCCGCCGCGG CGTTGTCCAA CGGCCTGATC
AAGCGCGAGG CTCCCGGCTC CGACACGTTC GTCGTGCCGA CCAAGATGCG CCTCGCCCAC
CCGGCGCCGA GCGCCGGCTC CAAGGTCGTG GCCGGTTCGG GCGCACTGCC GCTGGTCGTC
TTCCTCCACG GCAACCACCA GGCCTGGGAC TTCACCTTCG GACCGCCGAC GTCGACGATG
ACGGCGCTGG ACGGCGCTGG CAACCCGGTC ACCGTGGACA TTGTGGACAC GACGACCGCC
TTCACGGTGA CACCGAACCA TGAGGGCTAC GCCTACCTGC AGGACTCGCT CGCCGCCTCG
GGGTACGCCT CGATCTCGAT CGACACGAAC TTCGCCAACA CGTTCAACTC GGCGATCGAC
ACTCGCGCCC TGACGGTGCT CGCCGCCCTC GACCGGCTGC GCACCGCGGC TGAGACGAAG
GGCAACGCCT ACTTCGGCAG GTTCGACTTC CACAAGGTCG TACTGGTCGG GCACTCGCGT
GGCGGCGACG CCGTGGTCCA GGTGGCCAGG CTGAACGCCA AGCGGGTGAC GAAGAAGTAC
GGCGTGCTCG CGGTGTGCTC GCTCGCGCCG ACCGACTTCA GCGGCACGGC CGCGGCCGGC
GACGGGCCGT TCGTCCTCAC GCCCAACGAG ACCGGGTTCT TCCTGGCGAT CCAGGCCGCG
CTCGACGGCG ACGTCTCCGG CGTCGGCGGG GCCACTGCCG GGACCGGCAC GGTGTTCCGC
CACTACGCCA GGGCCACGTG CCAGAAGGCG CTCGTGCAGC TCACCAAGTG CTGCCACAAT
CGGTTCAACA CGGTGTGGTC GAAGGAGCCG ATCGACCGTG CCCACGACGA CTCCGCGCTC
GTCGACGCTG AGTTCAACGC GCTGCACACG CATGCGACGC ACCGCCAGCT GTTCACCGAG
TACCTCACGG CGCTGCTGGA GAAGCAGGTG CGCAAGAACT CGAAGGACGT CGCCCTGTTC
ACCGGTCAGC GGTCCAACAG CGCCGGCGTC GACGCCGCCC ACCAGTGGGC GTTCGGGCGA
ACCGTGACGT TGATGGACAG CTTCGAGGCA GCCGCCAGCG ATCTGGGCGG CGCACGCACC
CTGACCGCCG CCGCGGTCGG GCCGTTCCTG GACGTCAAGG TGCCGGAGAG CACGCTCAAG
CGGGGCGGCA ACGCGGGGAC GCTCGTAGCG ATCACCGAGC ACGCCACGCA CGTCACCAAG
CTCGCCAACC TCGACACCGC CGTGGTCGCG TCAGGCACGG CGTTCACCAT CGACGTGCCA
TCGGCCAAGC GCGACTGGAG TGGCTTCGAG GTCGTCAACC TCGACGTGGT CGGCACCTTC
GACCTCACCA AGGCGTCCAG CCAACCGGTG CCGGTGCTGA CCGTGACGCT CACCGACGGG
TCGGGCACCA CCAAGCAGCT CACGGCGCGC AACGTGCTCA ACCAGCCGAG CACCCACATC
GTCAACTACG ACCGTGCCGT CCCCAAGGAT CCAACCAAGG AGCAGGACGC CACCCTGTTC
CGGCTGGAGA CGCTGAGCTT CGAGCTGCCG CGGACCAGCA CGGTCGGCGG GATCGACTCG
GGCGACATCG TCTCGCTGAC CGTCGACGTC GACCTGTCCG TCGGCAACAC CGTCCTGATC
GATTCGATCA CGCTCGTCGC GCTCTGA
 
Protein sequence
MESVIGKVHG PDGSSFVGHS VRLRYTVDLG DRDATPPTAL PVRLESAAPL AADGDFALAI 
ADAPLVGTAT VTIVSPAGVT INEGTLSVDA LANPVIVSAT PAAPLPVLPN HEPGTGPRVR
LTGRVIDDRG HRVSSGLPVV ISGRRVDQNG DPEIADRVLL VTDTQPGGTF GADWPTDELA
TAHGVVKGLP PVPIPLVGGR LPLKVLLVVS LPDDEREDCA CDNTAPRAPD PSDLTSNPAA
FSQDMGGTCV DLTTPNRVVE EFAYFFVVRT SEPEIRGTTL GTRRVLPPAV TRELLGAVRE
ALPAAEGAST RAIRGLSLDT AALKSLVATD AIPSVDQLQV AAYHSEVAAV SRLVDALRRR
QPGRVALDAS SAVDWDDTPT IYQAASVAHG HILQYREVWR ADGYSLGDLL YSLPLAPGQK
RKLAVLDWER QTTSARTERL DFEEQLDAFA ERDRDISEIV GSHLNEESSG GSRSNTWGVA
GGIGAGFIGS GFGIFGGVAG GAGGSSARAW QDNARDLSAN SLQQLRDRMV QRSAAVRDVR
STTVQTVGQG ETVRAETETV ANYNHCHAMT VEYFNVLRHF LVTHELADVR ECLFVPLPIS
EFDRAKAMRW RSALQRFVSD PTLRRGFDAI ERISRNWVGY DFPVDRYSEE APESLEGELY
VSFLLPRPRD AADGTFQVDM WKSCQWLLPI DALELFTAKL AARAQRERDL VFRNEIAPGI
AEQLVQHLRF AYVTASGAQL GVRLDPTLVS RYAESTPLYV TLRPSGAVPA IPREDIARFR
IWYDGPALPS AARVIVHRGK VRYRTEHLQY LLFDEPRLLN DISSTDDIYV ATPLSRAELR
SPRREDIEAA DRLVKHLNTN IERAHQAIWA SMDENRRYLL LDGVIAPNSG GRSVASVVEN
RLIGIVGNCL VMPCGPGIQL DPSRAVPQAR TDGEPVEPQP LIDLYATAPL PSTRISLPTR
GVYAEAVSGE CNACERIDDT RFWRWEESAI TDTPPDIAAL STASRASDEP DLTPTPLPAA
IVNIQQPASL PDPLGLSAAM KVLARNDLFK DITGLEGSQK NALAGFKAAL GMAQTLGGEA
ANLARQNESA RNADRLMDSI QQAQRDKLIS PDQAQQLTQS VLRSASGEAS AKDRPAAPAE
DPGVQKAIDS AADSPRATVS VTTPTESLDA TFDGGPKPPV IGGAPAIGGT TLTFDLAIPF
IDDWTQPAPG TPPGPLTRQR WTETRKIAKL STASSVTQRT GPTSFSRFDL AAAALSNGLI
KREAPGSDTF VVPTKMRLAH PAPSAGSKVV AGSGALPLVV FLHGNHQAWD FTFGPPTSTM
TALDGAGNPV TVDIVDTTTA FTVTPNHEGY AYLQDSLAAS GYASISIDTN FANTFNSAID
TRALTVLAAL DRLRTAAETK GNAYFGRFDF HKVVLVGHSR GGDAVVQVAR LNAKRVTKKY
GVLAVCSLAP TDFSGTAAAG DGPFVLTPNE TGFFLAIQAA LDGDVSGVGG ATAGTGTVFR
HYARATCQKA LVQLTKCCHN RFNTVWSKEP IDRAHDDSAL VDAEFNALHT HATHRQLFTE
YLTALLEKQV RKNSKDVALF TGQRSNSAGV DAAHQWAFGR TVTLMDSFEA AASDLGGART
LTAAAVGPFL DVKVPESTLK RGGNAGTLVA ITEHATHVTK LANLDTAVVA SGTAFTIDVP
SAKRDWSGFE VVNLDVVGTF DLTKASSQPV PVLTVTLTDG SGTTKQLTAR NVLNQPSTHI
VNYDRAVPKD PTKEQDATLF RLETLSFELP RTSTVGGIDS GDIVSLTVDV DLSVGNTVLI
DSITLVAL