Gene Franean1_6755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6755 
Symbol 
ID5675068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8215996 
End bp8218743 
Gene Length2748 bp 
Protein Length915 aa 
Translation table11 
GC content72% 
IMG OID641245604 
Productlantibiotic dehydratase domain-containing protein 
Protein accessionYP_001510995 
Protein GI158318487 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCAAG CTGCCCACGC GGCGTTGATC AGGGCTGCTT CCTACCCCCG AGATCTGACG 
CTGCCCACGT GGCCCGATCT GACCGCGAAC CAGGCCGACG AGTGGCTGGA GTGGCTGCGG
GAGGTATGGG CGTTGCCGGA GTTCGCGGCG GCTGTCGGGC AGGCAGCCCC TGACCTGGCC
GATCAGATCA CCCACGTGCT TGCCCAGGAG TCGATGCCAG CGCGCAGGGT GCGGCGTCTG
GTGGAGACGA CCGTGCGGTA TCTGCTGCGG TGGACGACCC GGCCCACGCC GTTCGGCAGG
TTCGCCGGGG TGGCTCCCCT CGCGTTCGGC CCCCGCGCGG CGGTCTGGTG GGGCGACCAG
CATCACGAGG TGGTCCGACT GGACGACCGC TCCGTCGCCG AATACACGGC GGTGACGGAG
CGGGACCTGG CGGTGCTGCG CGGGGTCACG GTCATGACGA ACACGCTGGG GTATCGGCGT
GGCGGGGTGT GGGTGCTGCC CTGCGCCCGC GTCGAAGGTG ACCGGGTGTG GGATGTCGAG
ATCAACCTGA CTGCCCCGGT GCTGGTGGCG GTGGAGAAGG CCCGTGCCCC GATCCCGTTC
CGGGAGCTGG CCGCGACGGT CGCCGAGGAC CAGGCCATCG GAACCGCGAA GGCGGAGCGA
CTACTCGGCG CCCTGGTGGG CGCGGGGGTG TTGCTGTCGG CGGTCCGGCC GCCGATGACC
GTGACCGACC CGGCCGCGCA CCTGGCCCGC CACATCGCTC TCCCCAACCC AGGCGAACGG
AGCGCGGTCG ATCTGCGGGT TGACTGCTCG GTGACGCTGC CGCCCGCGGT GGTCCGCGAG
GCGCAGGAAG CCGCCGCGGC GCTCGTCGCA GTTGCGCCAC GCCTGCCTGG CTGGGCCGCC
TATCATTCCG CGTTCAGTGA GCGGTGGGGG CCGGGCGCGG CTGTGCCGCT GCGGGAGGTC
GTGGGCATTC TCGGGTTCCC GGCCGGTTAC CGGGGCTCGC TGCGCCGTGA TGCGGCGACG
TTCACTGCCC GGGATGCTCT GCTCGCGACG CTCGCCCAGC GCTCTGCCCT GGACGGATGC
GCCGAAGTCC TCCTGGACGA CGATCTAATC GGGCAGCTTC GCAGCGAGGA CGACCGGCCG
CCGATCCCGC ACACCGAACT GCGGTTCACT CTCGCCGCAG GAACGCTTCA GGACCTCGAC
CGCGGCGCGT TCACCCTGAC GGTCGTCAGT GGAGCCCGCC ACGCCGGCGT GGCAGGTGGC
CGCTTCCTGC ACCTGCTCAC CCCCACCGAG CTGGACCAGT TCCGCAGCAT CTACACCAGC
CTGCCGACTG CCTTACCCGG CGCGGACGCC GTACAGCTGT CCGGGCCTCC GCTCGACCCC
AGGCTGGCCA CCGTCGCCCG CACGCCCGAG CTCCTACCGG TGCTGCCCGT CGGCGACCTC
CATGCCGACC CGGTGTGCAC GGTGGACGAC CTGGCGGTGG CCGCCGATGG GCAGCGGCTC
TGGTTGGTGT CGCGCCTAAC TGGTCGACCG GTCGAGCCGC TGCTGTTCAA CTGCGTGCTC
CTGGCTACCC ACCAGCAGCC ACTCGTTCGG TTCCTCACCG AGATCTGGAC GGCCTGGACG
GCGCCGTGCG CCCGGTTCGA CTGGGGACAC GCCCGCACAT TGCCGTTCCT CCCGCGGGTC
CGCCGGGGCC GTTCGATCCT GCACCCGGCC CGCTGGACCA TCCCCGCCGA GGCGTTACCC
GCCCGCACCG CGACGTGGCC GCAGTGGCGG GCCGCCTGGC ACCAGCACCA CGAACGCCGC
CAGCTGCCAC AGGAGGTGCT GATCGGCGGC GACGACGTAC GGCTACGCCT CGACCTGGAC
GAGAACGCCC ATCTCGCGGT CCTGCGTAGC CACCTCGACC GGCACGGACG CGCTGTCCTC
ACCGAGACGG ACGGGCCCTC AGGGTGGATC GATGGCAGAC CCGCCGAACT CCTGCTCACC
CTCACCCGCA CCCCACCAGC CCACCGCCTG GCCGCCCGCC GGGCCCGTCC TGTCAGCGTT
CCTGCCCACC GGCCCGGCCG GTCGCGCTGG TTGGACGCCC GCCTGGTCGG ACAAGCCGAT
CACGTCCTCG CCCGCCTGTC CGAGTTTCCT GGTCTGCCCG CAGGTTGGTG GTTCCTGCGC
TACCCGCACC CCGAACCCCA CCTGCGGCTG CGCATCCCGC TGCGGGGCAT TGCCCAGTTC
GCCGACGTCG CCCGCGGCCT CGCCGGCTGG GCGGAGCAGC TGCACGACGA CGGACTGCTG
GCCGACTACA CCCTGGCCAC CTACCGGCCA GAGACCCGCT GGGGCTGCGG GCAGACCCTC
GCCGCGGCCG AGGCAGTGTT CGCCGCCGAC TCCCGCGCCG CCCTCACCTG GACGTCCGGC
GACCGCCAGG CTGGCACCGC AGCAGGGATG ATCGCTATCG CCGGCGGGTT CACCGATGAC
GGAGCGCGTT GGCTCGTCGA GCACGCACCC CACGGCGGCG GGTCACGCCT GGAGCCCGCC
CAGATCGCCG GGGCCCGCCT GACCTACGGA GACGAGGCTC TGACCGCGAC GCTGGCCACC
TACCGGACCC TCGCCGCCCG AGACGGTCTC GACCTGGACC AGGTGCTGGC CGACCTGCTG
CACCTGCACC ACGCCCGCAT GATCGGCCCC GATCTGGCGT CCGAACGGCA CTGTCTGCGC
CTGGCCCGCG CCCTCGCCCA GACCACCCTG GCCAGGAGGC CATCGTGA
 
Protein sequence
MYQAAHAALI RAASYPRDLT LPTWPDLTAN QADEWLEWLR EVWALPEFAA AVGQAAPDLA 
DQITHVLAQE SMPARRVRRL VETTVRYLLR WTTRPTPFGR FAGVAPLAFG PRAAVWWGDQ
HHEVVRLDDR SVAEYTAVTE RDLAVLRGVT VMTNTLGYRR GGVWVLPCAR VEGDRVWDVE
INLTAPVLVA VEKARAPIPF RELAATVAED QAIGTAKAER LLGALVGAGV LLSAVRPPMT
VTDPAAHLAR HIALPNPGER SAVDLRVDCS VTLPPAVVRE AQEAAAALVA VAPRLPGWAA
YHSAFSERWG PGAAVPLREV VGILGFPAGY RGSLRRDAAT FTARDALLAT LAQRSALDGC
AEVLLDDDLI GQLRSEDDRP PIPHTELRFT LAAGTLQDLD RGAFTLTVVS GARHAGVAGG
RFLHLLTPTE LDQFRSIYTS LPTALPGADA VQLSGPPLDP RLATVARTPE LLPVLPVGDL
HADPVCTVDD LAVAADGQRL WLVSRLTGRP VEPLLFNCVL LATHQQPLVR FLTEIWTAWT
APCARFDWGH ARTLPFLPRV RRGRSILHPA RWTIPAEALP ARTATWPQWR AAWHQHHERR
QLPQEVLIGG DDVRLRLDLD ENAHLAVLRS HLDRHGRAVL TETDGPSGWI DGRPAELLLT
LTRTPPAHRL AARRARPVSV PAHRPGRSRW LDARLVGQAD HVLARLSEFP GLPAGWWFLR
YPHPEPHLRL RIPLRGIAQF ADVARGLAGW AEQLHDDGLL ADYTLATYRP ETRWGCGQTL
AAAEAVFAAD SRAALTWTSG DRQAGTAAGM IAIAGGFTDD GARWLVEHAP HGGGSRLEPA
QIAGARLTYG DEALTATLAT YRTLAARDGL DLDQVLADLL HLHHARMIGP DLASERHCLR
LARALAQTTL ARRPS