Gene Franean1_1786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1786 
Symbol 
ID5670188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2146808 
End bp2151091 
Gene Length4284 bp 
Protein Length1427 aa 
Translation table11 
GC content78% 
IMG OID641240707 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001506130 
Protein GI158313622 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.441157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGGATG TGTCTGTGCA GGAGCGGCTG CCGTCCGAAC CGCCCGGCAG CCTCGCTGTG 
CCCGCCGCGC CGGACGTGGT GGCGGGGGGT CCGGTCGAGC TGCGGGGCGA GCCCGGGATG
TGGCCCGCGG GGGTGGTCGC CCCGTGGGCG GGAGATCCGC GCGGCCCGGA CCGCCGCCGG
GTGATCTTCC TCAGCCACAC CGCCGAGCTG CGGCGTTTCC CGCGGGACCG GTCGTTCGTG
GCCGCCGCCG AGCGCGCCGT CGCGCTGGCC GGTGACCGCG TCGTCGACAT GGAGTACTTC
GGCGCGCGCG CCGACAACCC GGCCGAGTTC TGCGTCCGGC AGGTCCAGGA GAGCGACGTC
TACATCGGTC TCATCGGGTT CCGCTACGGG TCGCCCGTCC GGGAGCTGCC GCACCTTTCC
TACACCGAGC TCGAGTTCCA GGCGGCCACC GAGGCGGGCA TGCCCCGCCT GGTGTTCCTC
CTCGAGGACG ACGCGGAGGT CCCCTTCCGC GAGTTCGTCG ACCCCGCCTA CGGCGAGCGG
CAGGAGGCCT TCCGGGCGCG GCTGCGGGAC GGCGGCACCA TCGTCGCCGG GTTCCGGGAC
GGCCGGCTGG AGACCGCCGT GCTCGACGCC CTGGTCAAGC TGCGGGACAG CGAGCGGCGG
GCGACCCGCA CGCTGCCCGG GTTCGGCGAC GTCCCCCCGG GCCCCCCGGG CCCGCGGCCC
GTCCTCGTCC CGGGTCTGGG CCCTGGCGGT GGATGGCTGC GCCGCCCCTG GATGGTGCCG
GCCGTCCGGG GGCTGGTGGA GCGCCCGGAG CTCACCGAGG CTGTGCTGCG CCGGCTGCTG
CCGCCCGCGA GCGCCCCGGA CGCCCCGGAC GCCCCGGCCC CCTCGGACCC CCCGATCGGC
GCCGAGGTCG TCCCGCGCCC CGTCGTGCTC GCGGGCGCCG GCGGCTTCGG GAAGACGACG
CTGGCCGCGG CCGTGTGCGG CAGCACGCGG ATCGCCCGCC GGTTCGGCGG CGGGGTGCTG
TGGGTGACCC TCGGCGAGTC GCTGGCCGGG GCGCACCTGG CCGACCGGAT CAACGACCTG
AGCGAGGCGC TGTCCGGGGT CCGCCCGACG CTGTCGGACC CCGAGCAGGC CGGGTTCCGC
CTCGGTGAGC TGCTCGGCGC CGAGCCTCGG CTGCTCGTCC TGGACGACGT GTGGCGGCGC
GCCCAGCTGC GCCCGTTCCT GCAGGGCGGG CCGGGCTGCG TCCGCCTGAT CACCACACGG
ATGCGCGGCC TGCCGCCGGA CGCCGACGTC GTCCAGGTGC CCGCGATGGC CGACGGCGAG
GCGGTCAGCC TGCTGACCAG GGATGTCCCC GCGCCGCTGC CCGACGCCGT CCTGCGCCGG
CTGCTGGTGG TCACCGGCCG CTGGCCGGTG CTGCTCGCGC TGGTCAACCG GGCGGTCGTC
CGCCAGACTC GGGACGGCAT GTCGGTGCCC CGCGCGGCCG AGCGGGTGCT GCGCCGGCTG
GAGCGGCGCG GCCCGACCGC ACTGGACGTC AGCCGGGTCG AGGAACGGAC GCTCGCCGTC
GAGGCGACGC TGTCGGCGAG CCTGGGCCTG CTCACCGGCG ACCGGCTCGA CCAGTATCTC
GAGCTGGCCG TCTTCGGCGA GGACGTGGAG ATCCCGCGGG ACGTCCTGGA GGCCTACTGG
GCGGCGACCG GCGATCTCGA CCCCGACGAG GTCGACGACC TGTGCCAGGA GTTCGCCGAC
CTTTCCCTCG TTGTGGCCTA CCGGCGGGAC CCGCCGTCGC TGGTGCTGCA GGACGTCCTG
CGGACCTACC TGCGGGCCCG GGTCGGGGCG GAGCGCCTGC GCGAGCTGGA CGGCGTCCTG
TGCGACGCGC TCGCCGGCAT GATCACCGGC GGCGCTGGCG GTGCTGACGA CCCGGCCGGT
GCTGACGCCC CGGCCGGCCG GGGCGGCCGC GGCGGCCCGG CCGGCTCCGA GCCGGCCGGC
GGGTCCGAGC CGGCCGGCGG GGTGCGTGCC CCGTGGTGGA CGGCGCCGGA GCGGGCCGGC
TACCTCTGGG AGCATCTCGC CCGGCACCTG GCCGGCGCCG GCCGGCACGA CGAGCTCGCC
GTGCTGCTCG GGGACCTGCG CTGGACGGTC GGGAAGCTCT CCGTCGCCCG GCTCGGGCCG
GTGGCGGTCG AGGCCGACCT GGCCGTCGCC GGCAGGGTGC GGCCGGACGA CCCGGTGCTG
CCCGCGCTGA GCCGGGCGCT CGGGCAGAAC GCGCACCTGC TCGGCCCCAC CGAGCCCGCA
GAGGCGCTGG GCACGACCCT GCTGAGCCGG CTCGACGGCA TCCGCGCGCT CGAACCGGCC
CGCGCCGCGT TGGCCCGGCG GCTGAGCGGG CCACGGCTGG TCAACCGGTG GACGCTGCCC
GACCAGCCGC ACCCCGCGCT GCGCCGGGTG CTGGCCGGCC ACCACCGCCA GGTCCTCGCG
CTGGCCGTCG CCCCGGACGG GTCGTGGCTG GCCTCCGCCG GCATGGACGG CACCGTCCGC
ACCTGGACGG TCGGCGCGGG CACGGCGCGC TCGGTGCTCA CCGGCCACAT CGGCCAGGTG
CTCGGCGTCG CGGCCGCGCC CGGCAGCGGC TGGCTCGTCT CGGCCGGCGA GGACGGCACC
GCCCGGATCT GGGACGTCCC CGGCGACGAT GTCCGTGGTG ACGATGTCCG TGGTGACGAT
GTCCGCGGCG ACCTGGACGA CCCCGAGCCG GGTGATCCCG GCGACACCGG GGAGCGGCGG
GGCCGGGATC CCGAGGGGGT CGACCCGGTG GCGCGGCTTG TCCTGCGCGG GCACGACGGC
CCGGTGAACG GCTGCGCGGT GACGGCCGAC GGCACCGGTG TGATCACCGT CGGCGACGAC
GGCTCGTTGC GGACCTGGGA CGCCACCACC GGCACGCCGA GGCTCGCGGT ACCGGTCACC
GGCGGGCGGC TGCGCTGCTG CGCCACCGGG CCAGGCGGAG CCGTCGTCGC GACCGGCGGC
GAGGACGGGA CCATCCGGCT GCACGACCCG CTGACCGGCG AGATCCTCCG GCGGCTGGCC
GGTCACGCCG GTCCGGTGCT TGCGCTGGCG TTCGGCCCGG ACGGCTCCTG GCTGGTCTCG
GCGGGCGAGG ACGGCACGCT GCGTCGCTGG GACACCGCCG CCGGCCGCCA GACCGGGGTG
CTCAGCGACG GCAGCCGCCC CGTACGGGCC TGCGCGGTCG CCCCGGACGG CTCCTACCTG
GTGGCGCCGG CCGGGGACGC GATCTCCGTC CGGGATCTCC CCACCGGCGG CCAGCGGGCC
GAGCTCACCG GGGCCACCGG GACGCGGGCC TGCGTCGTCG CCCCGGACGG CTCCTGGATC
GCCTCCGCCG GGCGCTACGG CACGATCCGG GTCTGGAGCA CCGGCAGCGA CCTGCCGCGC
CCGTCGACCG CGGGGCGCAA CGAGGGGGCC CGCGGCTGCG CCGTCGTGGC CGGCGGCCTG
GTGGTCTCCT CCAGCGACGA CGGCACCGTC ACGGCCTGGG ACCCGGTCAC CGGCGAGCCG
GGCGCGGCGA TGGCGGGCCT GCCCGGCCCG GCCCGCGGCT GCCGGGCCGG CCCCGGCGGG
CGCTGGGTCG TGGTGTCGGC GCAGCCGACC GCGCTGCGGC TGTGGGAGCC CGCCACCGGC
GTCGTCCGCG CCGTGCTGAC CGCCGACGTG GCGATCCTCG GCTTCACCGT CTCCGCGGAC
GGCTCCTGGG TGGCCGGCGG CTGCGAGGAC GGCTCGGTGC GGCTGTGGGA CACCGAGTCC
GGGGAGTGGA TGGCCACCTT CGCCGGCCAC ACCGAGGGGG TGCAGGCCTG CGTCGCCGGT
CCGGACGGCA CCTGGCTCGC CTCCGGCGGG GACGACGCCA CCGTGCGGAT CTGGGACGTG
GCGACCCTCG AGCAGCGCGC CTCGCTGCCC GGCCACACCG ACCCGGTGCT CGGGCTGACC
ACCGACCCGG CGGGCCGCGT CCTGGTCTCG ACCGGCGCGG ACCACACCGT GCGGGTCTGG
GAAGTCGCGA CCGGGCGGGC GCTGGCTGTC CTGCACGGTC ATGCGCACAC GGTGCGGGAG
GCGAGCTTCT CCCCGGACGG CGCCTGGCTC GCGACGGTCG GCGGGGACGG GTCGGTGCGG
GTCTGGGACC CGCTGATCTG GCAGTGCCGC ACGATGATCC GGTTCGAAGG CGCGGCCCGC
GGCTGCTGCT GGCTGCCGGA CTCCACGGGC CTGGCCGTCG CCGGCTCGGC CGGCCTGTAC
CTCTACAGCT TCGTGCCGGA CTGA
 
Protein sequence
MPDVSVQERL PSEPPGSLAV PAAPDVVAGG PVELRGEPGM WPAGVVAPWA GDPRGPDRRR 
VIFLSHTAEL RRFPRDRSFV AAAERAVALA GDRVVDMEYF GARADNPAEF CVRQVQESDV
YIGLIGFRYG SPVRELPHLS YTELEFQAAT EAGMPRLVFL LEDDAEVPFR EFVDPAYGER
QEAFRARLRD GGTIVAGFRD GRLETAVLDA LVKLRDSERR ATRTLPGFGD VPPGPPGPRP
VLVPGLGPGG GWLRRPWMVP AVRGLVERPE LTEAVLRRLL PPASAPDAPD APAPSDPPIG
AEVVPRPVVL AGAGGFGKTT LAAAVCGSTR IARRFGGGVL WVTLGESLAG AHLADRINDL
SEALSGVRPT LSDPEQAGFR LGELLGAEPR LLVLDDVWRR AQLRPFLQGG PGCVRLITTR
MRGLPPDADV VQVPAMADGE AVSLLTRDVP APLPDAVLRR LLVVTGRWPV LLALVNRAVV
RQTRDGMSVP RAAERVLRRL ERRGPTALDV SRVEERTLAV EATLSASLGL LTGDRLDQYL
ELAVFGEDVE IPRDVLEAYW AATGDLDPDE VDDLCQEFAD LSLVVAYRRD PPSLVLQDVL
RTYLRARVGA ERLRELDGVL CDALAGMITG GAGGADDPAG ADAPAGRGGR GGPAGSEPAG
GSEPAGGVRA PWWTAPERAG YLWEHLARHL AGAGRHDELA VLLGDLRWTV GKLSVARLGP
VAVEADLAVA GRVRPDDPVL PALSRALGQN AHLLGPTEPA EALGTTLLSR LDGIRALEPA
RAALARRLSG PRLVNRWTLP DQPHPALRRV LAGHHRQVLA LAVAPDGSWL ASAGMDGTVR
TWTVGAGTAR SVLTGHIGQV LGVAAAPGSG WLVSAGEDGT ARIWDVPGDD VRGDDVRGDD
VRGDLDDPEP GDPGDTGERR GRDPEGVDPV ARLVLRGHDG PVNGCAVTAD GTGVITVGDD
GSLRTWDATT GTPRLAVPVT GGRLRCCATG PGGAVVATGG EDGTIRLHDP LTGEILRRLA
GHAGPVLALA FGPDGSWLVS AGEDGTLRRW DTAAGRQTGV LSDGSRPVRA CAVAPDGSYL
VAPAGDAISV RDLPTGGQRA ELTGATGTRA CVVAPDGSWI ASAGRYGTIR VWSTGSDLPR
PSTAGRNEGA RGCAVVAGGL VVSSSDDGTV TAWDPVTGEP GAAMAGLPGP ARGCRAGPGG
RWVVVSAQPT ALRLWEPATG VVRAVLTADV AILGFTVSAD GSWVAGGCED GSVRLWDTES
GEWMATFAGH TEGVQACVAG PDGTWLASGG DDATVRIWDV ATLEQRASLP GHTDPVLGLT
TDPAGRVLVS TGADHTVRVW EVATGRALAV LHGHAHTVRE ASFSPDGAWL ATVGGDGSVR
VWDPLIWQCR TMIRFEGAAR GCCWLPDSTG LAVAGSAGLY LYSFVPD