Gene Franean1_1489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1489 
Symbol 
ID5669893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1783019 
End bp1786075 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content72% 
IMG OID641240409 
Producthypothetical protein 
Protein accessionYP_001505835 
Protein GI158313327 
COG category[L] Replication, recombination and repair 
COG ID[COG1112] Superfamily I DNA and RNA helicases and helicase subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.132129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.443999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGACAG CGGGCGGCGC GCGGCGGCCT GACGGCGACC GGCCGGCCGG CGCGACGATC 
GTGACGCGAG CCGACGTCAT GCTGCCGGGC GCGCTGGCGG TCGTCCCGTC GGCCAAGGCG
ACTGAGCAGC TTCAGTGGGC GCCGACGGCA CTGGTGACGG AATTCAGCGA ACTGGCGGCC
CGAGGGGCCC TACCGGCCCG GATGGACCTC GATCGCGGCC AGCTCATCCT CTACACCAGG
CGGTATGTCG CCTTCCTCTA TCCGACGAAG TTCGGGGACG GGTACAGCCT CGGCCACGTC
TCCCGCCGGT CGTTCCGTGA CGAGGACAAG CTCAGCCGGG GAGCGCTGAT ACTGCACTGC
CGCGCTCCCT GGGGGGCGTT CAACGACGTG CGGGACATAC CTCGTCCGCA GGGCACGTTG
AGCCCTCGAT GGGCGTCCCA CTGGGACCAG CTGAGCCTCG AGTGGACCTC CCTGGGCCGG
GTTGTGGACG GCGCTTCGGA GCTGCCGCCG CACCACCGTG ACTACCTCGC ACTGCTCGAC
CACGTCGTCG AGGCGAGCCG GGACATCGAG CTCGCGAAAC TGAGCGGCAT CCCCGAAGCG
CCGTACCGGG AGGTGGAACC GACCCGCGAA GAACGCTACT CAGGCCGCGG CGTCTACTCG
TTCTCCCTGG CCCGGCCGCC CCGTGGGCTG CCCGTCGGTG CCCTGGTCGA GGTGGCGGAG
TGCGGCGACC CGCCGATGCG GGGCAAGGTC GTCCGCAGCG GCGGCCGGAC CCTCGTCGTC
CGCTTCGAGC GGACGGTCGA CTTCCAGCGG ATTCCGCCCC AGGGCCGGCT CAGGGCGTCG
GTCAGCGATC GAGTCTTCCA GGCTCAACGG GACGCGATCG AGACCCTGGC CAAGGGCGAG
GCAGCCAACC CGGCGTTGCT GTCGGTGCTC GTGGACGGCC GCTACGCGCG GTACCAGCCG
GATCTCTCCC GTCAGCCGGT ACGGCAGCTC GACGAACACC AGGCGGACGC GTTCCGGCGG
GCGCTGGCGG TGCCGGACGT CCTGCTTGTC CTCGGCCCGC CAGGCACCGG CAAGACGACC
ACGATCGTCG AGATCGTCAC CGCGTTGGTG GCCCTCGGCC AGCGGGTCCT GGTCACCTCG
CACACGAACC GGGCGGTCGA CAACGTCGTC GAGAACCTGC CACCCCAGAT CAACAGCGTC
CGGGTCGGCG CGGAGGACTC GATGACCTCA GCCGCCCGCC GCCGCGGCCT CGACACGCTG
ACCGAACAGG TCCGGGCCGG AATCCGGGAC GCGGCTGAAC CGCGCGGTCA GCTGACCGAG
TTCCGGCAGA GCCGCGCGGT CTTCGACCAG TGGCTCGGCC ACCTGAGGAA CAGTCTGGCG
GCGGTGGACG CCGCGGAGGC CGCGCTGGTG TCTGTCGAGC AGGCGGTCGA CCACGCCGTA
CAGGCACTCT CGCCGCAGCT CGCGGCCGAC CGCCGGTCGG CCGCGGACCG GCGGACCCGG
CAGCTCGGCT CGGTGATCCG GCTGCGGGAG GAGCTGGCCC GGCGGCGGGA GGTGCTCGCC
CGCCGCGCGA AGACCACCCC GTCGGACAGC CCCGAGGAGT TCTGGCGGCT GGTCACCCGG
TTCCGGCGCT GGTTGCTGGC CCGGGCCCGG CGGCGGGAGC AGGGCGCACG GGAGGCGCTC
GTGACGGCGC AGCAGGATCT GACCGAGGCC GAGGCGGCGG TGGAGGCGTT GTCGGCGCAG
GCCGTCGCGC TGATCGCCGG CGACCCGGAG CACCGTCCGC TGCTCGCCGA GCGGGAACGT
CAGATTGCTG CGCGTGACGG TGGCCACGGC GGCCTCCGGC GGACGGCCGA GGTCGTGCGG
GGTGGCCTGC GCGAGGTGCT GCCGGCCCTG CCGACGTTGG CGGCACTGGA CGACGAGCCC
GCTGACAGCG CCGGGTGGGT GCGGTTCCAC GACTGGGCGG TCGAGTCGTT CGCGACCGTC
GGCCAGCGCG CGGACCTGCT CGGGCATTGG CACGACCAGC TGGCCAGCGC GGAGACCGAG
CTCCAGCGCG AACTCGTCCG GTACGCCGAC GTGGTGGCCG CCACCTGCAT CGGCACCGCC
ACCAGCAAGG TGCTCAGCGA CACCGTTTTC GACGTGGCCC TGATCGACGA GGCGGGCCAG
ATCTCCACGC CGAATTTGTT GGTGCCGATG GTTCGGGCTC GCCGCGCGGT GCTGGTCGGT
GACCAGCACC AGCTCCCGCC CTTCCTGGAC GAGGAGGTCC GCGGCTGGGC CGCCGGTCTG
AAGAAGGACG GCCGGTACAC ACCGGAGGCG GTCACGAGGG TCAACGGGCT GCTGCGCGCG
AGCGGCTTCG AGCTGCTGGT GCCCGGGGCC GCGGCGACCC GGGCGAACTA CGTCGAGCTG
AACCTCCAGC GCCGGATGCC CGAACAGGTC GCGCGATTCG TTTCCGACGC GTTCTACGCG
GGCCGGCTGG GAACCCGGCA CGGCGGTGGG CGAGACGACC CGTTGTTCCG GTCCGCGTTC
GCGATGATCG ACACCTCCGA CCGTCCCGCC GGCGAACGGG GGGAGACGGC GCTGAAGGCC
ACCGAGACAC GGCACGAGCA GGGGTATGTC AACGCGCTCG AAGTGGAGCT GATCGTCTCG
CTCCTCGCGC GGGCCGCCGG CTGGTACCGG GACTGGGCCG TGATCGTCCC CTACAAGGCG
CAGGCGAGGC GGATCAGCGA GGCACTCGCG GGGGAGTTCG GAGACCCGAC CACGGTCGCC
GACAACGTCG GCACCGTCGA CTCGTTCCAG GGTGGCGAAC GGGATCTGAT CATCTACGGT
TTCACCCGCA GCAACAGCCG GAACTCCGTG GGCTTCCTCA CCGAGCTGCG GCGGCTCAAC
GTGGCGGTGT CCCGGGCCAA GCAGCAGCTC GTGATGGTCG GCGACCTCGC CACGCTGAGA
TCGGCCACGG ACACCGAGTT CCGGAAGAAG ATCAGCATGA TGGAGGACCA CCTGCGCCGG
GACGGCGACC TGAGGCTCTC CTGCGAAATC GAGAAGCTGC TGCGGGACCC GCGGTGA
 
Protein sequence
MVTAGGARRP DGDRPAGATI VTRADVMLPG ALAVVPSAKA TEQLQWAPTA LVTEFSELAA 
RGALPARMDL DRGQLILYTR RYVAFLYPTK FGDGYSLGHV SRRSFRDEDK LSRGALILHC
RAPWGAFNDV RDIPRPQGTL SPRWASHWDQ LSLEWTSLGR VVDGASELPP HHRDYLALLD
HVVEASRDIE LAKLSGIPEA PYREVEPTRE ERYSGRGVYS FSLARPPRGL PVGALVEVAE
CGDPPMRGKV VRSGGRTLVV RFERTVDFQR IPPQGRLRAS VSDRVFQAQR DAIETLAKGE
AANPALLSVL VDGRYARYQP DLSRQPVRQL DEHQADAFRR ALAVPDVLLV LGPPGTGKTT
TIVEIVTALV ALGQRVLVTS HTNRAVDNVV ENLPPQINSV RVGAEDSMTS AARRRGLDTL
TEQVRAGIRD AAEPRGQLTE FRQSRAVFDQ WLGHLRNSLA AVDAAEAALV SVEQAVDHAV
QALSPQLAAD RRSAADRRTR QLGSVIRLRE ELARRREVLA RRAKTTPSDS PEEFWRLVTR
FRRWLLARAR RREQGAREAL VTAQQDLTEA EAAVEALSAQ AVALIAGDPE HRPLLAERER
QIAARDGGHG GLRRTAEVVR GGLREVLPAL PTLAALDDEP ADSAGWVRFH DWAVESFATV
GQRADLLGHW HDQLASAETE LQRELVRYAD VVAATCIGTA TSKVLSDTVF DVALIDEAGQ
ISTPNLLVPM VRARRAVLVG DQHQLPPFLD EEVRGWAAGL KKDGRYTPEA VTRVNGLLRA
SGFELLVPGA AATRANYVEL NLQRRMPEQV ARFVSDAFYA GRLGTRHGGG RDDPLFRSAF
AMIDTSDRPA GERGETALKA TETRHEQGYV NALEVELIVS LLARAAGWYR DWAVIVPYKA
QARRISEALA GEFGDPTTVA DNVGTVDSFQ GGERDLIIYG FTRSNSRNSV GFLTELRRLN
VAVSRAKQQL VMVGDLATLR SATDTEFRKK ISMMEDHLRR DGDLRLSCEI EKLLRDPR