Gene Franean1_5420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5420 
Symbol 
ID5673751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6545309 
End bp6548449 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content73% 
IMG OID641244275 
Producthelicase domain-containing protein 
Protein accessionYP_001509681 
Protein GI158317173 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.795016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTACGT TCGCGCCGGG TTCGATCGTG GTGGTCCGGG ACGAAGAATG GCTGGTCACG 
GGGGCCGAGC AGGGCACGGA TGGCTGGCGG TTGGACGTGG TCGGCCTCGG TGAGCTCGTG
CGGGAGACCA CGGCGACCTT CTTCAGCGGG CTCGACCACA TCGAGTTGCT GGACCCCCGC
GAGGCAGAAC TGCTCCCGGA CGGCTCCCCG CGGCACCGGC GCACCCGGCT GTGGCTGGAG
GCGACACTAC GCAGGACGCC GATGCCGGCC GGCGAGACCT CGCTGACCGT CTCCGACGGG
ATGCTGGCGA CGCACCTCGA CTACCAGCGG CGGGCGGTCG CGCACGCGCT GTCCCCGACG
AACCTGCGGC CCCGTGTCCT CATCGCGGAT GCCGTCGGCC TCGGCAAGAC CCTGGAGATC
GGGATGCTGC TCGCCGAGCT GACCCGGCGC GGGCGGGCTG ACCGGGTGCT GGTGGTGACC
CCGCGGCACG TCCTCGAACA GATGCAGCAC GAACTGTGGT GCCGGTTCGG ACTGCCGCTC
GTCCGGCTCG ACAGTGATGG TCTGCAGCGG GTGCGGCAGA ACCTGCCGGC GAGCAGGAAT
CCGTTCACGT TCTACCGCCG CATCATCGTC TCGATCGACA CCTTGAAATC GCCGCGCTAC
CGGTCCTTTC TGGAGCGGCA CCGCTGGGAT GTCGTCGTCA TCGACGAATC ACACAACCTG
ACGAACACCG GCACGCTGAA CAACGAACTC GCGCGGGTTC TCGCACCGAA CACGGAAGCG
CTGGTCCTGG CATCGGCGAC GCCGCACAAC GGGAAGAAGG AGTCGTTCGC GGAACTCCTG
CGGCTGCTCG ACCCGACGGC TGTAGGCCCG GACGGCGAGT ACGACGTCGC TGACGTGGAG
CGGCTGTTCA TCCGCCGGCA CCGCCACTCC CCCGAGGTCG CCGCGGAGGT CGGCGCCGAC
TGGGCGCTGC GTCCCGAGCC CGTGGTGATC CCGGTGGCCG CGTCGCCGGC GGAGGACGCG
ATCGCCACGG AGATCTCCCG GACCTGGCTG TACCCACAGG GCCCGTCCCC CGTCGCGGGC
CGCGGGTCGG CGCTGTTCCC CTGGACGCTG GCGAAGGCGT CCCTCTCCTC CCCGGCCGCG
CTGCTGGAGA CGACCGAGGC CCGCCTCAAG CGGCTCGCGT CCCGAAGCGG CGGCGGGGCA
GGGGAGAGGA ACGGCGGCGG CGATCACGAG CTGGAGCGGC GGGCCCTCGA GCGCCTGCGT
GACCTCACCG AGCTGGCCCT GGCCGGGGAG AGCGCCAAGC TCACCGCGCT CGCCGAGTAC
CTGCGCACCA TCGGTGTCGG TGCCCGCTCG GCGACCCGCG CGGTGCTGTT CGCCGAGCGG
GTCGCCACCC TGCGCTGGCT CGCCAGCGAG CTCCCCGAGC GCCTGGGCCT CGCCAAGGAT
CAGATCGCCG TCATGCACGG CGGCCTGCCG GATGTCGAGC AGGAGCGGAT CGTCGACGAC
TTCAAGACCA CGGCGAGCCC GGTCCGGCTG CTGATCACCG GGGATGTCGC CAGTGAGGGC
GTCAACCTGC ACGCGCAGTG CCACCACCTG GTCCATGTCG ACATCCCGTG GTCGCTGATC
AGGATCGAGC AGCGCAACGG GCGCATCGAC CGCTACGGCC AGAAGCATCC GCCGCAGATC
GCGGCACTGG CGCTGGTGCC GTCCGACGAC CGGTTCAGCG GTGACGTGCG GGTGCTGCAG
CGCCTGCTGG CCAAGGAGCA TCTCGCCCAC ACGACACTGG GCGACGCGGC GACGCTGATG
CACCTGCACT CGGCGAGCGG CGAGGAGGAC GCGATCCGGG ACGCGCTCGC GCGCGGCCAG
AACCTGGACG AGGTCGTCCC CGACCCGGGT TCGGGGCAGG GCTCCGAGGA GTTCTTCGGG
TTCTTCGACG AGGAGTTCGC CGCCGCCGGC GACGACCTGC CGCCGGCTCC CCCGGACCGG
CCCCGCGAGT CGCTGTACCC CACCGACGCC GACTTCCTCG CCGACGCGGT CGCCGAGGTG
TACGACGATC CCGCCCGCGC GCCGGACGAC AAGGATCCCG CCCGGGGCGG CGTCGGCTGG
AAGGTCTTCC GGGACAAAAG TCTGATCGCG CTCCGACCGC CACGTGACCT GCGGGTGCGC
CTCGACGCAC TGCCGGCGTC GTATGTCGCC GAACGCGGGA TCCGCGAGCA GCTGCTGCTC
GCCGTGACAC CGTCGGTCGC CTTGGACGGG CTGCGCGCGG CACGCGAGGG GCAGACCGGC
GGCGGCCCGG GACGGCCGGC GCTGGTGGCC GCCGCGACGA CCGCGGCTGC GACGACTGCG
GCTGGGACGA CTGCGGCTGG GACGGCCGCG AGCGGGCCGG GGGGCGTCAC ACCGGCGGGT
CGCAGGGGAC GTCCGGCGAG GGCCCGCGAG GTCACCGCGC CCACCCCGTC CACGTGGCCG
GAGGCGCATT TCCTCTCCCC GCTGCACCCG GTGCTCGACT GGGCGGCGGA CAAGGTGCTC
GCGGCCGGCG GCCGCAACGA GGTGCCGCTG GTGCGCGGGC CGGTCGACGT CCCGCGGGTG
CTGGTGATCG CGACGCTGAT GAACCGGCGC GGCCAGGTCG TGACCCGCCA GATGGTCGTC
GTCGAGTTCC CGACCGGACG GGCCGATCTG CCGATCGCGC AGGTCGTCGA GGGTCTGGAG
CTGTTCGCGG GCACGGGGCT GATCCCCGGG CCCGGCGAGC GGGAGCCCGC GGTGAACCCG
GGCGCGGCCG TGGCCACCGA CGAGCTGCGC GCGCTGGTGC CGGCCGCGAT CGACGCGGCG
GCCCGCGACC TCGACATGGC CGAGGACATC CAGCACTCGG ACCTGGAGCG GCGGCTGGCG
GACTGGTCGA CCCGGCGGAC CCGCTGGCGG GAGCAGGCGG CGCAGCTCGA ACTCGAGATG
ACGGGGCCGG GGCTGGCGAA GGTCCGCCGG CTGTCGAAGC AGGTCAGCCT TGAGGAGGAG
ATCGCGCGGT CGCTGCGCGC CAGCCAGCGG CTGCTGCGCC CGCTCGCCGT CGTGATCCCG
GCGACCGCCG GCGGCAGGAG CGACGGCGAT GCGGCCGGCA CCGAGATCGG CACCGGGACG
ACCGACGGGG GGAACGTCTG A
 
Protein sequence
MTTFAPGSIV VVRDEEWLVT GAEQGTDGWR LDVVGLGELV RETTATFFSG LDHIELLDPR 
EAELLPDGSP RHRRTRLWLE ATLRRTPMPA GETSLTVSDG MLATHLDYQR RAVAHALSPT
NLRPRVLIAD AVGLGKTLEI GMLLAELTRR GRADRVLVVT PRHVLEQMQH ELWCRFGLPL
VRLDSDGLQR VRQNLPASRN PFTFYRRIIV SIDTLKSPRY RSFLERHRWD VVVIDESHNL
TNTGTLNNEL ARVLAPNTEA LVLASATPHN GKKESFAELL RLLDPTAVGP DGEYDVADVE
RLFIRRHRHS PEVAAEVGAD WALRPEPVVI PVAASPAEDA IATEISRTWL YPQGPSPVAG
RGSALFPWTL AKASLSSPAA LLETTEARLK RLASRSGGGA GERNGGGDHE LERRALERLR
DLTELALAGE SAKLTALAEY LRTIGVGARS ATRAVLFAER VATLRWLASE LPERLGLAKD
QIAVMHGGLP DVEQERIVDD FKTTASPVRL LITGDVASEG VNLHAQCHHL VHVDIPWSLI
RIEQRNGRID RYGQKHPPQI AALALVPSDD RFSGDVRVLQ RLLAKEHLAH TTLGDAATLM
HLHSASGEED AIRDALARGQ NLDEVVPDPG SGQGSEEFFG FFDEEFAAAG DDLPPAPPDR
PRESLYPTDA DFLADAVAEV YDDPARAPDD KDPARGGVGW KVFRDKSLIA LRPPRDLRVR
LDALPASYVA ERGIREQLLL AVTPSVALDG LRAAREGQTG GGPGRPALVA AATTAAATTA
AGTTAAGTAA SGPGGVTPAG RRGRPARARE VTAPTPSTWP EAHFLSPLHP VLDWAADKVL
AAGGRNEVPL VRGPVDVPRV LVIATLMNRR GQVVTRQMVV VEFPTGRADL PIAQVVEGLE
LFAGTGLIPG PGEREPAVNP GAAVATDELR ALVPAAIDAA ARDLDMAEDI QHSDLERRLA
DWSTRRTRWR EQAAQLELEM TGPGLAKVRR LSKQVSLEEE IARSLRASQR LLRPLAVVIP
ATAGGRSDGD AAGTEIGTGT TDGGNV