Gene Franean1_0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0803 
Symbol 
ID5669219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp937766 
End bp941332 
Gene Length3567 bp 
Protein Length1188 aa 
Translation table11 
GC content74% 
IMG OID641239731 
Producttranscription-repair coupling factor 
Protein accessionYP_001505167 
Protein GI158312659 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1197] Transcription-repair coupling factor (superfamily II helicase) 
TIGRFAM ID[TIGR00580] transcription-repair coupling factor (mfd) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0214899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCG CGCCGCTGCT CGACGCCCTA GTCGCCCGCC CCGGTGGTGA TCCAGCGCTG 
ACCCGCGCGC TGGGCTCGCC GGACGAACCC GTCCTCGACC TGGCCGGGCC CGCGGCGCTG
CGCCCGTTCG CGGCGGCGGC GATGGCCCGG GCCGGTCACA CCGTCCTCGC GGTGACGGCG
ACCGGCCGTG AGGCCGAGGA CCTGGCCGAC GCGGTGGGCA GCCTGCTGGG TTCCGAGCGC
GTCGCGGTGT ACCCGAGCTG GGAGACGCTG CCCCACGAGC GGCTCTCGCC GCGGGCGGAC
ACGGTGGGGC GCCGGCTGGC CGTCCTGCGC CGGCTCGCGC ACCCGGGCAC CGGCGGTGCC
ACCGGTGCGG GCCCGCTGGC CGTCGTGGTG GCCCCGGTCC GCTCGGTGCT CCAGCCCCAG
GTGGCCCGGC TGGGTGAGCT GGCCCCGGTC GCGCTGGCGA AGGGGGACAC CGCCGACCTC
GAGGACGTCA CCGCCCGCCT GGTGGGCATC GCCTACCACC GCGTCGACCT GGTCGAGCGG
CGCGGCGAGA TGGCCGTCCG CGGCGGGATC CTCGACGTCT TCCCGCCGAC CGAGGAACAC
CCGCTGCGGA TCGAGTTCTT CGGCGACGAG GTGGACGACA TCCGCCGCTT CTCCGTCGCC
GACCAGCGCG CGCTGCCCGC GGAGGACGGC GAGCCGGCGG CCGAGCTGTT CGCGCCGCCG
TGCCGGGAGC TGCTGCTCAC CGACGAGGTC CGGGCCCGCG CGGCGGAGCT GGCGCGGGAG
CACCCGCAGC TCGTCGACAT GCTTGACAAG ATCGCCGAGG GCATCCCGGT GGAGGGCATG
GAGGCGCTCG CGCCCGTGCT CACCGACGAG ATGACCCTGC TGCTGGACGA GTTGCCGGCG
GGCTCGCGCG TGCTGGTCTG CGATCCCGAG CGGGTGCGCA CCCGGGCCGG CGAGCTGGTC
CGCACCAGCC AGGAGTTCCT GGACGCCTCG TGGAGCGTCG CGGCGCTCGG CGGCGGGGCG
CCGATCGACC TGGGCGCGGC GACCTACCGT TCGGTCGCCG ACGTCCGGGA GCGGGCCGCC
GAGCTCGGTA TCCCCTGGTG GTCCGTCAGC CCGTTCGCCT CCGAGCCGGG AGCCGGGGCC
GAGGGTGAGG GTGGGGACAC CTTCGGTGAC GACACCGTCG TCGCCAGCCT GCGGCCGGCC
ACGGCCTACC ACGGGGACAC CGCGCGGGCC ACCGCGGACA TCAAGGGCTG GCTCGCCGAC
TCCTGGCGGG TGCTGCTCGT CACCGAGGGC CACGGCCCGG CCGAGCGGCT CGTCGAGATG
ATGCGGGAGG CCGATCTCGG CGCGCGGCTG GCGGCCGACG CCGAGCTCGC CGCGGGGGTC
GCCGTCGTCA CCCAGGCCCA GCTCGGCGCC GGGTTCGTCA GCCCCACGCT GCGCCTCGCC
GTGCTCACCG AGACCGACCT CGCCGGCGCC CGGGGGGTCA CCACCCGGGA CATGCGCCGG
ATGCCGAGCC GGCGGCGCAA GGGCATCGAC CCGCTCGCGC TGAGCGCCGG CGACCTCGTC
GTGCACGACG CCCACGGTGT CGGCCGCTAC GTCGAGATGG TCACCCGCAC GGTGGCCGGC
GCCAAGCGCG AGTACCTGCT GCTCGAGTAC GCCCGCGGCG ACCGCCTCTA CGTGCCGACC
GACCAGCTCG AGCAGATCAG CCGCTACGTC GGCGGCGAGG GCCCGAGCCT GGACCGCATC
GGCGGCGCCG ACTGGGGCAA GCGCAAGAGC CGGGCGCGCA AGGCCGTCAA GGAGATCGCC
GGCGAGCTGA TCCGGCTGTA CAGCGCCCGG ATGGCCGCCC CCGGCCACGC CTTCGGCCCC
GACAGCCCCT GGCAGCGTGA GCTGGAGGAC GCCTTCCCCT TCCGGGAGAC GCCCGACCAG
CTCGCGGCCA TCGACGAGGT CAAGGCCGAC ATGGAGAAGC CCGTCCCGAT GGACAGGGTG
ATCTGCGGCG ACGTCGGCTA CGGCAAGACC GAGATCGCCG TCCGGGCCGC GTTCAAGGCG
GTCACGGACG GCCGCCAGGT CGCCGTCCTC GTCCCGACCA CGCTGCTCGT CCAGCAGCAC
TTCCAGACCT TCGCCGAGCG GTACGCGCCG TTCCCGGTCA CGGTGAAGGC GGTGAGCCGG
TTCAACGCGC CGTCCGAGCA GCGCGCCGTG CTCGACGGCC TCGCCAACGG CACGGTGGAC
GTCGTCATCG GCACGCACCG TCTGCTCTCC AGCGAGACGA AGTTCTCCGA CCTGGGCCTG
GTGATCGTCG ACGAGGAGCA GCGCTTCGGC GTCGAGCACA AGGAGCACCT CAAGAAGATG
CGGACCGCGG TCGACGTGCT CACCATGAGC GCCACGCCGA TCCCGCGGAC GCTGGAGATG
TCGATCACCG GCATCCGGGA GCTGTCGACG ATCGACACCC CGCCGGAGGA GCGCCATCCG
GTGCTGACCT CGGTGGCGCC CTACGAACCG CGCCAGGTGA CCGCGGCCAT CCGGCGTGAG
CTGCTGCGCG AGGGGCAGGT GTTCTTCATC CACAACCGGG TGGAGAGCAT CGACCGGGCC
GCCGCCGCGC TGCGCGAGCT GGTGCCCGAG GCGCGGATCG CCACCGCGCA CGGCCAGATG
AACGAGGACG CCCTCGAGCA GGTCATGGTC TCCTTCTGGG AGAAGAAGTT CGACGTCCTG
GTCTGCACGA CGATCGTCGA GTCGGGCCTG GACATCTCGA ACGCGAACAC GCTGATCGTC
GAGCGGGCCG ACAACTTCGG GCTCTCCCAG CTGCACCAAC TGCGCGGGCG GGTCGGCCGG
GGCCGTGAGC GCGCCTACGC CTACTTCCTG TACCCGGCGG ACCGGCCGCT CTCGGAGACC
GCGCACGACC GGCTGGCCAC CATCGCCCAG CACAACGACC TGGGCGCCGG CATGGCCGTC
GCGATGAAGG ACCTGGAGAT CCGCGGTGCC GGGAACCTGC TCGGCGGCGA ACAGTCCGGC
CACATCGCGT CGGTCGGTTT CGACCTGTAC GTCCGGATGG TCGGCGAGGC CGTCGCCGAG
TACAAGGGCG AGGCCGAGGA GCCGCCGGAG GTCAAGGTCG AGCTGCCCGT CGACGCGAAC
CTGCCGCACG ACTACGTGCC CAGCGAGCGG CTTCGGCTCG ACGCCTACCG CCGGCTGGCC
GGCGCGGTCT CCGACGCCGA CATCGCCGAG GTCCGCGGCG AGCTGACCGA CCGGTTCGGC
CTGCTGCCCG AGCCGGTGGA GAACCTGCTC GCCGTCGCCG GCCTGCGGGT GCTCGCCCGC
CGGTTCGGCG TCACCGAGAT CACGGTGGCC GGCCGGCAGG TCCGGTTCGC CCCGCTGGAG
CTGCGGGAGA GCCAGACCCT GCGGCTGACC AGGCTCTACA AGGGCGCGGT GGTCAAGCCG
GCCATGCGCA CGGTGCTGGT GCCGGCGCCC ACCGAGACCG GCCGGATCGG CTCCCGGCCG
CTGCGCGACC GTGCGCTGCT GGCCTGGGCC GGCCAGTTGC TGGAGGCCGT CGCCGGCGAC
TCGGTGGCCG CCGCCGCAAG TCGGTGA
 
Protein sequence
MTLAPLLDAL VARPGGDPAL TRALGSPDEP VLDLAGPAAL RPFAAAAMAR AGHTVLAVTA 
TGREAEDLAD AVGSLLGSER VAVYPSWETL PHERLSPRAD TVGRRLAVLR RLAHPGTGGA
TGAGPLAVVV APVRSVLQPQ VARLGELAPV ALAKGDTADL EDVTARLVGI AYHRVDLVER
RGEMAVRGGI LDVFPPTEEH PLRIEFFGDE VDDIRRFSVA DQRALPAEDG EPAAELFAPP
CRELLLTDEV RARAAELARE HPQLVDMLDK IAEGIPVEGM EALAPVLTDE MTLLLDELPA
GSRVLVCDPE RVRTRAGELV RTSQEFLDAS WSVAALGGGA PIDLGAATYR SVADVRERAA
ELGIPWWSVS PFASEPGAGA EGEGGDTFGD DTVVASLRPA TAYHGDTARA TADIKGWLAD
SWRVLLVTEG HGPAERLVEM MREADLGARL AADAELAAGV AVVTQAQLGA GFVSPTLRLA
VLTETDLAGA RGVTTRDMRR MPSRRRKGID PLALSAGDLV VHDAHGVGRY VEMVTRTVAG
AKREYLLLEY ARGDRLYVPT DQLEQISRYV GGEGPSLDRI GGADWGKRKS RARKAVKEIA
GELIRLYSAR MAAPGHAFGP DSPWQRELED AFPFRETPDQ LAAIDEVKAD MEKPVPMDRV
ICGDVGYGKT EIAVRAAFKA VTDGRQVAVL VPTTLLVQQH FQTFAERYAP FPVTVKAVSR
FNAPSEQRAV LDGLANGTVD VVIGTHRLLS SETKFSDLGL VIVDEEQRFG VEHKEHLKKM
RTAVDVLTMS ATPIPRTLEM SITGIRELST IDTPPEERHP VLTSVAPYEP RQVTAAIRRE
LLREGQVFFI HNRVESIDRA AAALRELVPE ARIATAHGQM NEDALEQVMV SFWEKKFDVL
VCTTIVESGL DISNANTLIV ERADNFGLSQ LHQLRGRVGR GRERAYAYFL YPADRPLSET
AHDRLATIAQ HNDLGAGMAV AMKDLEIRGA GNLLGGEQSG HIASVGFDLY VRMVGEAVAE
YKGEAEEPPE VKVELPVDAN LPHDYVPSER LRLDAYRRLA GAVSDADIAE VRGELTDRFG
LLPEPVENLL AVAGLRVLAR RFGVTEITVA GRQVRFAPLE LRESQTLRLT RLYKGAVVKP
AMRTVLVPAP TETGRIGSRP LRDRALLAWA GQLLEAVAGD SVAAAASR