Gene Franean1_0925 details


Gene Information

Locus tag: Franean1_0925
Symbol:
ID: 5669339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1076063
End bp: 1079470
Gene Length: 3408 bp
Protein Length: 1135 aa
Translation table: 11
GC content: 78%
IMG OID: 641239852
Product: UvrD/REP helicase
Protein accession: YP_001505287
Protein GI: 158312779
COG category: [L] Replication, recombination and repair
COG ID: [COG0210] Superfamily I DNA and RNA helicases; [COG2887] RecB family exonuclease
TIGRFAM ID:
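
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1076063, 1079470)
    print(length)                           # gene length in bp
    print(expected_protein_length(length))  # protein length in aa
```

With the values from this record, the sketch gives 3408 bp and 1135 aa, matching the Gene Length and Protein Length fields.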


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.963076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTGGACA CGGCGCAGAC CGACGTGGTG GGACACGCCG GTGGTCCGCT ACTAGTCCTC 
GCCGGTCCGG GAACGGGAAA GACCACAACC CTGGTCGAGG CCGTCGCCCG GCGGATCGAG
GCCGGGGCGG ATCCGGCGTC CCTGCTGGTG CTCACATTCA GTCGGCGGGC CGCCCGCCAC
CTGCGCGAAC GGTTCGCCGT CCGGCTCGGG GCGGCCGCCG TCGGGCCGGC GGCGTGGACG
TTCCACGCCT GGTGCCTCGG GCTGCTGCGC GCGTACGGCG CCGCCACCCC GCTCGGCGGC
TTCCGCCTGC TGTCGGGCCC CGAGCAGGAC GCCCGGCTGC GCGACCTCAT CGAAGGCTCC
CGCGAGCTGG GACGTCCGGT CTGGCCTGAT CCACTCGCCG GCTGTCTGAA CACCCGCGGC
TTCGCCGAGG AGATCAGGAT GCTGCTCACG CGGGCCCGGG AGATCGGCGT CGATCCGCCG
GCGCTGGCGC GCATCGCCCG GCGGGTGGGC CGCTCCGACT GGGCGGCCGT CGCCGAGTTC
TACAGCGACT ACCTCGACGC GCTCGGCGCG GAGGGAGCCC TCGACTACGC CGACCTGGTG
CACGCCGCGG CGAACCTCGC CGCCGAGCCG GACGTCCTCG CCCAGCTGCG GCAGCGGTAC
CGGGCGGTGT TCGTCGACGA GTACCAGGAC ACCGACCCGG CGCAGGAGCG GCTGCTGACC
GCCGTCGCCG GCGGCGGGGG GGACCTGGTC GTCTTCGGTG ACCCGGACCA GTCGATCTAC
GCCTTCCGCG GGGCGGAGGT CCGGGGCCTG CTCGACTTCC CGAACCGGTT CCCCCGCGCC
GACGGCTCAC CGGCGCCCGT CGCCGCGCTG CGGCGGTGCC GTCGGATGGC CCCGGCGCCG
CTGGCCGCCT CCCGCCGCGT CGCGGCGTCG CTTCCTTCGG CGGGCCTGCC GGTCCGCCAC
GTCCGCGCGC ACCGCGATCT TGTCTCCGCG GGGGCCGTCG GGCCGGGCCA GGTCGCGGCC
CGCACCTATC CGACGGCCGG CGCCGAGGCG GCGGCGATCG CCGAGCTGCT GCGCCGCGAG
CATCTCGAGA ACGGCGTCGA CTGGGACGAC ATGGCCGTCC TGGTCCGCTC CACCGCGCCG
CTCGGGCTGC TGCGCCGGGT GCTGACGGCG GCGGGCGTCC CGGTGAGCGT GGCGGGTGGG
GAGCTCCCGC CGGCGCGGGA GCCGGCGGCG GCGCTGCTCC TCACGGCCCT GCGCTGCGCG
GACGACCCGG CGGTCCAGCT GACGCCGGAG ACGGCGCGCG CGCTGCTGAC CTCCCCGCTG
GGCGGCGCGG ACCCGGCCGG CCTGCGCGCC CTCGGCCGCG AGCTGCGCGC CCACGACGCC
GCCGCCCGCG CCGCCGCCCG CGCCGCTGCC CGGGGCACAG CTGCCCGGGG CACCACTGCC
CGCGACAGCA CTGCCCGCGA CGCCGTCGGC CCGATCGCCC TGCCGGAGCC GCGCTCGGCA
CCGGAGGCGG ACGGGTCGGT GCTCGGTCTG CCCGCGTCCT CGGCCGAGCT GCTGCGCGAG
GCGGTCGCCG ATCCGGAGCG GGTTCTGCTC GCCGTCCCGG ACGAGCTCGC CGCGCCCGTG
CGCCGGCTCG GGCGGTTGCT GCGAACCGTG CGCGCCGAGC TGCGGCGGGG GGCGGCCCCG
GAGGACGCGC TGTGGACACT GTGGACGGCC AGCGAGTGGG GCCCCAGGCT GGAGCGGGCG
TCCGCGGTCG GCGGGCCGGC CGGGCGGTCG GCCGACCGTG ACCTCGACGC CGTCGTCGCG
CTGTTCGACG CCGTCGGGCG TCTCGGTGAG CGGCGCGGGC CGGGCTTCGG GGTGAGCTCC
GTCGTCGAGG AGCTGACCCG CCAGCAGATC GCGCCGGAGA CGGCGCAGGC CCGGGCCGGT
CGCCCGGCCA CCGTCCGGCT GCTCACCGCG CACCGCTCCA AGGGGCTGGA GTGGGAGGTC
GTGGTCGTCT GCGGCGTGCA GGACGGCGTC TGGCCGGACC TGCGCCAGCG GCACACCCTG
CTCGGGGCCG AGCAGCTCGA CTCACCGCGC AACGGCGGCG TCCGCCCGCC GCGGACCAGG
GAGGAGCTGC TCGCCGACGA GCGTCGCCTG TTCTATGTGG CGCTCACCCG GGCGCGCCGC
CGCCTGGTGG TGACCGCGGT GGACGGCGCC GACGACGACG GGGAGCAGCC CAGTCGGTTC
CTGGCCGAGC TCGGGGTGCA GGTGGAGAGC GTCCGGGCCC GCCCGCCGCG TCCGCTCACC
CTGGTCGGCC TGGTCGCCGC GCTGCGGCGG CTGACAGTTG ACCCGGAGGC GTCGGCGGCG
ATGCGGGAAG CCGCGGGGGT CCGACTCGCG GCCCTGGCCG CCGCCTCCGA CGAGACCGGC
CGGGCGCTGG TCCCCGCCGC CCATCCCGAC CGCTGGTGGG GGCTGCTGGA CGTGACGAGC
TCGGACGTCC CGGTCGTCGC GCCCGACGCG CCGATCCGGC TCTCGGGGTC CTCGCTGTCG
GGCCTGAGCT CCTGCGGGCT GCGCTGGTTC CTGGAGCACG AGGCGAACGC GGCCGAGCCG
GCGTCGCCCG CGCAGGGCTT CGGCAAGGTC GTGCACGCGC TGGCCGACGA GGTGACGACG
GGCCGGACGG AGGCGCGGCT CGAGGCCCTC GACGCGCGGC TCGACCTGGT CTGGCGGCAT
CTGGAGTTCG ACTCGCGGTG GCGCTCCGAG CAGGAGCGCG CGGCGGCGCG GGAGGCGCTC
GGCCGGTTCC TCGAGTGGCA CGCCGCGCAG CGCGGCCGCG AGGTCGTCGC GTCCGAGGTG
CGTTTCTCCT GCGACATCGA GGTGGCGGGC CGGGTCGTGC GGCTGCGCGG CTTCATCGAC
CGGGTCGAGC TCGACGACGC CGGCCGGGTG CACGTCGTCG ACTTCAAGAC CGGCCGGACG
CCGCCGAGCC GGCGTGACCT GGCGACCCAC GCACAGCTGG GCAGTTACCA GCTCGCCGTC
CGCGAGGGCG CCCTCGACCC GGTACTGGAC GAGGCCGCCC GCCGCCACGA TCCGCCCGCG
CCCGGCGAGG TGCCCGGGGC TGGGGACCTG TCCGGGGCGG GGGCGCCGTC CACCGCCGGT
GAGCGGCCGG CGCCGGTGCC CGGCGGGGCG GAGCTGGTGC AGCTCCGGCG GGACGCGGGC
GGCGACAACA CCGGCCCGCG CCTGCCGGCG GAGGAGCCGG GGCCGCCCGA GGTGCAGCGT
CAGGACGCGC TCCCCGGCGG CCGGGCGTGG ATCGACGACA TGGTCGAGAA CGCGGTGCGG
ACCGTGACGA CCGAGTCCTT CCGGCCCACA CCCGGCGAGG GGTGCACCAT GTGTTCGTTC
CGTCGGTGCT GCCCGGGGCG GCCGGAGGGG GAGCAGGTGA TCGATTGA
 
Protein sequence
MLDTAQTDVV GHAGGPLLVL AGPGTGKTTT LVEAVARRIE AGADPASLLV LTFSRRAARH 
LRERFAVRLG AAAVGPAAWT FHAWCLGLLR AYGAATPLGG FRLLSGPEQD ARLRDLIEGS
RELGRPVWPD PLAGCLNTRG FAEEIRMLLT RAREIGVDPP ALARIARRVG RSDWAAVAEF
YSDYLDALGA EGALDYADLV HAAANLAAEP DVLAQLRQRY RAVFVDEYQD TDPAQERLLT
AVAGGGGDLV VFGDPDQSIY AFRGAEVRGL LDFPNRFPRA DGSPAPVAAL RRCRRMAPAP
LAASRRVAAS LPSAGLPVRH VRAHRDLVSA GAVGPGQVAA RTYPTAGAEA AAIAELLRRE
HLENGVDWDD MAVLVRSTAP LGLLRRVLTA AGVPVSVAGG ELPPAREPAA ALLLTALRCA
DDPAVQLTPE TARALLTSPL GGADPAGLRA LGRELRAHDA AARAAARAAA RGTAARGTTA
RDSTARDAVG PIALPEPRSA PEADGSVLGL PASSAELLRE AVADPERVLL AVPDELAAPV
RRLGRLLRTV RAELRRGAAP EDALWTLWTA SEWGPRLERA SAVGGPAGRS ADRDLDAVVA
LFDAVGRLGE RRGPGFGVSS VVEELTRQQI APETAQARAG RPATVRLLTA HRSKGLEWEV
VVVCGVQDGV WPDLRQRHTL LGAEQLDSPR NGGVRPPRTR EELLADERRL FYVALTRARR
RLVVTAVDGA DDDGEQPSRF LAELGVQVES VRARPPRPLT LVGLVAALRR LTVDPEASAA
MREAAGVRLA ALAAASDETG RALVPAAHPD RWWGLLDVTS SDVPVVAPDA PIRLSGSSLS
GLSSCGLRWF LEHEANAAEP ASPAQGFGKV VHALADEVTT GRTEARLEAL DARLDLVWRH
LEFDSRWRSE QERAAAREAL GRFLEWHAAQ RGREVVASEV RFSCDIEVAG RVVRLRGFID
RVELDDAGRV HVVDFKTGRT PPSRRDLATH AQLGSYQLAV REGALDPVLD EAARRHDPPA
PGEVPGAGDL SGAGAPSTAG ERPAPVPGGA ELVQLRRDAG GDNTGPRLPA EEPGPPEVQR
QDALPGGRAW IDDMVENAVR TVTTESFRPT PGEGCTMCSF RRCCPGRPEG EQVID
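
The gene and protein sequences above are printed in blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and the coding sequence translated. It assumes the first codon is the initiator (read as M even when it is GTG, as translation table 11 permits) and that translation stops at the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; translation table 11 uses the same
# codon-to-residue assignments as the standard code and differs in the
# set of permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, treating the first codon as the initiator (M)."""
    usable = len(seq) - len(seq) % 3
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: first codon is the start codon
    return "".join(residues).split("*")[0]  # stop at the first stop codon


if __name__ == "__main__":
    first_line = "GTGTTGGACA CGGCGCAGAC CGACGTGGTG GGACACGCCG GTGGTCCGCT ACTAGTCCTC"
    dna = clean(first_line)
    print(translate(dna))
    print(round(gc_fraction(dna), 2))
```

Applied to the first line of the gene sequence, `translate` returns `MLDTAQTDVVGHAGGPLLVL`, which matches the first 20 residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` is one way to compare against the reported GC content.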