Gene Franean1_4895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4895 
Symbol 
ID5673235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5874032 
End bp5876362 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content77% 
IMG OID641243750 
Productexodeoxyribonuclease V 
Protein accessionYP_001509166 
Protein GI158316658 
COG category[L] Replication, recombination and repair 
COG ID[COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member 
TIGRFAM ID[TIGR01448] helicase, putative, RecD/TraA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.382382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0563011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAG ACGCCGCCGA CTCCCCGTTT CGGACCGCTG CCGCGGTCGA GGACCGCGCC 
GCCGGCCCGC CGTCGGCCGG CCCGGGACCA ACCCCGGACG ACGCCGTAGT CTTCAAGGCC
TTCTGCGGCG CCGGCCTGTG GCCGGGCCTG GGCCGGACGA CGGCGGGCCG GTTGGAGGAG
GCCGGCATCG AACGCCGCGA ACACGTCGAC CTGCACCGCC TTTCCTCGGT GCCGGGTGTC
TCCGGGCCGC GGGCTCGCCG CCTGCTCGAC ACATTCCGCG CGGCGGAACC CACCTACGCC
GCGGTGGAGA TGCTGGTCGC GGCGCGCCTG CCGGCGCGGC TTGCCCGCGG CCTGGCCGAC
GCGCTCGGCC CCACGGTGGC CAAGGCGCTG CGCGCGGACC CCTGGTCGCT GCTCGAGGGA
GGCGAGGCCG AGCTCGGCGA CGCCGATCGC TTCGCCCGCC ACCTCGGGCT GGACAGGGCG
GACCCCCGCC GCGGGCCGGC GGTCCTCATC CACCTGCTCG GTCGCGCGGC CAGCCGCGCC
GGCGACACCG CCGCCCCGGT CGCCGAGGTG GCCCGGGCGG CGGTCCGCGA GGGGGTCACC
GAACCCCTCG ACGCGCTGTC CGCCGCGCTC GACTCCGGCC GGGTGATCCA GGTCGACGAC
CGTCTCGCCC TCGAGCGGTA CGCGATGGCC GAGCAGGCGA TCTCGGACGG GGTGGAGCGC
CTGCTCGCGA CCGCCGAACC GTTCCGGCCC GAGGCCGGCG GGCGGCGGGC CCGGCGGCGC
GCCGGCAGCG GACTGGGCGA GGAGCGGCCG GCAGTCGCCG CGCCGCCCGA ACCGCCGGCG
CCGACGTCGA TGTTCGACGA CGACGCACCG GACGTCCACA CACCAGACGA CGAGCCACCC
GACAACGGCG CGGCCGGCTC CGCCTACGGC GCGGCTGGGT CCGATGCGGC TGACGGCGGC
GCGGACACGG TCCCCGGCGG CGCCGAACCG GGCGCCGCGG CCGACCGCGC CGTGGCCGGG
CTGGACGAGG TCCAGCTCCA AGCCGCCCGC ACCGCGCTCG AGGTCGGCGT GAGCGTGCTG
ACGGGTGGTC CCGGCACGGG CAAGAGCCGC ACCGTCGCCG CGGTCGTCCG GCTCGCCGAG
GCCGCCGGCG TCGAGGTGGC GCTCGCCGCG CCAACCGGCC GGGCGGCGAA ACGGCTCGAG
GAGCTGTGTG GCGCGCCGGC CAGCACCCTG CACCGGCTGC TCGGCGCGCA GGGCCGCGGC
GGCGGGTTCG CCCGCGACGA GCACAACCCC ATCGAGGCCG ACCTGGTCGT CGTCGACGAG
ACGTCCATGC TGGACGCCGA ACTCGCCGCC GCCCTGCTGG ACGCCTGCGC CGACGGCACC
CACCTGCTCC TCGTCGGCGA CCCTGCGCAG CTCCCCAGCA TCGGACCGGG TCAGGTCCTG
GCCGACCTGC TGGAGGCGGA GGTCGCGCCA GTCACCGAGC TGACCCGGCT CTACCGGCAG
ACCGACGGCG GCGCGATCGC CACGATGGCC GCGGCGGTCC GCCGCGGTGA GCTGCCGCCA
CCCGGTTCGG GCCGTGAGGT CGTCGTCGTG GCGGCGCGCT CCTCCGGCGA GGCCGCGCAC
CGCGTCGTCC AGCTCGTCAC CGACTCGATT CCGCGGGCCC TGGGCATACC CACCGAGGAC
GTGCAGGTGG TGACGCCCGT GCACGGCGGG CCGGCCGGCA CCGGCGCGCT CAACGCGGCG
TTGAAGAACG CGCTCAACCC GGGCCGCGGC GAGGTCTCCG GCTTCGACGT CGGCGACCGG
GTGGTGGCCA CCGCGAACCA CCTTGACCTG GGGTTCGCCA ACGGTGAGAT CGGCGTGGTC
GTCGCGCTCG GCGAACGGGG CGGCCTGCGG GTCGCCTTCC CCGGCGGCGA GCTCGACGTC
CCCTCGCACG GCGCCGTGGA CCTGCGCCAC GGATGGGCCG TCACGGTGCA CCGCGCCCAG
GGCAGCGAGT GGGCCGCCGT GGTCGGCGTC TTCCCACCGG AGGCCGGCCG GATGCTTACC
CGGCCGCTGA TCTACACGGC GATGACGCGG GCACGCAGCC ACCTGTCAGT CGTGTCGGTG
AACGGCCCGG CGCTGCGGGC CGCGGTCCGC GACGCGGGTG GCCGGCGGCG GGCGACCCTC
CTCCCGGCGC TGCTGACCGG CGAGTCCGCG GAGCCATTCG GGCCGATGGA CGACCTGGAC
GACACCGACG GCCAGCCCGG CGAGGACGGC TTCGGCGCCG GCCTCGCTGA CGGCGCCGCC
GCCGGGGCGG CCGTTCTCGC GCCGGCCACG AAGGAGCCGG TCAGACCATG A
 
Protein sequence
MPEDAADSPF RTAAAVEDRA AGPPSAGPGP TPDDAVVFKA FCGAGLWPGL GRTTAGRLEE 
AGIERREHVD LHRLSSVPGV SGPRARRLLD TFRAAEPTYA AVEMLVAARL PARLARGLAD
ALGPTVAKAL RADPWSLLEG GEAELGDADR FARHLGLDRA DPRRGPAVLI HLLGRAASRA
GDTAAPVAEV ARAAVREGVT EPLDALSAAL DSGRVIQVDD RLALERYAMA EQAISDGVER
LLATAEPFRP EAGGRRARRR AGSGLGEERP AVAAPPEPPA PTSMFDDDAP DVHTPDDEPP
DNGAAGSAYG AAGSDAADGG ADTVPGGAEP GAAADRAVAG LDEVQLQAAR TALEVGVSVL
TGGPGTGKSR TVAAVVRLAE AAGVEVALAA PTGRAAKRLE ELCGAPASTL HRLLGAQGRG
GGFARDEHNP IEADLVVVDE TSMLDAELAA ALLDACADGT HLLLVGDPAQ LPSIGPGQVL
ADLLEAEVAP VTELTRLYRQ TDGGAIATMA AAVRRGELPP PGSGREVVVV AARSSGEAAH
RVVQLVTDSI PRALGIPTED VQVVTPVHGG PAGTGALNAA LKNALNPGRG EVSGFDVGDR
VVATANHLDL GFANGEIGVV VALGERGGLR VAFPGGELDV PSHGAVDLRH GWAVTVHRAQ
GSEWAAVVGV FPPEAGRMLT RPLIYTAMTR ARSHLSVVSV NGPALRAAVR DAGGRRRATL
LPALLTGESA EPFGPMDDLD DTDGQPGEDG FGAGLADGAA AGAAVLAPAT KEPVRP