Gene Franean1_5987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5987 
Symbol 
ID5674308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7295139 
End bp7297694 
Gene Length2556 bp 
Protein Length851 aa 
Translation table11 
GC content72% 
IMG OID641244835 
ProductATP-dependent DNA helicase PcrA 
Protein accessionYP_001510237 
Protein GI158317729 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID[TIGR01073] ATP-dependent DNA helicase PcrA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0573497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0186357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACTC TTTTCGACGA CACCTTCGGC CCCGGGGCCC TGGTCGACGG CGGTGCCCCG 
GCCGGCTCCG GCGACGGCCA CGAGCCGCAC GGTGATCTTC CTGGCGGCGG TCGGGCGTCC
CGTGGGGCCT TCGGCCCGGC GGTGCTCGAG CCGGGCGGCG CCACCCGCGC CCGGGCGCCC
AGGCTCGACC CGGACGAGCT GCTCGAGGGG CTCAACCCGC AGCAGCGTGC CGCGGTGGTG
CATGCCGGCT CGCCGTTGCT GGTCGTCGCG GGTGCCGGGT CCGGCAAGAC CCGCGTGCTC
ACGCACCGGA TCGCGTACCT GCTCGCGGCG CGCGGCGCGC GGCCCGGCGA GATCCTCGCG
ATCACCTTCA CCAACAAGGC CGCCGGCGAG ATGAAGGAGC GCGTCGAGGC GATCGTGGGC
GGCCGCGCCC GGGCGATGTG GGTGAGCACC TTCCACTCCG CCTGCGTGCG CATCCTGCGC
TCGGAGGCGA GCCGGCTCGG CTTCGGCTCC TCCTTCTCGA TCTACGACGC GGCGGACGCC
CAGCGCCTCA TCACGCTGGT CACCAGGGAT CTCGACCTCG ATCCGAAGCG GCACACGGCC
CGTGGGCTCG CCGCGGCGAT CAGCGCGCTG AAGAACGAGC TGGTCGACTG GGAGACGGCC
CGGGACAGGG CCACGAACCA CATCGAGCGC ACGGTCGCCG AGGTCTACGC CGCCTACCAG
CAGCGCCTGA CCCAGGCGAA CGCGCTGGAC TTCGACGACC TGATCATGAC CACGGTCAAC
CTGCTGCAGG CGTTCCCGGA CGTCGCCGAG CACTACCGGC GCCGGTTCCG CCACGTCTTG
GTGGACGAGT ACCAGGACAC CAACCACGCC CAGTACGTGC TCATCCGTGA GCTCGTGGGC
CGCTCGGCGT CCTCGGCTGA CGAGCCGGCT GCGGGCTCCG GCGAGGCTGT GTACTCCGGC
GGTTCCGCGG AGCCCGACGG GGCGGGGTGG CCCGCGAAGG CGGTGCCGCC GTCGGGTGCC
GTTCCGTCCG CGGAGCTGTG CGTCGTCGGT GACGCGGACC AGTCGATCTA CGCCTTCCGT
GGCGCGAACA TCCGCAACAT CGTCGAGTTC GAGCAGGACT TCCCGAACGC GGCGGTCATC
CTGCTGGAGC AGAACTACCG CTCCACGCAG ACGATCCTCT CCGCGGCGAA CGCGGTCATC
GCCCGCAACG CCCAGCGCAA GCCCAAGCGC CTGTGGTCGG ACGCCGGTGA CGGCGAGCGG
ATCGTCGGCT TCGTCGCCGA CAACGAGCAC GACGAGGCCG CGTTCGTCGC GGAGGAGATC
GACCGCCTCG GCGACGAGAA CCTGGCCAGG CCGGCGGACG TCGCCGTCTT CTACCGGACG
AACGCGCAGA GCCGCGTGTT CGAGGAGATC TTCATCCGGG TCGGCCTGCC GTACCGGGTC
GTCGGTGGCG TCCGGTTCTA CGAGCGCCGC GAGATCCGTG ATCTTCTCGC CTATCTGCGG
GTGTTGGTGA ACCCGACGGA CACGGTGAAC CTGCGGCGCA TCCTGAACGT TCCGCGGCGC
GGCATCGGTG ACCGCGCGGA GGCGATGGTC TCGGCCTTCG CCGAGCGGGA GCGGATCGCC
TTCTCCGAGG CGCTGAGCCG GGTGGACGAG GTGCCCGGCG TGGCGACCAG GTCGGCCCGC
TCGATCCGAG ATTTCGTCGC CCTCCTGGAG GGCCTGCGCG AGCAGCTGCC CGCCGGCCCG
CTGGCCGTGA TCGAGGCGAC GCTGGAACGG ACGGGCTACC TCTCGGAGCT GGTCGCGGAG
GACACCATCG AGGCACAGGG CCGGGTGGAG AACCTGCGCG AACTCGTCGG GGTCGTCCAG
GAGTTCACCG AGCGGCGCCC GGACGGCACC CTCGCCGAGT TCCTGGAGCA GGTGGCGCTC
GTCGCCGATG CCGACCAGAT ACCCGACGAC GACGGCTCGG ACGGCGTGGT CACGCTGATG
ACCCTGCACA GCGCGAAGGG CCTCGAGTTC CCCGTCGTGT TCCTGACCGG GCTCGAGGAC
GGCGTCTTCC CGCACATGCG CACGCTTGGC GACCCGACCG GGCTGGAGGA GGAGCGCCGG
CTCGCCTACG TGGGCGTCAC CCGGGCCCGT GCGAGGCTGT ACCTCACCAG GTCCCAGGTA
CGCAGCGCCT GGGGGCAGCC GGCCTACAAC CCGCCGTCCC GGTTCCTGGG GGAGGTGCCG
GACTCGCTCG TGGAGTGGCG GCGGCTGCCC GAGCCTGCCC CGGCCGCGGG CGGTGGGCGG
TTCTCCGGCG GTGGGTCGGC GGGCGCGGGC AACGGTTCCG GTGGGTCCGG CGGGAAGCTG
CCGAGCTCCC CGTTCGGGGC GAAGCCCAGC CGCCCGGCGG CGCGTCCGAT CATCGAGCTG
ACCCCCGGCG ACCGTGTCAC CCACGACACG TTCGGTCTCG GTGTGGTCGT CACGACGAGC
GGGGTGGGGG ACTCGCGCCA GGCCAAGGTC GACTTCGGCG CCGAGACCGG CACGAAGGAC
CTCCTCCTGC GCTACGCCCC GGTCGAGAAG CTCTAG
 
Protein sequence
MSTLFDDTFG PGALVDGGAP AGSGDGHEPH GDLPGGGRAS RGAFGPAVLE PGGATRARAP 
RLDPDELLEG LNPQQRAAVV HAGSPLLVVA GAGSGKTRVL THRIAYLLAA RGARPGEILA
ITFTNKAAGE MKERVEAIVG GRARAMWVST FHSACVRILR SEASRLGFGS SFSIYDAADA
QRLITLVTRD LDLDPKRHTA RGLAAAISAL KNELVDWETA RDRATNHIER TVAEVYAAYQ
QRLTQANALD FDDLIMTTVN LLQAFPDVAE HYRRRFRHVL VDEYQDTNHA QYVLIRELVG
RSASSADEPA AGSGEAVYSG GSAEPDGAGW PAKAVPPSGA VPSAELCVVG DADQSIYAFR
GANIRNIVEF EQDFPNAAVI LLEQNYRSTQ TILSAANAVI ARNAQRKPKR LWSDAGDGER
IVGFVADNEH DEAAFVAEEI DRLGDENLAR PADVAVFYRT NAQSRVFEEI FIRVGLPYRV
VGGVRFYERR EIRDLLAYLR VLVNPTDTVN LRRILNVPRR GIGDRAEAMV SAFAERERIA
FSEALSRVDE VPGVATRSAR SIRDFVALLE GLREQLPAGP LAVIEATLER TGYLSELVAE
DTIEAQGRVE NLRELVGVVQ EFTERRPDGT LAEFLEQVAL VADADQIPDD DGSDGVVTLM
TLHSAKGLEF PVVFLTGLED GVFPHMRTLG DPTGLEEERR LAYVGVTRAR ARLYLTRSQV
RSAWGQPAYN PPSRFLGEVP DSLVEWRRLP EPAPAAGGGR FSGGGSAGAG NGSGGSGGKL
PSSPFGAKPS RPAARPIIEL TPGDRVTHDT FGLGVVVTTS GVGDSRQAKV DFGAETGTKD
LLLRYAPVEK L