Gene Franean1_2447 details

Gene Information

Locus tag: Franean1_2447
Symbol:
ID: 5670843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2906394
End bp: 2908772
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 76%
IMG OID: 641241364
Product: putative ATP-dependent DNA helicase
Protein accession: YP_001506785
Protein GI: 158314277
COG category: [R] General function prediction only
COG ID: [COG3973] Superfamily I DNA and RNA helicases
TIGRFAM ID:
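
The start, end, gene length and protein length fields above are consistent with one another, which a short sketch can check. It assumes 1-based inclusive coordinates and a complete CDS whose final codon is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 2906394
END_BP = 2908772


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS, assuming the last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # 2379 bp
print(protein_length(length_bp), "aa")  # 792 aa
```

Both results match the Gene Length and Protein Length values listed in the record.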


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.250058
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.190451
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGACG GCGCGGCCGA GGGGCGCGAT GAGGATCCCC GGTCTGTGGA TGTCCGGCCG 
GCAGATCTCC GGCCAGCGGA CGCCCGGTCG GTGGATCCGT GGTCGGCGGA GCTCGCCGCC
GAGCGCCGTC ACCTGGACCG CGCCCGCGCG GCGCTGGTTC GGATGCGGCA CGAGGCCGAG
GCGACGGAGA TCGCCGAAGG CGACCCCGTC GCTGACAAGG TGACGAACGC GTCCCTGAAA
GCGGCGCGGC GGCGGCGCCT CGAGGCGCTG AGCGACCCCG CGAGCGGGGC ACCGTTGTTC
TTCGGACGGA TCGACTACGC CGCGGGCGCC GCCGCCGCGC CGGGTCGGCG CGTCCACCTG
GGACGGCGCC ATGTGCGGGA GGCCGCCGGC GATGATCCGC TGGTCGTCGA CTGGCGCACC
GACATCGCCC GTCCGTACTA CCGCGCCCAT AGGGGCGACG CGATGGGGCT CGTGGCGCGG
CGCCGGTTCG GGTTCGACGG CCCCGAGCTC ACCGCGTTCG AGGAGGAACC ACTCGTCGAA
CCGTCGCCCG GCCCGGCAGG CGCGTCACGA GCCGGCGGGC CGGCCGGAGG CGACCTGCTC
GCCCGCGAGA TCGCGCGGCC CCGCTCGGGG CCCATGCGTG ACATCGTCGC GACGATCCAG
CCCGAGCAGG ACGAGATCGT CCGGGCCGAT CTCGAGGTGA GCGTCTGCGT CCAGGGCGCG
CCGGGCACCG GCAAGACGGC GGTCGGCCTG CACCGGGCTG CCTACCTGCT GTTCGCCTAC
CGGGAGCGGC TCAGTCGCGC CGGGGTGCTC GTCGTCGGGC CGAACCGGGC CTTCCTGCGT
TACATCGGCG CGGTCCTGCC CGCGCTTGGC GAGTTCACCG TCACCCACCG TAGTCTCAGC
GCCCTCGTGG ACATCGCGAC GGCGCGCGGA ACCGACCCGG AGCCAGTGGC CGAGATCAAG
AGCGACGCCC GGATGGCCGT GGTGATCCGC CGGGCGCTCG GCGCCACCCG GCCCGGCGCC
TCCCCGCCGG CTGCCCTGAT CACGAGCTCG CTCGGTCGGT GGACGGTGAG CGGCGACGAG
ATCGCCGGCT TGGCCGCCGG CATCCAGGCT GGCGGCCACC GCCATCGCGT GGCTCGGGAC
CTGCTCGCCC GGCGGATCGC CGCCGCGGTG GTCCGCCGCG CCGAGGAGCG GGGGCACCTG
CCGGCCGACT CCGCGGTGGA GCGCCTCGCC CGCACCCGGT CCGTCAGGGC GGCGGTCGAC
GCCGTCTGGC CGCGCACCGA CGCGGCCGGG CTGATCCACC GGCTGTTCAC CGACCCCGCC
CTGCTCGCCC ACGCCGCCGA CGGCGTGCTC TCCGCGCCCG AGCAGCGCCT GCTGCTGGCC
GACCGGCCGA CGTCGCGGCG GGCCGTCCGC TGGTCGCCCG CCGACCTGTT CTGCCTGGAC
GAGGCACTGG ACCTGATCGA CGGCGTGCCG GCGTTCGGGC ACGTCATCGT GGACGAGGCC
CAGGACCTCT CGGCCATGCA GTGCCGGGCC GTCGGACGCC GCTGCGCCAC CGGCTCGCTC
ACCGTCCTCG GGGATCTGGC CCAGGGCACC ACCGCCTGGG CCGCGCGCTC CTGGGAGCAG
GCGCTTGCCC ACTTCGGTAA GCCCGCCGCG CGGCTGGAGG TGCTGACCCG AGGCCACCGG
GTACCGGCCG AGATCCTCTC GTTCGCCGAC CGTCTGCTCC CGAGCATCGC GCCGGATCTG
CCTCCCGCGT CGTCGACCCG CGCCGTCCCG GGTGCGCTGC GGGTGATCGC CGCCCCGCCG
GACGACCTGG CCGGCGAGGC TGCACACCAG GTTCGTGCCG GGCTGGGCCG CGGCGGCTCG
GTGGCGGTGA TCGCCCCCGA CGAGGCCGTC GGCGGCCTGA CGGCGTCGCT GCGCGCGGCG
GGCCTGCCCG CCACCGCCCT CACCGACGAC GCCGCCCTCA CCGACGACGC CGCCCTCACC
GACGCCGCCC TGACGGAGGG CGCTTCGCTC ACCGACAAGC CCGTGCCGGA AGGGCGGGCG
TCCCGGCCGG TGACTGTCGT GCCCGCGTCG CTCGCGAAGG GACTCGAGTT CGACCATGTG
GTCCTCGTCG ACCCGGCGTC CGTCGCCGGC TCCGGGCCGG GCTCCGTCCA GGGGCTGCGC
CGGCTCTACG TGGTGCTGAC CCGGGCGGTG ACCTCCCTGG CAGTGATCCA CGCGGGTGAG
CTCCCGAGCG CTCTCGCGGA TCCGGGCGGC CCGTTCCCGC GGGTACCCCG GCCCGGGGGA
GCGGGCCGCC CGGATCCCGC GCTGGTCAGG GGACTGGAGG TGGCTCCGGA GTCTGCGCGA
CCTGCGAGTC GCCGTTCGAC CGGGGCTGGC TCACGGTGA
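
A minimal sketch of how a GC content figure such as the one in Gene Information could be recomputed from a sequence laid out as above, in 10-base blocks separated by spaces. The helper name gc_fraction is hypothetical, and the example runs on the first block of the gene sequence only, so its result is not the whole-gene value.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence, used only as a small example.
first_block = "ATGGGTGACG"
print(f"{gc_fraction(first_block):.0%}")  # 60%
```

Passing the full gene sequence, with its spaces and line breaks left in, would give the figure to compare against the reported GC content.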
 
Protein sequence
MGDGAAEGRD EDPRSVDVRP ADLRPADARS VDPWSAELAA ERRHLDRARA ALVRMRHEAE 
ATEIAEGDPV ADKVTNASLK AARRRRLEAL SDPASGAPLF FGRIDYAAGA AAAPGRRVHL
GRRHVREAAG DDPLVVDWRT DIARPYYRAH RGDAMGLVAR RRFGFDGPEL TAFEEEPLVE
PSPGPAGASR AGGPAGGDLL AREIARPRSG PMRDIVATIQ PEQDEIVRAD LEVSVCVQGA
PGTGKTAVGL HRAAYLLFAY RERLSRAGVL VVGPNRAFLR YIGAVLPALG EFTVTHRSLS
ALVDIATARG TDPEPVAEIK SDARMAVVIR RALGATRPGA SPPAALITSS LGRWTVSGDE
IAGLAAGIQA GGHRHRVARD LLARRIAAAV VRRAEERGHL PADSAVERLA RTRSVRAAVD
AVWPRTDAAG LIHRLFTDPA LLAHAADGVL SAPEQRLLLA DRPTSRRAVR WSPADLFCLD
EALDLIDGVP AFGHVIVDEA QDLSAMQCRA VGRRCATGSL TVLGDLAQGT TAWAARSWEQ
ALAHFGKPAA RLEVLTRGHR VPAEILSFAD RLLPSIAPDL PPASSTRAVP GALRVIAAPP
DDLAGEAAHQ VRAGLGRGGS VAVIAPDEAV GGLTASLRAA GLPATALTDD AALTDDAALT
DAALTEGASL TDKPVPEGRA SRPVTVVPAS LAKGLEFDHV VLVDPASVAG SGPGSVQGLR
RLYVVLTRAV TSLAVIHAGE LPSALADPGG PFPRVPRPGG AGRPDPALVR GLEVAPESAR
PASRRSTGAG SR