Gene Franean1_4192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4192 
Symbol 
ID5672547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4987001 
End bp4989853 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content73% 
IMG OID641243065 
ProductDEAD/DEAH box helicase domain-containing protein 
Protein accessionYP_001508482 
Protein GI158315974 
COG category[L] Replication, recombination and repair 
COG ID[COG4581] Superfamily II RNA helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.443512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGCG CCGGCGCCGG CCGGCCCTCG GCGAGGGTGC CCGGCAAAGG CGGGGATACC 
CACGCCGCCG CGGGTCGATG CGCCGCCGCG CCTCGGCCGG TGGCGGTTCA CCCGGCGTGG
TTCACCCGGC GTGGCCGACG AGCCTGCGCG CCACGGACCG GACGGCGCGC GCCGCCGCGG
GGGCGGCCCG TCGCCCGCGC CCGCCGTCCG TGGGGCACGA TCGAGAGCAC CATGACGCCG
CACACCTCCA CGCCCGAGAA CACCTCACCC GACCTCGCCC CCGGCGGCAG CGCGCCCGGC
GGCACACCCG GCAGCGCGCC GGGCCCGGCC GCCGGCTCGC CGTCGCTGGC CGACCGCCTG
CCGGCCGGTC CCGAACCCGA CGCGGACGCC CTGTTCGACG CGTTCGGCGC CTGGGTGACC
GACCAGGGCC TGGAACTGTA TCCGGCGCAG ACCGAGGCGC TGATCGAGGT GGTGAGCGGA
GCGCACGTCA TCCTGGCGAC ACCGACCGGC AGCGGGAAGA GCCTGGTGGC CACCGGCGCC
CACTTCGCCG CGCTCGGCTG GGGCCGGCGC ACCTTCTACA CCGCGCCCAT CAAGGCCCTG
GTGTCGGAGA AGTTCTTCGC GCTGTGCGCG GTGTTCGGCC CGGCGCGGGT CGGCATGATG
ACCGGTGACG CCTCGGTCAA CGAGACCGCG CCGATCATCT GCTGCACCGC CGAGATCCTG
GCGAACATCG CGCTGCGTGA CAGTGCCACG GCCGACGTCG GCCAGGTCGT CATGGACGAG
TTCCACTACT ACGCCGACCC GGACCGGGGC TGGGCCTGGC AGGTGCCGCT CATCGAGCTG
CCCGACACGC AGTTCCTGCT CATGTCGGCG ACGCTGGGCG ACGTCACCCG GTTCGAGGAG
GACCTGACCC GGCGCACCGG GCGGCCCACG GCGGTGGTGC GCTCGGCGCA GCGCCCCGTC
CCGCTGTTCC ACCGGTATGT GACGACCCCG CTGCACGAGA CGATCGAGGA GCTGCTCGAG
ACCCGCCAGG CGCCGGTCTA TGTCGTGCAC TTCACCCAGG CCTCCGCGCT GGAACGGGCC
CAGGCGCTGA TGAGCGTGAA CGTCTCCACC CGGGCGGAGA AGGACGCCAT CGCGGAGATG
ATCGGCGGCT TCCGGTTCAC CGCCGGGTTC GGGAAGACGC TGTCCCGTCT GGTCCGGCAC
GGGATCGGCG TCCACCACGC GGGGATGCTG CCGAAGTACC GCCGACTGGT CGAGCTGCTC
GCGCAGGCCG GCCTGCTCAA GGTCATCTGC GGCACCGACA CGCTCGGGGT GGGCATCAAC
GTCCCGATCC GCACCGTGGT GTTCACCGCG CTGTCGAAGT ACGACGGCAC CCGCACCCGG
ATCCTGACCG CGCGCGAGTT CCACCAGATC GCCGGGCGGG CCGGGCGGGC CGGCTACGAC
ACCGTCGGCA ACGTCGTCGT GCAGGCCCCC GAGCACGTCG TGGAGAACGA GAAGGCGCTG
GCGAAGGCCG GTGACGACCC GAAGAAGCGG CGCAAGGTCG TGCGCCGCAA GCCCCCCGAG
GGCTTCGTCT CCTGGGGCGT GCCGACCTTC GAGCGCCTCG TCGCCGCCGA ACCCGAGCCG
CTGACGTCGC AGTTCGCCGT CAGCCACTCG ATGCTGCTCA ACGTGGTCAA CCGGCCGGGG
GACGCGTTCA GCGCGATGCG CCACCTGCTC ACCGACAACC ACGAGTCGCC GGCCGCGCAG
CGCCGGCTCA TCCGCCGGGC GATCGCGATC TACCGGGCGC TGCTCGCCGC CGGGGTGGTG
GAGAAGCTCG ACGCCCCGGA CGAGACCGGG CGCACCGTGC GGCTCACCGT CGACCTGCAG
CTCGACTTCG CGCTCAACCA GCCGCTGTCC CCGTTCGCGC TGGCCGCGAT CGAGCTGCTC
GACCGTGAGT CCCCCGACTA TCCGCTGGAC GTCCTGAGCC TGCTCGAGTC GACGCTGGAC
AACCCGCGGC AGGTGCTGTC CGCGCAGCAG TTCCGCGCCC GCGGGGAGGC GGTGGCGGCG
ATGAAGGCCG AGGGCGTCGA CTACGACCGG CGGATGGAGC TGCTCGACGA GGTCACCTGG
CCCAAGCCGC TCGACGAGCT GCTGCACGCT GCGCTCGCCA CCTACCGGCG CGGCCACCCC
TGGGTCGACG ACTACGAGCT GTCGCCCAAG TCCGTCGCCC GGGACATGTA CGAGCGGGCG
ATGACCTTCG CCGAGTACGT CGACTTCTAC GGCCTGGCCC GCTCGGAGGG GCTGCTGCTG
CGCTACCTGG CCGACGCGTT CAAGGCGCTG CGCCAGACGG TGCCCGAGGA CGCGCGCACC
GAGGAACTCA CCGACCTCAC CGAGTGGCTG GGCGAGCTGG TCCGCCAGGT CGACTCCAGC
CTGCTCGACG AGTGGGAGGC GCTGCGCGAC CCCGCTCCCG ACAGCGGCGC CGCGCCGGGC
GGGCTGGTCG AGCGGACCTC GGCGGTCACC GCGAACACCC GCGCGTTCCG GGTGCTGGTG
CGCAACGCGC TGTTCCGCCG GGTCGAGCTG GCCGCGCTGC GCCGCTACGA CCTGCTCGGC
GACCTCGACG CGGCCGCGGG CTTCGACGCG CAGGCCTGGC GCGACGCCCT CGAGCCGTAC
TTCGACACGC ACGCCGAGAT CGGGACGGGA CCGGACGCCC GTGGCCCCGG GCTGCTCATC
GTGGAGACCG CCGCGGACCG CTGGACGCTG CGCCAGATCT TCGACGACCC GGCCGGGGAC
CACGACTGGG GAATCAGCGC CGAGGTGGAT CTCGCCGCCT CCGACGAGGC CGGCATCGCC
GTCCTGCGGG TGACGGCCGT CGACCAGCTC TGA
 
Protein sequence
MSRAGAGRPS ARVPGKGGDT HAAAGRCAAA PRPVAVHPAW FTRRGRRACA PRTGRRAPPR 
GRPVARARRP WGTIESTMTP HTSTPENTSP DLAPGGSAPG GTPGSAPGPA AGSPSLADRL
PAGPEPDADA LFDAFGAWVT DQGLELYPAQ TEALIEVVSG AHVILATPTG SGKSLVATGA
HFAALGWGRR TFYTAPIKAL VSEKFFALCA VFGPARVGMM TGDASVNETA PIICCTAEIL
ANIALRDSAT ADVGQVVMDE FHYYADPDRG WAWQVPLIEL PDTQFLLMSA TLGDVTRFEE
DLTRRTGRPT AVVRSAQRPV PLFHRYVTTP LHETIEELLE TRQAPVYVVH FTQASALERA
QALMSVNVST RAEKDAIAEM IGGFRFTAGF GKTLSRLVRH GIGVHHAGML PKYRRLVELL
AQAGLLKVIC GTDTLGVGIN VPIRTVVFTA LSKYDGTRTR ILTAREFHQI AGRAGRAGYD
TVGNVVVQAP EHVVENEKAL AKAGDDPKKR RKVVRRKPPE GFVSWGVPTF ERLVAAEPEP
LTSQFAVSHS MLLNVVNRPG DAFSAMRHLL TDNHESPAAQ RRLIRRAIAI YRALLAAGVV
EKLDAPDETG RTVRLTVDLQ LDFALNQPLS PFALAAIELL DRESPDYPLD VLSLLESTLD
NPRQVLSAQQ FRARGEAVAA MKAEGVDYDR RMELLDEVTW PKPLDELLHA ALATYRRGHP
WVDDYELSPK SVARDMYERA MTFAEYVDFY GLARSEGLLL RYLADAFKAL RQTVPEDART
EELTDLTEWL GELVRQVDSS LLDEWEALRD PAPDSGAAPG GLVERTSAVT ANTRAFRVLV
RNALFRRVEL AALRRYDLLG DLDAAAGFDA QAWRDALEPY FDTHAEIGTG PDARGPGLLI
VETAADRWTL RQIFDDPAGD HDWGISAEVD LAASDEAGIA VLRVTAVDQL