Gene Franean1_4867 details


Gene Information

Locus tag: Franean1_4867
Symbol:
ID: 5673207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5839016
End bp: 5841988
Gene Length: 2973 bp
Protein Length: 990 aa
Translation table: 11
GC content: 75%
IMG OID: 641243722
Product: DEAD/DEAH box helicase domain-containing protein
Protein accession: YP_001509138
Protein GI: 158316630
COG category: [L] Replication, recombination and repair
COG ID: [COG4581] Superfamily II RNA helicase
TIGRFAM ID:
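
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above. The formulas are assumptions about
# how the listed lengths relate (1-based, inclusive coordinates).
start_bp = 5839016
end_bp = 5841988

# Inclusive coordinates: both the start and end positions belong to the gene.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (2973 bp) and Protein Length (990 aa) fields.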


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.335008
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCTCCG CGCCCGGAGC CGTCGAGGAG TTCGCGGCCC GGTACCCGTT CGGGCTCGAC 
CCGTTCCAGT CCGAGGCAGT CGCGGCGCTC GCGCAGGGCG AGGGTGTCCT CGTCGCCGCC
CCGACGGGCG CCGGCAAGAC CGTCGTGGGT GAGTTCGCCG CCCACCTGGC GCTCGCGACC
GGCACACGTT GCTTCTACAC CACCCCGATC AAGGCCCTGT CCAACCAGAA GTACGCCGAC
CTGGTCGCCC GGTACGGCGC GGCCTCGATC GGCCTGCTGA CTGGCGACAC CTCGCGCAAC
GGGGACGCTC CGGTGGTCGT GATGACCACC GAGGTCCTGC GCAACATGCT CTACACCGAG
GCCGCGGGCA GCGCCCGGCT GGACTCCCTC GGCTACGTCG TGATGGACGA GGTCCACTAC
CTCGCCGATC GCCAGCGCGG AGCCGTCTGG GAAGAGGTGA TCATTCACCT GCCCCAGCAC
GTGCGGCTCG TCTCGCTGTC GGCGACGGTC AGCAACGCGG AGGAGTTCGC CGAGTGGCTG
GTCACCGTCC GCGGGCACAC CCGGGTGATC GTCAGTGAGC ACCGCCCGGT GCCGCTGTTC
CAGCACGTCC TCGCGGACCG GACGCTGCAC GACCTGTTCG TCGACCAGCC GTCCGGGCTG
GATCCGGGCG TCCCGGCGTT CTCGCGTCGC GGCCCGGGGC CGAACGGGCG GGGCGCTGCC
CCAGGGTCCG TCGGGGGCGC CACCCCGGGC AGCCGGCCCG GCGCCCGTGC CGCTGAAGCC
CGTGCCGGTG ATATCGCCGG TGCTGGTGCG GCGGCCGGGG GGCGGGCGGT CAACCCGGAC
CTGCTCCGGC TCGCCCGGGA GGAGTCCCGC GCGGTGTACG AGCGCGGCCG CGGGCCCCGG
TCGTCGCGGC CGGGCCGGCC CGGCGCCGGG AACGGGGCCG GGAACGGGCG GCGCCGCTCC
GGGCCGCCGA ACCGGCCCGA CGTCATCGTC CGGCTGGACC GTGCCGGCCT GCTGCCCGCG
ATCCTGTTCG TCTTCAGCCG GGTCGGCTGC GACGCCGCGG TGGCGTCCTG CATCCAGGCG
GGTCTGCGGC TGACGTCCCC CGACGAGCAG CGCGAGATCC GGGAACACGT CCGGGCCCGG
ACAGCGGGAG TGCCCCAGGC CGACCTAGCC GTCCTGGGCT ACTGGCAGTG GCTCGAAGGG
CTGGAGCGGG GTATCGCCGC GCACCACGCA GGCATGCTCC CCACCTTCAA GGAGGTGGTG
GAGGAGCTGT TCGTGCGTGG GCTCGTCCGC GCGGTGTTCG CCACCGAGAC CCTGGCGCTC
GGGATCAACA TGCCCGCGCG CACCGTCGTC CTGGAACGGC TCACCAAGTT CAACGGCCAG
ACCCGCGCGG ACATCACGCC CGGCGAGTAC ACGCAGCTCA CCGGGCGGGC CGGGCGGCGC
GGGATCGACG TCGAGGGCCA CGCCGTCGTG CTGTGGCAGC CCGGGCTGGA CCCGCTCGCC
CTCGCCGGCC TGGCCTCGAC CCGGACCTAT CCGCTGAAGT CGTCGTTCCG GCCGTCCTAC
AACATGGCTG TCAACCTCGT CGGCCGCCTC GGCGCCGAGC GCGCGCGCAC CGTCCTGGAG
TCGTCGTTCG CCCAGTTCCA GGCGGACAAG GCCGTCGTCG GCATCGCCCG CGCGGTGCGC
CGTAACCAGA CGGCGATCGA GGAGCTGACC GCCGCCCTCG AATGCGACCG CGGCTCGGTC
ACCGAGTACG ACGGGCTGCG CCGCCAGATC CGGGAGCGCG AGGCCGACCT GTCCCGCGCG
GGGACGGTGC GCCGCCAGTC CGAGGTCGCC GCGGCGCTGG CGAAGCTGCG CAGCGGCGAC
GTGGTGCGCG TGCCGGTCGG GCGGCGCGGC GGGCTTGTCG TGGTGCTCGA CGCCGGTGTC
GACGGGGGCT CGGCGGAAGG GCCGCGCCCC GTCGTCCTCA CCGAGGACCG CCAGGTCCGG
CGCCTGTCCA TGATTGATTT TCCGGTGGCC GTCGAGCCGC TGGCCCGGGT CCGTATCCCG
AAGTCGTTCA ACCCCCGCTC GCCGCAGGCG CGCCGCGACC TCGCCTCGTC GCTGCGCAAC
ATCCGGCTGC CGGAGGAGCC CGGCCGCCGC GAGCGGGCGC GTTCGCTGGC CGCGGACGAC
GCCGAGCTGG CCCGGCTGCG CCGCGCGATG CGGGCCCATC CCGTCCACGA CTGCCCGGAG
CGGGAGGCGC ACCTGCGCTC GGCGGAACGC ATCGACCGCC TGCGCCGCGA GACAGCCGGG
CTCGAGCGCA AGGTCGAGGG GCGGACGAAC ACGGTGGCCC GGACCTTCGA CCGGGTCCGC
GACACGCTCG CCGAGCTCGG TTACCTGGCC GTCGGCGGCG ACTCCGTCAC CGACGCGGGC
GCGATGCTGG CGCGCATCTA CACCGAGCAG GACCTGCAGG TCGCGGAATG CCTGCGCACG
GGAGTGTGGG AGGGGCTGAC CCCACCCGCG CTGGCCGCCG CGGTCTCGAC GCTGGTCTTC
GAGCCACGCG GTGACGACAT CGCCGCGCCG ACGATCCCCG GTGGGGGCGC GCTGCGCGAC
GCGCTCGCGG ACATGGCGGG TGTCTACACC CGGCTGGGCG CCGCGGAGGA CCATCACCGG
CTCGGGTTCC TGCGCCCGCC CGACCTCGGC TTCGTCGCGG TCGCGCACGG CTGGGCCTGC
GGGCGCGGGC TGGAGAAGGT CCTCGAGGAC GCCGGCGCGG ACCTGACGGC CGGGGACTTC
GTCCGCTGGA TGCGCCAGCT CATCGACCTG CTGGACCAGA TCGCGCAGGT CGCCCCGGTC
TACGCGGCGA AGACGTCCGC CGGGCCGGCC TCGGGTGGGC CCACGGCGCA GGAGCGCGCC
GGCACCGACG GGATCCTCGG CGTCGGCCGG GCCGCCCGGG CTGCGATAGA CGCGATCCGC
CGCGGCGTGG TCGCCTACTC GATGTCCGTG TGA
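
The gene sequence is printed in blocks of 10 bases, 60 bases per line, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch in Python of how the listing could be cleaned and its GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used, so the result describes that line rather than the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied verbatim from the record.
first_line = "GTGTCCTCCG CGCCCGGAGC CGTCGAGGAG TTCGCGGCCC GGTACCCGTT CGGGCTCGAC"

seq = clean_sequence(first_line)
print(len(seq), seq[:3], gc_fraction(seq))
```

Passing the full listing to `clean_sequence` in place of `first_line` would give a value that can be compared with the GC content field in the Gene Information section.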
 
Protein sequence
MSSAPGAVEE FAARYPFGLD PFQSEAVAAL AQGEGVLVAA PTGAGKTVVG EFAAHLALAT 
GTRCFYTTPI KALSNQKYAD LVARYGAASI GLLTGDTSRN GDAPVVVMTT EVLRNMLYTE
AAGSARLDSL GYVVMDEVHY LADRQRGAVW EEVIIHLPQH VRLVSLSATV SNAEEFAEWL
VTVRGHTRVI VSEHRPVPLF QHVLADRTLH DLFVDQPSGL DPGVPAFSRR GPGPNGRGAA
PGSVGGATPG SRPGARAAEA RAGDIAGAGA AAGGRAVNPD LLRLAREESR AVYERGRGPR
SSRPGRPGAG NGAGNGRRRS GPPNRPDVIV RLDRAGLLPA ILFVFSRVGC DAAVASCIQA
GLRLTSPDEQ REIREHVRAR TAGVPQADLA VLGYWQWLEG LERGIAAHHA GMLPTFKEVV
EELFVRGLVR AVFATETLAL GINMPARTVV LERLTKFNGQ TRADITPGEY TQLTGRAGRR
GIDVEGHAVV LWQPGLDPLA LAGLASTRTY PLKSSFRPSY NMAVNLVGRL GAERARTVLE
SSFAQFQADK AVVGIARAVR RNQTAIEELT AALECDRGSV TEYDGLRRQI REREADLSRA
GTVRRQSEVA AALAKLRSGD VVRVPVGRRG GLVVVLDAGV DGGSAEGPRP VVLTEDRQVR
RLSMIDFPVA VEPLARVRIP KSFNPRSPQA RRDLASSLRN IRLPEEPGRR ERARSLAADD
AELARLRRAM RAHPVHDCPE REAHLRSAER IDRLRRETAG LERKVEGRTN TVARTFDRVR
DTLAELGYLA VGGDSVTDAG AMLARIYTEQ DLQVAECLRT GVWEGLTPPA LAAAVSTLVF
EPRGDDIAAP TIPGGGALRD ALADMAGVYT RLGAAEDHHR LGFLRPPDLG FVAVAHGWAC
GRGLEKVLED AGADLTAGDF VRWMRQLIDL LDQIAQVAPV YAAKTSAGPA SGGPTAQERA
GTDGILGVGR AARAAIDAIR RGVVAYSMSV