Gene Franean1_5464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5464 
Symbol 
ID5673795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6607621 
End bp6609438 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content71% 
IMG OID641244319 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_001509725 
Protein GI158317217 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family
[TIGR01389] ATP-dependent DNA helicase RecQ 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.195323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGTGC TCCGCCGCGT CTTCGGCTAC GAGTCCTTCC GCGACGGCCA GCAGGAGATC 
ATCGATCACG TCGTCGGCGG TGGCGACGCG CTGGTCCTCA TGCCGACCGG CGGCGGCAAG
TCGCTGTGCT ACCAGATCCC GGCGCTCGTC CGGCCTGGCA CCGGCGTCGT CATCTCCCCG
CTCATCGCGC TGATGCAGGA CCAGGTCGAC GCGTTGCTGG CGCTCGGGGT CCGGGCGGGG
TTCCTCAACT CCACCCAGCA GGCCGACGAG CGCCGCGCGG TGGAGTCCGC GTTCCTCGCC
GGCGAGCTCG ACCTGCTCTA CCTGGCCCCC GAGCGGCTGC GGGTACGCGC GACGATCGAG
CTCCTCGACC AGGGCGAGAT CGCGCTGTTC GCGATCGACG AGGCGCACTG CGTCGCGCAG
TGGGGGCACG ACTTCCGGCC CGACTACCTC CTGCTCTCCG AGCTGCACCA GCACTGGCCG
CAGGTGCCGC GCGTCGCACT GACCGCCACC GCCACCCCGG CGACCCACCA GGAGATCACG
AACCGGCTCG GGCTGGGAAA CGCCCGCCAT TTCGTCGCCG ACTTCGACCG GCCGAACATC
CAGTACCGCA TCGTGCCGAA GAACGACCCG AAGGCACAGC TGCTGGAGCT CCTGCGCACC
GAGCACCCCG GCGACGCGGG CATCGTGTAC TGCCTGTCCC GGGCGTCGGT CGAGAAGATC
GCGGAGTTCC TGGTGTCGAA CGGGATCGCC GCGCTGCCCT ACCACGCCGG CCTCGAAGCA
CGCGTCCGCG CCGAGCACCA AGCCCGTTTC CTCCGTGAGG ACGGCCTGGT CATGGTCGCC
ACGATCGCCT TCGGGATGGG CATCGACAAG CCGGACGTCC GCTTCGTCGC CCACCTCGAC
CTACCGAAGT CGGTAGAGGG CTATTACCAG GAGACCGGCC GCGCCGGCCG GGACGGGCTG
CGCTCCACCG CCTGGCTCGC GTACGGCCTG CAGGACGTGG TCCAGCAGCG CCGGCTCATC
GACGCCTCCG ACGGCGACGC CACGCACCGC CGGCGGCTCA ACTCCCACCT CGACGCGATG
CTCGCACTGT GCGAGACCAT CGAGTGCCGC CGGGTCGGCC TGCTGGCGTA CTTCGGGCAG
CAGGGGTCGA GCGCCTGCGG CAACTGCGAC GCCTGCCTGC ACCCGCAGCA GTCCTGGGAG
GCGACCGTGC CCGCCCAGAA GCTGCTGTCG ACGGTGCTGC GCCTGCAGCG GGAACGCCGG
CAGAAGTTCG GCGCCGGCCA CATCGTCGAC ATCCTGCTGG GCCGGCGCAC ACCCAAGGTC
AACCAGCACG GCCACGACTC GCTGACGGTC TTCGGCATCG GCACCGAGCT CAGCGAGGCC
GAGTGGCGTG GCGTGGTCCG CCAGCTGCTG GCCCAGGGGC TGCTGGCCGT CGAGGGTGAG
CACGGAACGC TCGTCCTCAC CGACGCGAGC GCGGACGTCC TGCGCGGGCA GCGGACGGTC
TCCATGCGGC GCGAACCGAA GCGACCGGCG AAGGCCACCA GCTCCTCGGC GACGAAGGCA
CGCCGGGCCG AGCCGGTCGA ACTGTCCGCG GAGGCGGCGC CGCTGTTCGA ACGGCTGCGC
GCCTGGCGGG GCGCCACCGC CAAGGAACAG GGCGTTCCCG CCTACGTGAT CTTCCATGAC
GCGACGCTGC GCGAGATCGC CACCCGCGCG CCCTCCTCGC TCGCCGAGCT CGCGACGGTG
AACGGCGTCG GCGAGAACAA GCTGGCCAAG TACGGCGAAC ACATCCTCCC GCTGTGTAGC
GCCACGCCCA CGGACTGA
 
Protein sequence
MEVLRRVFGY ESFRDGQQEI IDHVVGGGDA LVLMPTGGGK SLCYQIPALV RPGTGVVISP 
LIALMQDQVD ALLALGVRAG FLNSTQQADE RRAVESAFLA GELDLLYLAP ERLRVRATIE
LLDQGEIALF AIDEAHCVAQ WGHDFRPDYL LLSELHQHWP QVPRVALTAT ATPATHQEIT
NRLGLGNARH FVADFDRPNI QYRIVPKNDP KAQLLELLRT EHPGDAGIVY CLSRASVEKI
AEFLVSNGIA ALPYHAGLEA RVRAEHQARF LREDGLVMVA TIAFGMGIDK PDVRFVAHLD
LPKSVEGYYQ ETGRAGRDGL RSTAWLAYGL QDVVQQRRLI DASDGDATHR RRLNSHLDAM
LALCETIECR RVGLLAYFGQ QGSSACGNCD ACLHPQQSWE ATVPAQKLLS TVLRLQRERR
QKFGAGHIVD ILLGRRTPKV NQHGHDSLTV FGIGTELSEA EWRGVVRQLL AQGLLAVEGE
HGTLVLTDAS ADVLRGQRTV SMRREPKRPA KATSSSATKA RRAEPVELSA EAAPLFERLR
AWRGATAKEQ GVPAYVIFHD ATLREIATRA PSSLAELATV NGVGENKLAK YGEHILPLCS
ATPTD