Gene Franean1_5950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5950 
Symbol 
ID5674271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7254946 
End bp7257195 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content74% 
IMG OID641244798 
ProductATP-dependent DNA helicase RecQ 
Protein accessionYP_001510200 
Protein GI158317692 
COG category[L] Replication, recombination and repair 
COG ID[COG0514] Superfamily II DNA helicase 
TIGRFAM ID[TIGR00614] ATP-dependent DNA helicase, RecQ family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0569292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGATC AGGCGACGGT CAGCGGGACG CTGCGCGATC GGGCGGAGGA GCTGCTCCGG 
GCTCTGGCTG GGCCGGGCGC GGTCCTGCGT GACGACCAGT GGCAGGCGAT CGACGCGCTC
GTCAGCGGGC GGCGGCGGGT GCTGCTCGTC CAGCGCACGG GCTGGGGGAA GTCCGCGGTG
TACTTCCTCG CCACGGCGCT GCTACGCGGT CTCGGCGACT CGGATCCGGG TGACTCGGAT
CCGGGTGACT CGGAACCAGG TGACTCGGAA CCAGGTGACT CGGGTTCAAG CGGTCCGAGG
CCGGGCGGCG GTCCGGCCGC CCAGGTGGGG CCCACGGTGA TCGTCTCCCC GCTGCTGGCG
CTGATCCGCA ACCAGGCGGC CGCGGCGGCA CGGGTGGGCA TCCGGGCCGG GGAGATCCAT
TCGGGGAACC TCACCGAGTG GGACGAGGTC TACGCCGCGC TGCGGGCGGG GGAGCTGGAC
CTGCTGCTCG TCGGCCCGGA GCGGCTGAAC AACCCGACCT TCCGCGATGA GTACCTCCCG
GAGCTCGCGG CCACGACGGG GCTGCTTGTC GTCGACGAGG CGCACTGCAT CTCCGACTGG
GGCCACGACT TCCGTCCGGA CTACCGCCGG CTGCGGTCAC TGATCGGCGG GCTGCGGCCG
GATGTGCCCG TGCTCGCGAC GACCGCCACG GCCAACGCGC GGGTGGTCGA GGACGTCAGC
GAGCAGCTCG GGGCCGGCTC CACGAGCGCG GAGACCGTGG TCCTGCGGGG GTCTCTCGAC
CGGGAGAGCC TCCGGCTGGC GGTTGTGTCG CTTCCCGCGG CCGAGCAGCG GTTCGGCTGG
CTGGCCGACC ATCTCGCAGA GCTACCCGGC TCCGGGATCA TCTACACGCT CACCAAGCCG
GGCGCGGAGG AGCTGACCGC GTTCCTTCGC GGCCAGGGCC ACGAGGTGAC GACCTACCAC
GGCGGCACGG AGCCGGCCGA GCGCATCGCC GCCGAGGAGG ACCTGCTCGG CAACCGGGTG
AAGGCGCTGG TGGCGACGAG CGCTCTCGGG ATGGGCTTCG ACAAGCCCGA TCTCGGGTTC
GTCGTCCACG TCGGTGCGCC GAACTCGCCG ATCGCCTACT ACCAGCAGAT CGGGCGGGCC
GGGCGTGCGG TGGAGTCCGC CGAGGTCGTC CTGCTGCCCG CCACCGAGGA CCGTGACATC
TGGCGGTACT TCGCCGACAC GTCGTTCCCC CCGGAGCCGG TCGCCCGGCA GGTGCTGGAC
GTCCTGGCCC AGGCCGGGCG GACGATGTCG ACGCAGGCAC TGCTCGCCGC CGTCGACCTG
GGGCACTCCC GGCTCGAGCA GATGTTGAAG GTGCTCGACG CGGACGGCGT CGTCCGGCGG
GTCAAGGGCG GCTGGGAGGC GACGGGCGAG CCGTGGGTCT ACGACGCCGA GCGCTTCCGT
CGCGTCGCCG CCGCGCGGGC CCGCGAGCAG CGGGCGATGC TCGACTACAT CGCGACGCCG
TCCTGCCGGA TGGAGTTCCT GCGCCACGCG CTCGACGATC CGTACGCGGT GCCGTGCGGC
CGGTGTGACC GGTGCACCGG TCGGGTGTGG TCCACCGAGG TCTCCTCCCG GTCGAGGGAC
AGCGCCCGCG AGGAGCTGCG CCGGACGGGC GTGCCCGTTG AGCCGCGCCG GATGTGGCCG
ACGGGCATGC GCACGCTGGG GGTCGCCGCG TCCGGCCGGA TTCCCGCGTC GGTCACCGCC
GAGCCCGGCC GCGTGGTCGC ACGGCTCAAC GACCTGGGCT GGGGCAACCG GCTGCGCCAG
CTGTTCGCCC CTCCTCCCGA GGCCCAGGAC GGCCCTGTCC CCGACGACCT GTTCGAGGCG
GTGGTGCGGA CCCTGACCGA ATGGCGGTGG GAGCGGCGCC CGGCGGCCGT GGTGACTGTG
GCGTCCAGGA CACGCCCGTG CCTGGTGGCG GACCTCGGTG AACGGATCGC GGGTGTCGGG
CGGCTGCCGT TGCTGGGGCA GCTCCCCCGA GTGGCCGGCG GGCCCGCGGC CGGCCGGGTG
CATAACAGCG CCCACCGCCT CGCCGGCCTG TGGTCGGCCT TCGAGGTGCC ACCGCCTATC
GCCGGCCCGC TCGCCGAGCT GACCGGGCCG GTGCTGCTCG TCGACGACCT GATCGTCACG
GGCTGGACGA TGACCGTCGC GGCCAGGGCG CTGCGCCAGG CCGGCGCGCC GGGCGTGCTG
CCGTTCGCCC TCGCCGTCGA GACCGGCTGA
 
Protein sequence
MADQATVSGT LRDRAEELLR ALAGPGAVLR DDQWQAIDAL VSGRRRVLLV QRTGWGKSAV 
YFLATALLRG LGDSDPGDSD PGDSEPGDSE PGDSGSSGPR PGGGPAAQVG PTVIVSPLLA
LIRNQAAAAA RVGIRAGEIH SGNLTEWDEV YAALRAGELD LLLVGPERLN NPTFRDEYLP
ELAATTGLLV VDEAHCISDW GHDFRPDYRR LRSLIGGLRP DVPVLATTAT ANARVVEDVS
EQLGAGSTSA ETVVLRGSLD RESLRLAVVS LPAAEQRFGW LADHLAELPG SGIIYTLTKP
GAEELTAFLR GQGHEVTTYH GGTEPAERIA AEEDLLGNRV KALVATSALG MGFDKPDLGF
VVHVGAPNSP IAYYQQIGRA GRAVESAEVV LLPATEDRDI WRYFADTSFP PEPVARQVLD
VLAQAGRTMS TQALLAAVDL GHSRLEQMLK VLDADGVVRR VKGGWEATGE PWVYDAERFR
RVAAARAREQ RAMLDYIATP SCRMEFLRHA LDDPYAVPCG RCDRCTGRVW STEVSSRSRD
SAREELRRTG VPVEPRRMWP TGMRTLGVAA SGRIPASVTA EPGRVVARLN DLGWGNRLRQ
LFAPPPEAQD GPVPDDLFEA VVRTLTEWRW ERRPAAVVTV ASRTRPCLVA DLGERIAGVG
RLPLLGQLPR VAGGPAAGRV HNSAHRLAGL WSAFEVPPPI AGPLAELTGP VLLVDDLIVT
GWTMTVAARA LRQAGAPGVL PFALAVETG