Gene Franean1_4189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4189 
Symbol 
ID5672544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4982299 
End bp4985007 
Gene Length2709 bp 
Protein Length902 aa 
Translation table11 
GC content76% 
IMG OID641243062 
Producthypothetical protein 
Protein accessionYP_001508479 
Protein GI158315971 
COG category[R] General function prediction only 
COG ID[COG3973] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.060673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCAAACG AAGCGGTGCC CGCCGCGGCT GTGCCCCCCG CACCCCCGGT GCCCCCTGAG 
AGCGCCCCGA CCGGAGACAC CTCCGGGGGC GACGAGCACG CTCGGGAGCA GCAGTACCTG
ACCATGCTGC ACGAGCGGCT GGACGGGCTG CGCGCCGCAG CCGCCGCCGG GCTCGCCGAA
GCCCTGCTGC GCGACGATCC CGAACCCGGC GCGCGCGCCG ACCGCGACGC GCTCGCCGCC
CGCCACGCCG ACCAGGTCGC CCGCTTCGAC GCGATGCGCG ACCGGCTGCT GTTCGGCCGG
CTCGACATGG CCGACGGCGA GCAGCGCTAC ATCGGCCGGA TGGGCGTCCT CGACCCCGAA
GCCGATTACC AGCCGCTGCT CATCGACTGG CGGGCCCCGG CCTCCCGCCC GTTCTACCTG
GCCACCGGGG CCACCCCGCT GCACGTCGCC CGCCGCCGCC ACATCCGCAC CGTCGGGCGG
CGGGTGACCC ACCTCGACGA CGAGGTGTTC GGCCCGACGA CCCTCGGCGG CGGGGTCCTC
GGCCTGCGGA CCCTCGACGG GCTCGGCGCC GACGAGGGCG ACCCCGACCT CACCGCCGGC
CCGCCCACCG CGACCGGTGA ACCGGACGGC ACACCGGCGC ACGACAGCCT CGTCGGCGAG
GCGGCGCTGC TGGCCACCCT CGGCGCGCAC CGCACCGGCC GGATGCGCGA CATCGTCGCG
ACCATCCAGG CCGAACAGGA TCGGATCATC CGCTCCGAGC AGGACGGCAT CCTCGTCGTC
GAGGGCGGGC CGGGTACCGG CAAGACCACC GTCGCGCTGC ACCGTGCCGC ATACCTGCTG
TACTCCCGGC GCGAGCAGCT GTCCAAGCGC GGGGTGCTGA TCGTCGGGCC GAACCCGGCC
TTCCTGCGCT ACATCGAGCA GGTGCTGCCC TCCCTCGGCG AGACCGGGGT GCTGCTGTCG
ACCGTCGGCG ACCTCTTCCC CGGCGTGCGG GCCCGCCGCG CCGAGAGCAC GACCGCCGCG
GAGGTCAAGG GCCGGCCCGA GATGGTCGAC ATCCTGGCCG CCGCCGTCCG CGACCGCCAG
CGGCTCCCCG ACACGCCCCT GGAGATCACC GTCGCGGACG GGGTCGCCCG CCTCGACGAG
ACGATCGTCG TGCCGGCCCG GGCCGCCGCC CGGCTCACCG GACGCCCGCA CAACCACGCC
CGGTCGGTGT TCGTCCGCGA GGTGATCACC GCCCTGACCC GCCAGATCGC CGACCGGTAC
GAGTCGTCGA TCGACGAGGT CGACATCCCG GACTTCGTCG ACGACTTCAT GCTCTGGCCC
GACACCGACG CCGCGCGCGA CGCCCTCGGC GACGGCACCG AAGGCGGCGA GCCGGGTGGG
GGCAGCGCGG CGGCGGACGG CGGCGACTGG CCCAGCACGG CCCTGGCCGC CGCGGCCGCG
GCGGCGGACC GGCCGGTGCT CGACCCCTCC GACCTGGCCG ACCTGCGCCG CGATCTGAGG
TCCGACCCCA GGCTCGCCGA GGCGATCGAC GGCCTGTGGC CGCTGCTGAC CCCGCAGGCG
CTGCTGGAGG ACCTGTTCGC CTCCGCCGAC GCGCTGTCGC ACGCCGCCCC CGGGCTGACC
GACGCCGAGC GGGCGGCACT ACGCCGCGAC CCCGGCGGGC GCTGGGCCCC CGGCGACTGG
GCCCCCGGCG ACTGGGCACC CGCCGACACC CCGCTGCTCG ACGAGGCCGC CGAACTGCTC
GGCAACGACC CCCGGGCGGC GATCGACGCG GCCGTCGCCG CGCACCTGGA GCGCCAGCAG
CGGATCGACT ACGCCGGCGG CGTGCTGGAC ATCCTCTCCC GCGGCGACAC CGAGGATCCC
GACGGCGAGC TGCTGATGGC CTCCGACCTC ATCGACGCCG ACCGGTTCGG CGAACGCCAG
GAGGAGGTCG ACACCCGCAG CACCGCCGAG CGCGCGGCCG CCGACCGCAC CTGGGCGTTC
GGCCACGTGA TCGTCGACGA GGCGCAGGAG CTGTCCCCGA TGGCCTGGCG GATGGTGATG
CGCCGCTGCC CGACCCGGTC GATGACGCTG GTCGGCGACG TCGCGCAGAC CGGCGACGCG
GCGGGGAGCT CCTCGTGGGC GCAGGCGCTC GACCCGTTCG TCGGCCCGCG CTTCACCCTC
GAGCGGCTCA CGGTGAACTA CCGCACCGGC GCGGAGATCA TGGACATCGC CGGCGACGTG
CTCGCCGCTC AGGGGCGTGG GCTGCGCGCG CCGCGGTCGG TGCGCCGGTC GGGCCGCGCG
CCGTGGCGGC TCACGGTCGG CGCCCACGAG CTGGCCGAGC ATCTGGAGCA GCTGGTACGT
GCCGAGCGCG CGGCGGCCGG CGGCGGGCGG CTGGCTGTGA TCGTGCCGCG CTCCCGCCGC
GACGAGCTGA CATCCCTCAG CGTGGACGCG GAGGGGGGCG CGGACGGGGA TCCGGAGCGG
CCGGTCGTCG TGTTGACGGT CCGTGAGTCC AAGGGGCTGG AGTTCGACGC GGTCATCGTG
GTCGAGCCCG AGCGCATTCT GGCCGAGTCG CCGCGGGGCG CCGGTGACCT GTACGTCGCG
CTGACCCGGC CGACGAACCA GCTCGGGGTC GTGCACGTCG GCGCGCTGCC GGCGGTGCTG
AGCCGGCTGC GACCGCGCGA GGCGACCGGT GGCGTTCCGG CGGCCGGTCC GGCGGCATCC
GGTCGCTGA
 
Protein sequence
MSNEAVPAAA VPPAPPVPPE SAPTGDTSGG DEHAREQQYL TMLHERLDGL RAAAAAGLAE 
ALLRDDPEPG ARADRDALAA RHADQVARFD AMRDRLLFGR LDMADGEQRY IGRMGVLDPE
ADYQPLLIDW RAPASRPFYL ATGATPLHVA RRRHIRTVGR RVTHLDDEVF GPTTLGGGVL
GLRTLDGLGA DEGDPDLTAG PPTATGEPDG TPAHDSLVGE AALLATLGAH RTGRMRDIVA
TIQAEQDRII RSEQDGILVV EGGPGTGKTT VALHRAAYLL YSRREQLSKR GVLIVGPNPA
FLRYIEQVLP SLGETGVLLS TVGDLFPGVR ARRAESTTAA EVKGRPEMVD ILAAAVRDRQ
RLPDTPLEIT VADGVARLDE TIVVPARAAA RLTGRPHNHA RSVFVREVIT ALTRQIADRY
ESSIDEVDIP DFVDDFMLWP DTDAARDALG DGTEGGEPGG GSAAADGGDW PSTALAAAAA
AADRPVLDPS DLADLRRDLR SDPRLAEAID GLWPLLTPQA LLEDLFASAD ALSHAAPGLT
DAERAALRRD PGGRWAPGDW APGDWAPADT PLLDEAAELL GNDPRAAIDA AVAAHLERQQ
RIDYAGGVLD ILSRGDTEDP DGELLMASDL IDADRFGERQ EEVDTRSTAE RAAADRTWAF
GHVIVDEAQE LSPMAWRMVM RRCPTRSMTL VGDVAQTGDA AGSSSWAQAL DPFVGPRFTL
ERLTVNYRTG AEIMDIAGDV LAAQGRGLRA PRSVRRSGRA PWRLTVGAHE LAEHLEQLVR
AERAAAGGGR LAVIVPRSRR DELTSLSVDA EGGADGDPER PVVVLTVRES KGLEFDAVIV
VEPERILAES PRGAGDLYVA LTRPTNQLGV VHVGALPAVL SRLRPREATG GVPAAGPAAS
GR