Gene Franean1_3490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3490 
Symbol 
ID5671861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4149716 
End bp4152568 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content77% 
IMG OID641242378 
ProductLuxR family transcriptional regulator 
Protein accessionYP_001507798 
Protein GI158315290 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.123756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.368778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAG CCGCGATGAC CACCGCCGAG CTCACTGCGA CGGGTGCGCG CGGCACCGTC 
CGAACAGCGG AGCACGGCCG GATCCTGAAC GTGCTCGAGG GCGCGGACGG CGGTGGCCCG
GCGATCGTGG AGCTCACCGG CGATCCCGGC TACGGCAAGA CCCGGCTGCT GACGGCCGTG
GCGGACGAGG CCCGCAAGCG TGGCGTCGAC GTGCTGCGGG GCAGGTCGCT CGAGACCCGC
CGGCACCGGC CCTTCCACCC GTTCATCGAC GCCTTCAGCA CCTGGCGGCC CACCGGCGCC
GGGCCGCTGC CGCAGGCCAC CGCGTTCGTC CAGTCCCTGG CGACCGCCGG TGACATGCTC
GCCGACACCG AGACCGGCCG CTGCCGGCTC TTCGCCGAGC TGCGGTCGCT GCTGGAGGAG
ACACTGGCGG CGGCGCCCGA CCCGCTGCTG CTGCTGCTCG ACGACGTGCA CTTCGCCGAC
CGGGCGTCGG TCGAGCTGCT CGAGATCCTG ACGCGTTGGC CGCTGGGGCA GCCGCTGTCG
GTCGTGGTGG CGCATCGCCG GCGTCAGATG TCCTCCTGGC TGCAGGTCAC CCTGCAGCAG
GGCGTCGAGG TGGGCGCGGT GGACCAGGTG GCGTTGCCGG CGCTCAGCCT GCGCCAGAGC
GCCGAGCTTC TCGGGCTCGC GGTCACCGCG CCGGGGCTGG CCGACCTGCA CGAGCGCAGT
GCCGGCAATC CGCTCTACCT GACGGCGTTG GCCGAGCGGG AGCGCGGCCT GAACGGAGTG
GACCGGACCG GCCCGGACCC GACCGGCGCG GGTGGCCACG ACCCGGCCGG CGAGCCGCCC
GACCTGTGGA CGCGCAGCGC GCTCGGCGCC CGGCTGCTGG CCGAGACCGT GCATCTCGAT
GCCCGCCAGC GGCTGGTCGC GCACGCGGCG GCCGTGCTCG GCGACACCTT CTCGGTCGAC
GCCGTGGCGG CCGTCGCGCA GCTCGACCGG GAGGAGGCCT GCCAGGCGCT CGGCCAGCTG
CGCGGCCGCG ACCTGGTGCG CATGGTGCCG GGCGGCGAGC TGGTCTTCCG CCACCCGCTG
CTGGGCAGCT GTGTCTACGG GGAGACCGAC TCCTGCTGGC GGGCCGGGGC GCACCGACGC
GCCCTCGAGC ACCTCCGGTC GACGTTCGCC GCGCCGGCCG TGCTCGCCCG GCATGTCGAG
CGCTCCGGAT CGCACAGCCA TGCCCTGGAC CTGACGGTGC TGCTCGACGC GACCCGGGCG
GCCGTGCAGA GCGAGCAGGG CACCGAGGCC GCCCACTGGC TCAGCGCCGC CCTGCGGCTG
CGCCGCGCCG CCCCGGAAAC CCCGAACGGC ACCGGCAGCA CCGGTGAGAC GGCCGGTGCG
GCCGGTGCGG GGCTCCCGCC GGTCGGCCCG GAGGTCTGGC GCCCGGTGAT CGACCTGCTC
GCCCGCCCCG ACGCCGTCGC GCGCCTGCGC GCCCTCGGCC AGGAGGTCCT GAACACCCCG
GCCGCGTACG GGCCGGCCGG CCGGGTGGGC ACGGCGGCGC ACCTGGCGAT GGTCTCGGCC
TCGCTGGGCT GCCACGACCA CGCCGTCGCC CTGCTGTCGA TCGCGCTCGC CGACGCCCGG
GGGCCGGCCG AGGAGGGGCG CGTGCAGCTG TTCGCCCAGC TCGCCCGCAT CGTCTCCGGA
GGCATACCGA GCCGCGCCGA GGTGGACGCG CTGACCGGGC CGGAGCCGGC CGGCGACCCG
CTGACCCGGG CCGGCGGGCT CGCGGTCCGG GCGCTGTGCG CCGCGCTCAC CACCGACCGC
GGCGCCACCG GGAACAGGTC CGCGGGCGCG GAGGCGGACG CGGCGGCCGC CGCGCTCGAC
GCCCTGGCGG TGAACGGCCC GGACCCCTCG GGCCCGGAGC TGTACTTCCT GCAGTGCCTG
GGCTGGACGG AGGCGATGCT CGGTCGCTAC GACAGCGCGC ACGCCAGGAT CACCCGCGCG
GTGCGCGGCG CCCGCAAGCA CGGCGAGCGC CACCTGCTGC CCGCCCTGCT CAACAGCCTC
GCCTACATCC ACTACCAGTC GGGGCGCCGG AGCGAGGCGA TCGAGGTCAC CCGGGAGGCC
CAGCGCGTGT CGCAGCGGGC CGGTCGCGCC GACCAGGTGG CACTGGCCCA GGCCGTGGCC
ACGGCCGCCT GGGCCGGGCT GGGGCGCTCC CCCACGTTCG GCCCGGAGCT GCCGGAGGGC
GTGCACGCCG ACGCGCCCCG CACGCCGCTG ACCGCGCTGC TGTTCGCCGA GGCCGCGCTG
GCCGCCGGCG ACGGACCGGC CGCCCTGGCC CTGCTGGGGT CGAAGCGGGA GACCTGGCGG
GTCGCCGAGC CGATCCCGGT GCTGGGTGCC CGCATCTTCG AGGTGCTGGC CGCCGCCGCG
CTGCTGGCCG GGGAGGATCC ACGCCCGTGG GCGGCCCGAG CGGCCGAGGA GGCCGCCCGG
GTCGGGCTGC CCGAGCAGCG GGGCCACGCG CTGCTCGCCC GCGGCTACGC GCTGGGTCGC
GGCCCGGAGG CCGACCGGTG CTACGCCGAG GCGGCCGGCC TGCTGGCGGG CGCGCCCGCC
GGCGGCCGGG CCCGGACGTT CGCGCGGTCG GCTCAGCAAC GGCGCCACCG CCCCGGCCCC
GACCCGCTGG TCGAGCTCAC CACCAGGGAG CAGGAGGTCG CCCAGCTCGC CGGCCAGGGG
CTGCGGACCC GGGACATCGC CAGCCGGCTG CGGGTGAGCC CGCGCACCGT CGACACCCAC
CTCTCCCACA TCTACGACAA GCTCGGCATG AGCTCGCGGG TCGAGCTGGC CCGGTTGCTG
TCCCCGGTCC CGCTGGTGCC CCCGGGCCAC TGA
 
Protein sequence
MTTAAMTTAE LTATGARGTV RTAEHGRILN VLEGADGGGP AIVELTGDPG YGKTRLLTAV 
ADEARKRGVD VLRGRSLETR RHRPFHPFID AFSTWRPTGA GPLPQATAFV QSLATAGDML
ADTETGRCRL FAELRSLLEE TLAAAPDPLL LLLDDVHFAD RASVELLEIL TRWPLGQPLS
VVVAHRRRQM SSWLQVTLQQ GVEVGAVDQV ALPALSLRQS AELLGLAVTA PGLADLHERS
AGNPLYLTAL AERERGLNGV DRTGPDPTGA GGHDPAGEPP DLWTRSALGA RLLAETVHLD
ARQRLVAHAA AVLGDTFSVD AVAAVAQLDR EEACQALGQL RGRDLVRMVP GGELVFRHPL
LGSCVYGETD SCWRAGAHRR ALEHLRSTFA APAVLARHVE RSGSHSHALD LTVLLDATRA
AVQSEQGTEA AHWLSAALRL RRAAPETPNG TGSTGETAGA AGAGLPPVGP EVWRPVIDLL
ARPDAVARLR ALGQEVLNTP AAYGPAGRVG TAAHLAMVSA SLGCHDHAVA LLSIALADAR
GPAEEGRVQL FAQLARIVSG GIPSRAEVDA LTGPEPAGDP LTRAGGLAVR ALCAALTTDR
GATGNRSAGA EADAAAAALD ALAVNGPDPS GPELYFLQCL GWTEAMLGRY DSAHARITRA
VRGARKHGER HLLPALLNSL AYIHYQSGRR SEAIEVTREA QRVSQRAGRA DQVALAQAVA
TAAWAGLGRS PTFGPELPEG VHADAPRTPL TALLFAEAAL AAGDGPAALA LLGSKRETWR
VAEPIPVLGA RIFEVLAAAA LLAGEDPRPW AARAAEEAAR VGLPEQRGHA LLARGYALGR
GPEADRCYAE AAGLLAGAPA GGRARTFARS AQQRRHRPGP DPLVELTTRE QEVAQLAGQG
LRTRDIASRL RVSPRTVDTH LSHIYDKLGM SSRVELARLL SPVPLVPPGH