Gene Franean1_4346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4346 
Symbol 
ID5672701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5187882 
End bp5190974 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content74% 
IMG OID641243219 
ProductLuxR family transcriptional regulator 
Protein accessionYP_001508636 
Protein GI158316128 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGAAC GGGGTTCGCT GGTCGGGCGC GACCGGGAAA TGGCCGCGGT GGGCGCCGCG 
TTGGACCGGC TCGCCTCGGC GCGGCGCCGG CCAGGTTTCG ACTGGATGGA GATCCGTGGC
GAGCCCGGCA TCGGCAAGAG CCGCCTGATC GGCGAACTCA CCGAGCTTGC CGACGCCCGT
GGCCATCTCG TACTGCGGGG CCACGGCGCC GAGCTCGAAC AGGACATTCC GTTCAACGCC
CTCGTCGATG CGCTCGACGA CTACCTCGGC GCGGTGGATC CGGCGCGGTT GCGCCAGCTC
GGCGCTGACC GGCTCGACGA GCTGGCCGCC GTGTTTCCGG CGTTGCGCTC GCTCGGGACC
GTGCCGACCG GCAACACCTC CGAGATGCAG CGCTGGCAGA CCTACCGGGC GGCGCGAGCG
CTACTCGAGC TGCTCGCTGC CAACCGGCCG CTGGTACTGG CCCTGGACGA CCTGCACTGG
GCCGACACCG CGTCGGCCGA GCTCGTCGCT CACCTGCTCC GCCGGCCGCC GAAGGCGCGG
GTGCTGCTGG TCGTCGCGGT GCGGCCCCGC CAGGTGCCGA AGGCCCTGGC CGCGGCGTTG
TCGGCCGCGG CGGCGTCCGC CGTTGTCGCG ACCACCGACG GCCAAGCCGG GGCGAACTCA
CACGCGACCG TGGGCGTTCG ACTGGAGCTT GCCCCGCTGT CGCCCGAGGA CAGCGACTCG
CTGCTCAGTC CGGTGACCCG TGACCCGGCG CTCCGGCGTC GCTGGTATCG GGAGAGCGGC
GGTAACCCGT TCTACCTCGA ACAGCTTGCC CGCACCCCCA CCCGCCCGCT GTCCGCAGCT
GCCGGGCCGT CGTCCGTACC GCCGGCGTTG GCGGCCTCGA CCGGCCTCGA CGGACCGGAC
GACGTGCCGC CGGCGGTGAC GGCGGCGGTC GGCGCCGAGA TCGCACAGCT GCCCGAGGTC
GTACGGCGGT TCGCGCAGGG CGCTGCTGTC GCCGGCGAGG TGTTCGAGCC GGACCTGGCC
GCCGAGGCCG CCGCGCTCCC CGAAACCGAC GCCTTCGAAG CACTCGACGC GCTGGTCGCG
GCGGACGTGG TGCGACCGAC CGAGATGCCG CGCCGGTTCA CGTTCCGGCA TCCGATCGTG
CGCCACGCCA TCTACATGTC CGCCGGGCCG GGCTGGCGGA TCGCCGCTCA CGCACGAGCG
GCTGATGCGC TCGCCGCCCG GGGCGCCGCG GCCACCGCCC GAGCCCACCA CGTCGAGCGT
GCCGCCCGTC CCGGCGACGA GGCCGCGATC ACGCTGCTGA CCAAAGCGGC CGCGGCCAGC
CGGCTACGCG CGCCGGCAGC CGCCGCGCAC TGGTACGCCG CCGCGCTCAG ACTGCTACCG
GGCGGTGAGT CGAGCCCCGG CCAGAGTACC GACGTTGGCG TCGACGTGGA CGCGGACGGC
GGGTTGCGGC GACTGGAGCT GCTGTCCCGG CTGGCTGCCG CACTCGACGC GAGTTACCAG
CCGCAGCAGG CCCGCATGGT ACTCGACGAG ATCATGGGTC TGATTCCGCG GGAATTCGGT
GCCGAGCGGG CTCATCTCGT CGCGCTGCGC TCGGCGGTCG ACCATGTGCT GGGCCGGCAC
GGCGAGGCGC GTGCCCTGGT GTTGGACGCC GTGGCGAGCG CCGAACCGGG CACCCGAGAG
AGCTGTCTGC TGCGGCTTCA GCTTGCCATC GACCATTTCT ACACCGGTGA ATACGATGGG
ATGCGCCGCT GGCAGCAGGA AGCGCACGCG CTCGCTGGCA CGCTCGACGA CGCCCCGCTA
CTGGCCGCCT CGGCCGGACT GCTGGCGGGT GCCGAATACA TGGTCGGTGA CGTCCCGGCC
GCGATCGCCG AGGCCGCCGG CGCGGCACGC CGCTACGACC TGCTCTCGGA CGACCAGGTC
ACCCCGCACC TCGACAAGCT CGCGTGGCTG GGCTGGACCG AGGCGTTCCT CGAACGGTTC
ACCGACGCGC TACGCCACCT CGACCGGGTC GACGCGCTGG CGCTGCGCAA CGGCCGGGGC
AGCATCGGCA CACTGACGGC AGTCGCACGG TCGCTGGTCC TGACATCGCG GGGACGGTTG
CCGGAGGCCG CGGCGGCGGC CGAAGCCGCG GTGGAGGCGT GCCTGCTCAC CCCGCACCTG
CCCTTCCTGT CCTGGGCGCT GGCGGCGCGG TGTGCCGCGG CGACGCTGGC GGGAGACCTC
CCCGAGGCAC TGCGGTCGGG TGCGCAGGGA GCACGGGCGG CCAACCCGGA GACTGACGCC
GTCTCGGTGA TGGCCGGTTC CTATTTCGCG GAGGCGCTGG TCGAGGCCGG TGAGCCCGAC
CGCGCCGTCG ATGAACTCCT GGGTGCGGCC GGCGGCGCCG AGCTGCCGCG CATCGAAGCT
CCCATTCGGC CGTACTGGTA CGAGGTGCTC ACGCGGGCCG AACTGGCTCG CGGGATGCCG
GAGGCGGCCG CCGGCTGGGC GGTGCTCGCC GAGCGGACGG CCACCGACGC CGGCGGTGGG
CTCGCGGGAC GGAAGGCCTC GGCGATGCGA GCGAGAGCAG CCGTCGAGTT CGCCAGGGGA
CAGCCGTCGG CCGCCGCCGC GTCGGCGCTG GCCTCAGCCG CCGAGGCCGA GCGGGCCGGC
CTGCCGATCG AGCTCGGCCG GTCGCTGATC GTGGCTGGGC GGGCGCTGGC CGCCGACGGG
CAGAACGCGC GGGCGGTCAG CGAGCTGCGC CGGGCCGAGG CCCGACTCGA CGCGTGCGGT
GCCAACCGTC CCCGGGACGA GGCGGCCCGG CTGCTGCGCC AGCTCGGCGA GCGGGTGTCT
CGGGGCGGTC GACCGTCGAC GCGGACCCAA GCCCGGGCCG CTCAGACCCT GGCTCATCAC
CAGACCCAGA CGGTCGTCCT CAGCGTGGCA GCCGGCACGC TCAGCACCCG AGAACGTCAG
ATCGCCGAGC TGGTCGCCGC CGGCCAGACG AACCGCCAGA TCGCCGCCGC GCTGTTCGTC
AGCGAGAAGA CCGTCGAGAG CCATCTGACG AAGGTGCTCG CCAAGCTCGG CGTCCCCACC
CGAGCCGGTG TCGGTTCCGC GCTCCGTTCC TGA
 
Protein sequence
MGERGSLVGR DREMAAVGAA LDRLASARRR PGFDWMEIRG EPGIGKSRLI GELTELADAR 
GHLVLRGHGA ELEQDIPFNA LVDALDDYLG AVDPARLRQL GADRLDELAA VFPALRSLGT
VPTGNTSEMQ RWQTYRAARA LLELLAANRP LVLALDDLHW ADTASAELVA HLLRRPPKAR
VLLVVAVRPR QVPKALAAAL SAAAASAVVA TTDGQAGANS HATVGVRLEL APLSPEDSDS
LLSPVTRDPA LRRRWYRESG GNPFYLEQLA RTPTRPLSAA AGPSSVPPAL AASTGLDGPD
DVPPAVTAAV GAEIAQLPEV VRRFAQGAAV AGEVFEPDLA AEAAALPETD AFEALDALVA
ADVVRPTEMP RRFTFRHPIV RHAIYMSAGP GWRIAAHARA ADALAARGAA ATARAHHVER
AARPGDEAAI TLLTKAAAAS RLRAPAAAAH WYAAALRLLP GGESSPGQST DVGVDVDADG
GLRRLELLSR LAAALDASYQ PQQARMVLDE IMGLIPREFG AERAHLVALR SAVDHVLGRH
GEARALVLDA VASAEPGTRE SCLLRLQLAI DHFYTGEYDG MRRWQQEAHA LAGTLDDAPL
LAASAGLLAG AEYMVGDVPA AIAEAAGAAR RYDLLSDDQV TPHLDKLAWL GWTEAFLERF
TDALRHLDRV DALALRNGRG SIGTLTAVAR SLVLTSRGRL PEAAAAAEAA VEACLLTPHL
PFLSWALAAR CAAATLAGDL PEALRSGAQG ARAANPETDA VSVMAGSYFA EALVEAGEPD
RAVDELLGAA GGAELPRIEA PIRPYWYEVL TRAELARGMP EAAAGWAVLA ERTATDAGGG
LAGRKASAMR ARAAVEFARG QPSAAAASAL ASAAEAERAG LPIELGRSLI VAGRALAADG
QNARAVSELR RAEARLDACG ANRPRDEAAR LLRQLGERVS RGGRPSTRTQ ARAAQTLAHH
QTQTVVLSVA AGTLSTRERQ IAELVAAGQT NRQIAAALFV SEKTVESHLT KVLAKLGVPT
RAGVGSALRS