Gene Franean1_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0072 
Symbol 
ID5668497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp86789 
End bp89791 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content74% 
IMG OID641239000 
ProductLuxR family transcriptional regulator 
Protein accessionYP_001504445 
Protein GI158311937 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.996458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGCC TGCGCAGCGT CAACCGCTCG GCGACGGAGG CGGCCCGGCG GTTGCGCCGG 
ATCGAAGCGC AGGACGCCGA CCCGGGACCG GCCCCGGTGT CCGGCCTGAG GCCGGGGCCG
CGATCCGGCC CGGCGGCCGG CGCCCTACGT GACGCCCTGC TGGTGGACGG CGGGCGGGCG
TCGTTCGTGG GGCGGCCGGA GGCGTTGCTT CAGCTGCGCC GGGCGGTCGC CGATGTCCTC
GGCGGGCGGC CGCGGGTCGC GCTGCTGGAG GGCCCGGCGG GAATCGGCAA GACGGCTCTG
GCCCGTCATC TCCTCGGCGA GGCGGATGCC GGCTGCGTGC TGTGGGCGAG CGGCGAGGAG
GACGAGACGG GTCTCAGCTT CGGGGTGCTC GACCAGCTGA CCGCCGAGGC GGCCCGGGTA
CTCGGAGCGG ACCAGCCCTC CCCGAGCGGT CCGGGTGAGC TGACCGGCAC CGCGGGATCC
GTCCGCGGCC CGTCACCCGA TCCGCTGGTC GCCGGCGCGG CTCTGGTCGA ACTGCTCGGC
GAGCTGCAGC GGTCCGGCCC GGTCGTGCTG GTCGTCGACG ACGCGCACTG GGCGGATCGG
CCGTCCCTGC AGGCGCTGAC GTTCGCGCTG CGGCGGTTGC GGGCCGACCG GGTGCTCGCC
CTGGTGATGG TCCGGGATCT GGTGGACGCG CAGCTTCCGG ACGGGCTGCG CCGGCTGCTG
ATGGACGAGC GGACCGTCCG TCTCCTGCTC GAGGGGTTGG ACGACACCGA GCTGCGGGCC
CTGAGCTCCG CGCACGCCAC CGGGCCGCTG TCCTGGCGAG CCTGCGCCCG CCTGCGAGCC
CACACAGGCG GCAATCCACT GCACGCCCGT GCGCTGCTCG AGGAGGTCCC GACGGAGGCC
TTCGACAATC TCGACGTGCC GCTGCCGGTG CCGCGCTCGT TTGCGATGCT CGTCGTGGCG
AGGCTTGCGG CAGCACCGGT CCAGGGCCGG GAGCTGGCGA CCGCGGCCAG CGTCCTCGGC
GCGCATCCCA CCCTGGTCCA GGCTGCAGCG GTCGGCGGGG TGGACGAGCC GCTGTCTGCC
CTGGAGCAGG CGATCGCGGC GGGCCTGCTG GTGGAGCAGC CGGTCGCGTG GGGTCTTCGG
TTCCCGCATC CGCTGGTGCA CGCCGCGATC TACGAGCAGG CCGGCCCGGC GCGGCGCGCC
GAGCTGCACA CCCGTGCGGC GGCTCTCGCC GAGGAGGAGG CCCTGCGACT GCACCATCTG
GCCCGTGCCG CCACCGGGCC GGACGGCGGG CTGGCAATGG AGCTGGCCAG GCAGGGCCGG
GAGCAGGCGA CCGCGGGGGC GTGGGCCGCC GCCGCGAACC ATTTGTCCAC CGCCGCCCGG
CTCGCCCCGT CGCGCACCGA CTACGAGCAG CTGACGTTGG AGGCGGCGGA CTGCCAGCTG
CTGGCGGGTG ATGTGCCTGA TCCTCTCGGG AAGGTCGGTG AGGTTCGCGG TTTCCGCCCG
ACCGCCTGGC GTGACTACCT GCTGGGCCGC TTCGCCTTCC AGCGCGGTGA CGCGGACGAG
GCCGAGACAC GGCTGCGGAG CGCCTGGAAG CGCTGCGACC CGGACGGTGG TGCCGACGCC
GACCCGCTGC TCGGTGCTCG GGTCGCCGGC TGGCTCGCTG CTCTTTACCA GACGAAGCTC
CTCGGCGCGG AGTCCGCCGA GTGGGCCGGG CGGGCGCTGG CGTTGAGCCC CGGCCACGCG
GCGTTCGACC TCATCGGGCA GATCCGGATG GGCGGGCTGG CGATGAGCGG GCATCTCGAC
GCCGTCCTCG ATTCGGTCGC CGACCTGCCG GACCCCGCCG TGGCCTCCAT GACTGACCTG
GACATGCTCG CGGGCAGGGG ATCCCTGCAC GCTCTGGTCG ATGATTTCGC CGGCGCGCGC
CGGGACCTCG GCGGGGTCCT GGCCGCCGGC CGGGACCGCT CGCTCCTTTT CCGGGTGTCG
GTCACCACCG CGCTCGCCCA GGCCGAGTAC CGGGCGGGGC TCTGGGACGA TGCCGCGATC
CACTGCGATC ACGCGTTGTC GCTGATGGCC GACTCCGACG ACGGCGCTCT CGTCCCGTAC
TGCCATCAGG TCGGCGTCCT GGTACCCGCG GCTCGTGGCC ATTGGGTGGA GGCCGAGGAA
CACGTCCGGA CGACGCAGGC CTTCGCCGAA AGCGGGTTTC CGTATCTCGT CGCGAGCGCG
GCCATCGCCA CGGCCCACCT GGCACGGGCC CGTGGCCTAC CCGCCGAGAT CGTCGCGGCC
CTCGAACCGC CGGTCCGGTC CGGGCTGCTG GACCTTGATG CCGAGCCTGG AGTCTCTGGG
TGGCAGGACA CCCTTGTCGA CGCTCTGGTG GAGGTCGGTG ACCTCGGACG GGCCGAGGAG
GTGCTGATCC GGTTCGAGGC GGCAGCCAGA CTGCGTGGGC GCCGCGCTGC AATGGCCGCC
GCCGCCCGCA GCCGCGGCAA CCTGGAAGCA CGCCGGGACA ACGTGAGCGC AGCCGACAGG
GCCTTTCGCG CGGGCCTCGC CGAATGGGAA CACGTCGACC TGCCGTTCGA GCGGGCGCTG
CTCCATCACC TCTACGGTGC CTTCCTGCGC AGGGCCGGCC GGCCCGCCGA CGCCATCAGG
CAGCTCAGAG CCGCCCACGA CACCCTCAGC CTGCTGGACG CCCGCCCCTA CCTGGACCGC
TGCGAGAGCG AACTCGTCGC CGGCGGATGG GCACCTCGGG ACGGCTGCCG CCGGGAACCG
GACGCATCCC GCCTGACCAC CCCGGAGCTT GCCGTCGCCA AGCTGGTCGC CGCGGGTCTC
ACCAACCGTC AGGTCGCCCG GGAGCTCGTC CTCAGCGAGA AGACCGTCGA ACACCATCTA
CGTGCCGTCT TCGGGAAGCT GGATGTGACG GCGCGAACCC AGCTGTCTGG CAGGATCTTC
GCCGCGGCCG GAGGCCTCGC CCAGTGGAAT GGTGCCGACA GGGTGCTTCT CCGGTCCGAG
TGA
 
Protein sequence
MAGLRSVNRS ATEAARRLRR IEAQDADPGP APVSGLRPGP RSGPAAGALR DALLVDGGRA 
SFVGRPEALL QLRRAVADVL GGRPRVALLE GPAGIGKTAL ARHLLGEADA GCVLWASGEE
DETGLSFGVL DQLTAEAARV LGADQPSPSG PGELTGTAGS VRGPSPDPLV AGAALVELLG
ELQRSGPVVL VVDDAHWADR PSLQALTFAL RRLRADRVLA LVMVRDLVDA QLPDGLRRLL
MDERTVRLLL EGLDDTELRA LSSAHATGPL SWRACARLRA HTGGNPLHAR ALLEEVPTEA
FDNLDVPLPV PRSFAMLVVA RLAAAPVQGR ELATAASVLG AHPTLVQAAA VGGVDEPLSA
LEQAIAAGLL VEQPVAWGLR FPHPLVHAAI YEQAGPARRA ELHTRAAALA EEEALRLHHL
ARAATGPDGG LAMELARQGR EQATAGAWAA AANHLSTAAR LAPSRTDYEQ LTLEAADCQL
LAGDVPDPLG KVGEVRGFRP TAWRDYLLGR FAFQRGDADE AETRLRSAWK RCDPDGGADA
DPLLGARVAG WLAALYQTKL LGAESAEWAG RALALSPGHA AFDLIGQIRM GGLAMSGHLD
AVLDSVADLP DPAVASMTDL DMLAGRGSLH ALVDDFAGAR RDLGGVLAAG RDRSLLFRVS
VTTALAQAEY RAGLWDDAAI HCDHALSLMA DSDDGALVPY CHQVGVLVPA ARGHWVEAEE
HVRTTQAFAE SGFPYLVASA AIATAHLARA RGLPAEIVAA LEPPVRSGLL DLDAEPGVSG
WQDTLVDALV EVGDLGRAEE VLIRFEAAAR LRGRRAAMAA AARSRGNLEA RRDNVSAADR
AFRAGLAEWE HVDLPFERAL LHHLYGAFLR RAGRPADAIR QLRAAHDTLS LLDARPYLDR
CESELVAGGW APRDGCRREP DASRLTTPEL AVAKLVAAGL TNRQVARELV LSEKTVEHHL
RAVFGKLDVT ARTQLSGRIF AAAGGLAQWN GADRVLLRSE