Gene Franean1_3799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3799 
Symbol 
ID5672163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4503917 
End bp4506775 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content75% 
IMG OID641242678 
ProductLuxR family transcriptional regulator 
Protein accessionYP_001508098 
Protein GI158315590 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00558742 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGGTG ATGATCGCCC GTTCGTCGGG CGGCTGGAGG AGTTGACGCT GTTGCGTTCG 
CGGGCGCGGA CGGCGGAGGC GGGTCGGGGC GGTGTCGTGT TGGTGACCGG CCCGGCGGGG
ATCGGCAAGA CCCGGTTGGT GGAGGAGGCG TTGCGGGGGG GTGCGAGGCC CCCGGCCAGC
CCGCCGGCGG GCGTGTCGGC GGCGGTGCGG GTCGGCCGTG GCTACAGCCT CCCCGACCAT
GCCGCTCCGG CGCTGTGGCC CTGGCAGCAG GCGCTGCGCG CCCTCGCTCG CCGCAGCGGG
TGGCCGGCGA GGATCTCCTC GGCCGTGGAT TTGCTGGAGG GGGCCGGAGG CTGCGACGCA
CCTGGCGGCG CTCCTGGTGA CGTGGCGAGC GCGGCCGCCG TGCGGTTCGC CGCCCTGGCC
GCGGTCGCCG ACGCGGTGCT GACGGCCGCC GCCGAACCAC TGGTGATCGT GTTGGAGGAT
CTGCACTGGG CCGACACGAA CACCGTCGAG CTGGTGCGGC AGGTCGCCGG CGGGATCGTC
GACAGCCACC TGCTGCTGAT CGCGACGATG CGCGCCGAGG CCGGCGACGT CATGCGTGAG
CTGTCCCGGC TGGGCAGCGT CCACGCCGTG CCGTTGCGGC CGCTGACAGT GGCCGCTGTC
GGGGACTATC TCACCGCGCT CGACCCGGCG GTCTCCTCCG CCGCCCGCGC GGAGCAGGTG
GCGCGGCGCA GCGGCGGGCT GCCGCTGCTG CTCGACGCGG CGGCGTCCGA CGCCGCCGCC
CCCGAACCGG TCGTCGTCCG TGACCTCGTC GAGGCACTGC TGGCTCAGCT TCCGGAGCCG
GCGCGGCGCG TTGTCCAGGT CGCCGCGTTG CTGCGCTCGC CGGTGGACAC CGACGTGGTC
GGCCAGGTTG CCGAGGTCGA CGGGTCGATG GCCGGGCTGG CGCTCGACGC GGCGTGCCGG
GCCGGTCTGC TGACGCCGGT GACCTCGTGG GCCGGGGTCG AGGGACTGGG GTTTGCCTTC
ACCCACGGCC TGCTGGCCGA GGGGCTGGCG GACCTTCTCG ACCCGGTCGA GGGGCGCGAG
ATCCACCGGC GTGCCGCCAT CACGTTGCAG GTGCGACCGT CGGCCTTACC CGGGCTCGCC
GCCGCGGTCG CCGGTCACTG GCGCCGTGCC GGCGGCGACG AGCAGGCACA GCGCTGCGCG
GCCCGGTGGG CGAGGCAGGC CGCTGTCGAG GCCGTCCAGG CATTGGCCTA TGACGAGGCG
GCCACGCACC TGCTCACCGC GGTAGCGGCG CTGCGGCAGG CTGGTGCCGG TGAGGCGGAA
CTCGCGCAGA TGACGTTGGA GCTGGCGCGC GCCTACTACC TGGCCGGAGC GCTGACCGAA
GCGCTCCGGC AATGCGAACA GACCGCTGCG GCGGCGCAGC GGGCAGGCAG GGACGACCTG
GTGGCCGCGG CGGCGCTGGT GGTGCGTGGG GTGACCTTCC CGCAAGCCAT CGAGGCGATC
TCCCGGCTGT GCCGCACGGC GCTCGCCGCA CAGCAGCCGG CCGCATTGCG GGCCCAGCTG
CTGTCCCAGC TCGCCACGAT CCTGGGCGAG GCCGGCTCCA CGGAGGCGGC CCGCCGGCAC
GTCGACGAGG CGATGGAGCT CGCGCAGGCG AGCGGCGACG CCCAGGCGTT GCTGGATGCC
GCCCGCGCCC GGGAGATGTC GCTGGCCGGT CCGGACGACG CGGAGGAACG GCTGCGGCTG
GCCGCGACCG CCGTCGAGCA GGCCGATCGG CTCGGCCAGC CGCTGGCCGC CGTACTCGGG
CAGCAGTGGC GGATCAGGGC CGCCTACCAG CTCGGTCGTC TCGACGTCGT GGACGAGGCG
ATGGCCGCCG TCGCGATGCT GGCTGACCGC AGCGGCCTTC CGTTGGCGAG CTGGCATCGG
TTGCGCGGCG CGGCGGCCCG AGCCGCGTTG GAAGGCAGGT TCGCCGCCGC GCGGTCGGCC
AACCTTGAGG CCCAGCGGCT GGGCTCGCGG GCCGGAGACG CGCAGGCGGC CGGCCTGAGC
TACGCCTTCG CCGCCCACCT GGCCATGCTG CGAGGGGACG CCGCCGAACT CCCGGACGAC
TTCTGGCCGA TGCTGGAAGC ATTCCCGTCG TCGCCGCTGA TGACGGTGTT CCGTGCCAAC
GCTCTCCGAC TCGAGGGCCG CCGCGAGGCG GCGAAGGCAT ACTACGAGCA GCTGCGTCAG
ATGCTCGACG ACCCGGTCGT CGATCTCCGC TGGGGCGGGG TACTGACGCA GCTGATCGAC
CTGGTCGAGG CATTCGGGGA CCCTCCGGCA GCGGAGGTGC TCGCCACCCA CCTGAAGCCC
TACGCCGCGT ACTCGGGGGC GGTCGGAACA CCGACGGTCT TCTTCGTTGG CTGTGCGCAG
GGCCACTACG GGCGCGCGCT GGGCCACGCG GGAGAGCTGC GCGACGCCAA GACGGCGCTA
CGGACGGCGA TCACGCGTGA CGCCGCGCTC GGCGCGCGCC CTCACATCGT GCTCAACCAG
CTGGCCCTCG CCGACGTGCT GCGCCGCCTC GACGACCCTC ACGCCGCCGT GACCCTCGCC
GGCGCGGCCG CGGCCGAGGC TCGCCGGCTG GACATGCCCG GGCCGCTGGC CCGCGCCGAC
CAGCTGCTCG CCGACCTGGC GGCCGCCCGC CGAGAGACCG ACCCACTGAC TGCCCGCGAG
CACGAGGTGG CCGCGCTCGT CGTCCAGGCC ATGTCCAACC GGGAGATCGC GGCACGGCTG
GTACTCAGCG AACGCACCAT CGAAAGCCAC GTCCGCTCAA TCCTCGCCAA ACTCGGCTAC
ACCAACCGGA CCGAGCTGGT CGCACGCTGG CGGCCCTGA
 
Protein sequence
MRGDDRPFVG RLEELTLLRS RARTAEAGRG GVVLVTGPAG IGKTRLVEEA LRGGARPPAS 
PPAGVSAAVR VGRGYSLPDH AAPALWPWQQ ALRALARRSG WPARISSAVD LLEGAGGCDA
PGGAPGDVAS AAAVRFAALA AVADAVLTAA AEPLVIVLED LHWADTNTVE LVRQVAGGIV
DSHLLLIATM RAEAGDVMRE LSRLGSVHAV PLRPLTVAAV GDYLTALDPA VSSAARAEQV
ARRSGGLPLL LDAAASDAAA PEPVVVRDLV EALLAQLPEP ARRVVQVAAL LRSPVDTDVV
GQVAEVDGSM AGLALDAACR AGLLTPVTSW AGVEGLGFAF THGLLAEGLA DLLDPVEGRE
IHRRAAITLQ VRPSALPGLA AAVAGHWRRA GGDEQAQRCA ARWARQAAVE AVQALAYDEA
ATHLLTAVAA LRQAGAGEAE LAQMTLELAR AYYLAGALTE ALRQCEQTAA AAQRAGRDDL
VAAAALVVRG VTFPQAIEAI SRLCRTALAA QQPAALRAQL LSQLATILGE AGSTEAARRH
VDEAMELAQA SGDAQALLDA ARAREMSLAG PDDAEERLRL AATAVEQADR LGQPLAAVLG
QQWRIRAAYQ LGRLDVVDEA MAAVAMLADR SGLPLASWHR LRGAAARAAL EGRFAAARSA
NLEAQRLGSR AGDAQAAGLS YAFAAHLAML RGDAAELPDD FWPMLEAFPS SPLMTVFRAN
ALRLEGRREA AKAYYEQLRQ MLDDPVVDLR WGGVLTQLID LVEAFGDPPA AEVLATHLKP
YAAYSGAVGT PTVFFVGCAQ GHYGRALGHA GELRDAKTAL RTAITRDAAL GARPHIVLNQ
LALADVLRRL DDPHAAVTLA GAAAAEARRL DMPGPLARAD QLLADLAAAR RETDPLTARE
HEVAALVVQA MSNREIAARL VLSERTIESH VRSILAKLGY TNRTELVARW RP