Gene Francci3_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3038 
Symbol 
ID3904391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3605027 
End bp3607879 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content74% 
IMG OID637880358 
ProductLuxR family transcriptional regulator 
Protein accessionYP_482124 
Protein GI86741724 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCGC GAGCACGGTC GCCCGAGCGG CGCGCGTCGG GGACTCGCAC AGGTGGCCCG 
TGGTCATGGA CACCGGGACT CCCTCCGTCG AGTCGGGTAG TGGCTCGACA CGAGGCTCGC
ACTGGTCCGG CACCGCCCGC GTCGGTGAAA ACCACCTACA TCGGCGCGCA TACTGTTCCC
GTTGTGGGGA CACCGGTCCG GACGTTCGTC GGCAGGGGCG AGGAACGCCG CAGGCTCGAG
GTGCTGGTCG CGGGCGCCAA GAACGGCGAG GGCGGGGTGC TGGTGCTCCG CGGCGAGGCC
GGCATCGGCA AGAGCGCTCT GCTCGACCAC CTGCGGCGCT CGGCGCAGGG GATGCGGGTC
ATCGAGGCGA CCGGATCCCA GTTTGAGACC GAGCTCCCGT TCGCCGCGCT GCACCAGCTG
TGCGTCCCGG CCCTGGGCGA CCTCGCGGCA CTACCCCCGC CGCACCGCGC GGCACTGGAG
GCAGCGTTCG GCCTCGCCGA CGGCACACCA GAGATGTTCC GCATCGGCAT GGCGGCCCTG
GAACTGCTGG TCACCGCGGC GGGGTCGGAA CCGGTGCTGT GCCTCGTCGA CGAGGCCCAC
TGGCTCGACG ACGCATCGAC ACGGATACTG ACGTTCCTCG CGCGGCGCAT CTCGGCCGAA
CCGGTGGCGA TGGTGCTAGC CGCACGGCAC GGACTCGACG AGCTACCGAG CGTCGAGGTG
ACCGGGCTGG CCGACGACGA CGCCCGCCGG CTGCTGGTGG ACACCCGCGC CACGCTGGAC
GAGACCGTGC GGGACCGCGT GCTCGCCGAG GCCAGGGGCA ACCCGCTCGC GCTACTCGAA
CTGCCCGGCG CCGGCGGCTT TGCCCTGCCC GACGCGTCCT CGGTGCCCAG CAGGGTCGAA
CGCAGTTTCC AGGATCGTCT CGTGCCCCTG CCCGAGGACG TGCGATTGCT GCTGACCGTG
GCTAGCGCCG ACCCGACCGG TGACCCCGGC CTGCTGTGGG CCGCCGCCGA ACGACTCGGT
GTCGGCCCGG CGGCCGGCGC GCACGCCGAG GCGTCCGGGC TGGTCGAGCT CGGCCCGCGC
GTGCGGTTCT GCCACCCGCT CGCCAGGTCC GCGGTCTACC AGGCCGCCGC CGTCGAGGAC
CGCCACGCCG CGCACCAGGC GCTCACCGAG GTCACGGACC CGGAACGGGA CCCCGACCGG
CGGGCCTGGC ATCGCGCGCA GGCCGGTGCC GGCCCCGACG AGGACACCGC CGCGGAACTG
AACCGTTGCG CCACCCGTGC CGCGGCACGT GGGGGAGTGG CGGCTGCCGC GGCGTTCCTC
GCGCGGGCCG CTGCCCTATC CCGTGACCCC GCGCGCCGCA CCGAACGCAC ACTCGCGGCC
GCGCAGGCCC ACCTCGACTC CGGTGCACTC GACGCGGCGG ACGGCCTGCT CACCGCGGTC
ACGGTGGACG GGACCGACCC CGTCGCGCTG GCCAGGGTGG AGCTGATGCG CGGGCGGATC
ACGTTCGTGC GCCGGCGCGA CGGCGACGGT CCGGCGTTCA TGCTGCGCGC GGCCCAGCGC
CTCGCCGCGA CCGACCCACG GTGGTCGCGG GACTGCTTTC TCGACGCCGT CGAGATGGCC
CTGGTCGTCG GCCGGGCCAG CGGGGTCATG GACATGGTCG TCGAGGCAGC GCGCTCGGCG
CCGCCGGCAT CGGGCCCGCC GGACCTCCTG GACGCGCTGC TGCGGCTGGC GACGGAAGGA
CACCACGCGG CGGCGCGGCC GGTCCGCGAG ATCCTCGCCT CCCGTCCACA GTGGACACGG
CGACCGGCGC TCGCCGGGAT GCTCGCCGTG GAGCTCTGGG ACGCCGAGGC ACACGGCGTG
ATCACCGACT GGGTGCTGGC CTCCGCCCGA GAGTCGGGGT CGCCGCTCAC GCTCCGCCTC
GGTCTTGGCA TGGCGGCGGC CGGCGCGGTG CACACCGGTG ACTTCGCCGC CGCCACGTCG
GCGATCGCCG AGGAGGAGGC GGTGGCGGAC GCGGTCGGGG TGGAGCCGCT GGCGTATCCG
CAGCTGCACC TCGCCGCCTT GCGTGGCCGG GAGGCACAGG CGCGGGCGGT GATCGACAGT
TTCACCGCCG AGGCGACCGC GAGCGGCACC GGGCAGACGA TCGCCAACGC CGACTGGGCC
ACGGCCGTCC TGAGTAACGG CCTCACCGAC TACCCGGCGG CACTTGCCGC CGCCGAGGCG
GCCACCCGGC ACGGCGACCT GTTCGTCGCC GCGATCGCCC TGCCGGAACT GGTCGAGGCC
GCGGTCCGCT GCGGCGAGCA CGAGGTGGCC CGGTCGGGTT CGGTATCGCT CACCGAACGC
ACCGAGGGCA GCGGCACACC GTGGGCGCTG GGCGTCGGCG CCTACGCCCG CGCACTCGTC
ACCGGCGACG AGGACGACTT CGCCGCGGCC ATAGGTCACC TCGAGAAGAG CCCGCTCGCC
CCGTATCTGG CCAGGGCGCA CCTGCTCTAC GGCGAGTGGC TGCGCCGCCA GGGCAGGCGA
CGCGACGCCC GCCGGCAGCT TCGTACCGCC TACGACCGGT TCGCCGACAT CGGCATGGCG
GCCTTCACCG ACCGCGCCGC CGCCGAGCTG CGTGCGGCCG GCGCCGACGT GCGCGGCCGC
ACGTCGGGCA ACACCGACGA CCTCACCGCG CAGGAGACCC ACATCGCCCG CCTGGTCGCG
GACGGCGCCA CGTCAAAGGA GGTCGCGGCA CGGCTGTTCA TCAGCCCCCG CACCGTCGAC
GCCCACCTGC GCAATATCTT CCGCAAGCTC GGCATCACGT CCCGTCGCCA GCTCCGGGAC
CTGCCCGCAC TGCGTACCCC CACCGCCAGA TGA
 
Protein sequence
MPPRARSPER RASGTRTGGP WSWTPGLPPS SRVVARHEAR TGPAPPASVK TTYIGAHTVP 
VVGTPVRTFV GRGEERRRLE VLVAGAKNGE GGVLVLRGEA GIGKSALLDH LRRSAQGMRV
IEATGSQFET ELPFAALHQL CVPALGDLAA LPPPHRAALE AAFGLADGTP EMFRIGMAAL
ELLVTAAGSE PVLCLVDEAH WLDDASTRIL TFLARRISAE PVAMVLAARH GLDELPSVEV
TGLADDDARR LLVDTRATLD ETVRDRVLAE ARGNPLALLE LPGAGGFALP DASSVPSRVE
RSFQDRLVPL PEDVRLLLTV ASADPTGDPG LLWAAAERLG VGPAAGAHAE ASGLVELGPR
VRFCHPLARS AVYQAAAVED RHAAHQALTE VTDPERDPDR RAWHRAQAGA GPDEDTAAEL
NRCATRAAAR GGVAAAAAFL ARAAALSRDP ARRTERTLAA AQAHLDSGAL DAADGLLTAV
TVDGTDPVAL ARVELMRGRI TFVRRRDGDG PAFMLRAAQR LAATDPRWSR DCFLDAVEMA
LVVGRASGVM DMVVEAARSA PPASGPPDLL DALLRLATEG HHAAARPVRE ILASRPQWTR
RPALAGMLAV ELWDAEAHGV ITDWVLASAR ESGSPLTLRL GLGMAAAGAV HTGDFAAATS
AIAEEEAVAD AVGVEPLAYP QLHLAALRGR EAQARAVIDS FTAEATASGT GQTIANADWA
TAVLSNGLTD YPAALAAAEA ATRHGDLFVA AIALPELVEA AVRCGEHEVA RSGSVSLTER
TEGSGTPWAL GVGAYARALV TGDEDDFAAA IGHLEKSPLA PYLARAHLLY GEWLRRQGRR
RDARRQLRTA YDRFADIGMA AFTDRAAAEL RAAGADVRGR TSGNTDDLTA QETHIARLVA
DGATSKEVAA RLFISPRTVD AHLRNIFRKL GITSRRQLRD LPALRTPTAR