Gene Information |
Locus tag | Francci3_3038 |
Symbol | |
ID | 3904391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3605027 |
End bp | 3607879 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637880358 |
Product | LuxR family transcriptional regulator |
Protein accession | YP_482124 |
Protein GI | 86741724 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
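The length fields in the table above can be cross-checked against the coordinates. The sketch below is a minimal check in Python, not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values but are not stated by the record itself. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 3605027
end_bp = 3607879

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 2853 0 950
```

The result agrees with the listed Gene Length (2853 bp) and Protein Length (950 aa).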
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCGC GAGCACGGTC GCCCGAGCGG CGCGCGTCGG GGACTCGCAC AGGTGGCCCG TGGTCATGGA CACCGGGACT CCCTCCGTCG AGTCGGGTAG TGGCTCGACA CGAGGCTCGC ACTGGTCCGG CACCGCCCGC GTCGGTGAAA ACCACCTACA TCGGCGCGCA TACTGTTCCC GTTGTGGGGA CACCGGTCCG GACGTTCGTC GGCAGGGGCG AGGAACGCCG CAGGCTCGAG GTGCTGGTCG CGGGCGCCAA GAACGGCGAG GGCGGGGTGC TGGTGCTCCG CGGCGAGGCC GGCATCGGCA AGAGCGCTCT GCTCGACCAC CTGCGGCGCT CGGCGCAGGG GATGCGGGTC ATCGAGGCGA CCGGATCCCA GTTTGAGACC GAGCTCCCGT TCGCCGCGCT GCACCAGCTG TGCGTCCCGG CCCTGGGCGA CCTCGCGGCA CTACCCCCGC CGCACCGCGC GGCACTGGAG GCAGCGTTCG GCCTCGCCGA CGGCACACCA GAGATGTTCC GCATCGGCAT GGCGGCCCTG GAACTGCTGG TCACCGCGGC GGGGTCGGAA CCGGTGCTGT GCCTCGTCGA CGAGGCCCAC TGGCTCGACG ACGCATCGAC ACGGATACTG ACGTTCCTCG CGCGGCGCAT CTCGGCCGAA CCGGTGGCGA TGGTGCTAGC CGCACGGCAC GGACTCGACG AGCTACCGAG CGTCGAGGTG ACCGGGCTGG CCGACGACGA CGCCCGCCGG CTGCTGGTGG ACACCCGCGC CACGCTGGAC GAGACCGTGC GGGACCGCGT GCTCGCCGAG GCCAGGGGCA ACCCGCTCGC GCTACTCGAA CTGCCCGGCG CCGGCGGCTT TGCCCTGCCC GACGCGTCCT CGGTGCCCAG CAGGGTCGAA CGCAGTTTCC AGGATCGTCT CGTGCCCCTG CCCGAGGACG TGCGATTGCT GCTGACCGTG GCTAGCGCCG ACCCGACCGG TGACCCCGGC CTGCTGTGGG CCGCCGCCGA ACGACTCGGT GTCGGCCCGG CGGCCGGCGC GCACGCCGAG GCGTCCGGGC TGGTCGAGCT CGGCCCGCGC GTGCGGTTCT GCCACCCGCT CGCCAGGTCC GCGGTCTACC AGGCCGCCGC CGTCGAGGAC CGCCACGCCG CGCACCAGGC GCTCACCGAG GTCACGGACC CGGAACGGGA CCCCGACCGG CGGGCCTGGC ATCGCGCGCA GGCCGGTGCC GGCCCCGACG AGGACACCGC CGCGGAACTG AACCGTTGCG CCACCCGTGC CGCGGCACGT GGGGGAGTGG CGGCTGCCGC GGCGTTCCTC GCGCGGGCCG CTGCCCTATC CCGTGACCCC GCGCGCCGCA CCGAACGCAC ACTCGCGGCC GCGCAGGCCC ACCTCGACTC CGGTGCACTC GACGCGGCGG ACGGCCTGCT CACCGCGGTC ACGGTGGACG GGACCGACCC CGTCGCGCTG GCCAGGGTGG AGCTGATGCG CGGGCGGATC ACGTTCGTGC GCCGGCGCGA CGGCGACGGT CCGGCGTTCA TGCTGCGCGC GGCCCAGCGC CTCGCCGCGA CCGACCCACG GTGGTCGCGG GACTGCTTTC TCGACGCCGT CGAGATGGCC CTGGTCGTCG GCCGGGCCAG CGGGGTCATG GACATGGTCG TCGAGGCAGC GCGCTCGGCG CCGCCGGCAT CGGGCCCGCC GGACCTCCTG GACGCGCTGC TGCGGCTGGC GACGGAAGGA CACCACGCGG CGGCGCGGCC GGTCCGCGAG ATCCTCGCCT CCCGTCCACA GTGGACACGG 
CGACCGGCGC TCGCCGGGAT GCTCGCCGTG GAGCTCTGGG ACGCCGAGGC ACACGGCGTG ATCACCGACT GGGTGCTGGC CTCCGCCCGA GAGTCGGGGT CGCCGCTCAC GCTCCGCCTC GGTCTTGGCA TGGCGGCGGC CGGCGCGGTG CACACCGGTG ACTTCGCCGC CGCCACGTCG GCGATCGCCG AGGAGGAGGC GGTGGCGGAC GCGGTCGGGG TGGAGCCGCT GGCGTATCCG CAGCTGCACC TCGCCGCCTT GCGTGGCCGG GAGGCACAGG CGCGGGCGGT GATCGACAGT TTCACCGCCG AGGCGACCGC GAGCGGCACC GGGCAGACGA TCGCCAACGC CGACTGGGCC ACGGCCGTCC TGAGTAACGG CCTCACCGAC TACCCGGCGG CACTTGCCGC CGCCGAGGCG GCCACCCGGC ACGGCGACCT GTTCGTCGCC GCGATCGCCC TGCCGGAACT GGTCGAGGCC GCGGTCCGCT GCGGCGAGCA CGAGGTGGCC CGGTCGGGTT CGGTATCGCT CACCGAACGC ACCGAGGGCA GCGGCACACC GTGGGCGCTG GGCGTCGGCG CCTACGCCCG CGCACTCGTC ACCGGCGACG AGGACGACTT CGCCGCGGCC ATAGGTCACC TCGAGAAGAG CCCGCTCGCC CCGTATCTGG CCAGGGCGCA CCTGCTCTAC GGCGAGTGGC TGCGCCGCCA GGGCAGGCGA CGCGACGCCC GCCGGCAGCT TCGTACCGCC TACGACCGGT TCGCCGACAT CGGCATGGCG GCCTTCACCG ACCGCGCCGC CGCCGAGCTG CGTGCGGCCG GCGCCGACGT GCGCGGCCGC ACGTCGGGCA ACACCGACGA CCTCACCGCG CAGGAGACCC ACATCGCCCG CCTGGTCGCG GACGGCGCCA CGTCAAAGGA GGTCGCGGCA CGGCTGTTCA TCAGCCCCCG CACCGTCGAC GCCCACCTGC GCAATATCTT CCGCAAGCTC GGCATCACGT CCCGTCGCCA GCTCCGGGAC CTGCCCGCAC TGCGTACCCC CACCGCCAGA TGA
|
Protein sequence | MPPRARSPER RASGTRTGGP WSWTPGLPPS SRVVARHEAR TGPAPPASVK TTYIGAHTVP VVGTPVRTFV GRGEERRRLE VLVAGAKNGE GGVLVLRGEA GIGKSALLDH LRRSAQGMRV IEATGSQFET ELPFAALHQL CVPALGDLAA LPPPHRAALE AAFGLADGTP EMFRIGMAAL ELLVTAAGSE PVLCLVDEAH WLDDASTRIL TFLARRISAE PVAMVLAARH GLDELPSVEV TGLADDDARR LLVDTRATLD ETVRDRVLAE ARGNPLALLE LPGAGGFALP DASSVPSRVE RSFQDRLVPL PEDVRLLLTV ASADPTGDPG LLWAAAERLG VGPAAGAHAE ASGLVELGPR VRFCHPLARS AVYQAAAVED RHAAHQALTE VTDPERDPDR RAWHRAQAGA GPDEDTAAEL NRCATRAAAR GGVAAAAAFL ARAAALSRDP ARRTERTLAA AQAHLDSGAL DAADGLLTAV TVDGTDPVAL ARVELMRGRI TFVRRRDGDG PAFMLRAAQR LAATDPRWSR DCFLDAVEMA LVVGRASGVM DMVVEAARSA PPASGPPDLL DALLRLATEG HHAAARPVRE ILASRPQWTR RPALAGMLAV ELWDAEAHGV ITDWVLASAR ESGSPLTLRL GLGMAAAGAV HTGDFAAATS AIAEEEAVAD AVGVEPLAYP QLHLAALRGR EAQARAVIDS FTAEATASGT GQTIANADWA TAVLSNGLTD YPAALAAAEA ATRHGDLFVA AIALPELVEA AVRCGEHEVA RSGSVSLTER TEGSGTPWAL GVGAYARALV TGDEDDFAAA IGHLEKSPLA PYLARAHLLY GEWLRRQGRR RDARRQLRTA YDRFADIGMA AFTDRAAAEL RAAGADVRGR TSGNTDDLTA QETHIARLVA DGATSKEVAA RLFISPRTVD AHLRNIFRKL GITSRRQLRD LPALRTPTAR
|
| |
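The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following Python sketch shows one way to do that and to check a fragment of the gene sequence against the protein sequence. It is illustrative only: the function names are hypothetical, and the codon table is built from the standard amino-acid assignment, which translation table 11 shares with table 1 for non-initiating codons. Alternative start codons permitted by table 11 are not handled; the record's sequence begins with ATG, so this does not matter here. Although the gene is annotated on the minus strand, the listed sequence starts with ATG and ends with TGA, so the sketch assumes it is already given in coding orientation.

```python
# Codon table built in the conventional TCAG order (64 codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGCCGCCGC GAGCACGGTC GCCCGAGCGG"
fragment = clean_sequence(first_blocks)

print(translate(fragment))   # prints: MPPRARSPER
print(gc_percent(fragment))  # GC percentage of this 30 bp fragment only
```

The translated fragment matches the first ten residues of the protein sequence. Running `gc_percent` on the full cleaned gene sequence would give the value to compare with the listed GC content of 74%.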