Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0119 |
Symbol | |
ID | 3903449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 144898 |
End bp | 148125 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637877452 |
Product | signal transduction protein |
Protein accession | YP_479242 |
Protein GI | 86738842 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5635] Predicted NTPase (NACHT family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.955952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGGGC AAGGGAACGA GGCGGGCAGC CTGCATCGAG CCGGTGTCGC CGCTTGGCTG GTGTCGCATG GAATTAATGG AAATCCTATT CCTCTGGACG ACACGGCGAA TTCTGGAACG ATTCAGAGCG TCGACTTTGA AACAAATGAT GCTATCGACG ACATACATTG TAAGCTAACG GACGGCCGTA CACTGCTAGT ACAGGCGAAG CGAGCTTGCG GTGACGATCG ACATTTGAAG GATACCGTCT ACCAGTGGGC ACGCCAACGG CCCGAAGCCG GTGTAATATT CGGCCTCGCT GTTGCTGAGC CGAAAGGACC AGTTCGGCAT CTCCGTACGG CCTTAAACCG GCGTAGGAGG TCGATACCAG GAAACCCAAG CTCTGGAGAG AAAAGCGCGC TGGCAGCACT CGCCAAAGCA TTCCCTGTAG ACATCTCAAC CTCCCGACGC GAAGAAATCT TGGACTCCGC TTATGTGCTG GTCGTCGATG CGCATGAGCC GGGGTTGCCC CATTTTCAAG CTGCAGCTTC CTGGTTGGTA GCTGCAGGTG TTCCACATGA GCGCGGGGCC GAAGCATTCG AGGCATTGAG ATCGCGGCTT CAGGCCGACG CGGCGAAGGC ACGCGGGAGT GATCTGGACG ATTGGCTCGA AATACTTGCT AAGGCTGGAT ACCCGCTCGC AGAGGAATAT CGTGGGTCAC GCGGCCAGCG CCGCCAGTTC GACTTGGGAG TTCTGGCGCG ATATTGCGAC CGCTTTCAAG CCGACCGCGG AATGCTATCT TTCGCGCTAC TCGCCGATGA CCTACCGCCG ATGAAGGTCG AGGGCCTGGC GAAATCATTC ACGGTTAGTT GGTCGTCGAC CTTCGAAAAT CGGAAGTATG CTTCGGAACC CTTGCTTGAC GTCGCTCGGC GCTGGGGCCG GCTAGTGGTT ACCGGTCTGC CTGGTTCTGG AAAGTCGACA GCCATAAGAC AGCTTGCGGC GGAATGGGCA GCCACGTCGG ACGCGCCCAT ACCTATTTTG GTGTCGCTAA AGGTCGTAGC CGAGAATCCT CCCGAGCCGG CTTCATTCAC TCCGGCTATG CTCGTCGATG CCGCGCTTCC TAAGCTCGGA GGAGACGAGA GGAGAGCCTT GCACGCCACT CTACTGGCTA AAATCGGAGA CGGAGACGCT GCGCTTCTCC TCGATGGCCT TGATGAGTGC GGGACCGCAA CCAGCACGAT AGCTGATGGA ATAACTCGAC TCCTCGAACA GCTACACCCG GATACCGACG TAATAGTTAC GGCGCGAGGG AGCGCCCTTC CTGCCGCTTC GAAGACAGGA CTTCCTACCG TTGAGTTGAC TGAACCGCAA GGTCTCACGT CTCAGCTTGA CAATCTGATC CGCCATGCGG CAAAAATTCG AGCGAACGGC CAGCAAGAGG CCGAGTGGGC GGCCGCGCGT ATCTCGTGGC TAGAGTCCGT CCGCCGCGCG GATGGTCATG GTGGCGGTCC GTTGCTGGTT CGATCAAACA GCCTTTGGCA GGTTCCGCTA CTTGCGACAA TGGTTACACT TCTGGCGACA TTGCGGCCGA CCGAAAAGAT TCCTACCAAT AGATCACTTT TGTTCATGGC GGCAGTGGAG GAATCGGTCA CCAAATGGGA ATCACGGCGT GCCATACAGC GCGTGCCATG GAACGTAAGT GACCCTCGGA TGCTCGTCGA CGGCTTTGCC GTGATCGCGC ATGAGCTCGC CCATCGTAGC GGTCGCTTGT CGGTTGAAAT CGCTAGAAAA TCGATCAGAG ACCGCGCGGT TCAATATTGG GGAATGTCGC GGGGGCCAGC AGAGTCGGCC TCTGCAGACA TTGTTAGGTT CTGGGACGAA GTTATCGGGG CCTTTCCGAG AAATTCAGAT GGAATGCTTC TTTCGCGAGC GCAGTATCTT GTGGATATTG GCGACGCGAT GTGGGCCGCT CACGTTCTCG AAGACGAAGA GGAAAGAAAG GCTTGGGTGA ATCGTGCGCT CGAGGATCCG AACCGCCGCG AGGCTTTTAT TCTAGCTGTT TTGTACGCGC CGTCGATTTT GCATAGTATC GAAACCTTCC ATGGTAGTGG AAATGATCGC GGGCGAAGGG CGCTTGTTTG GGCCGCTGAC GCCATCAGAG AGGGTGGTAG ATCTCTGATG AATGTTGTCA CCCCCGCCCT GATGGCAGCA CTTGCAGATG CTGCAGGCAA AGGATTGGCC CTCCCCGATA TCGGTAGCAA CAGTGCAGGT CAGAGGAGCG GGCGAGACCC CAAGGAGTGG GAGTACGTCC TACGGCTGGC TGGTCTGCCG GTCGTCCCCC AACTCCGCAA GTTGCGGGAC GACTTGCTTG CCGGACTGCG GTTGAACAAG GAGCACCGTT TGGTGTCCCA AGCGCTCACT TCACTTACCG ATGCGGTGGT AGACGGGCAT ACCGAAATAC CAAACTCTTC CTTGGCTGCG CTTAAAGAAA TGCTGGCCGT TCCGGCGCCA CGGCGAGTAA ATCGTCCACC GCGCAGAAAT TCGCGTGGAG TTATGAGCTT CACTCTACCG GAACGCCGGC GTCCGATAGG ATATGCCGAC GTGCTCAAAC TTGCAGCCGA GTACGCTGGA AGTCTCGATT CATCGACCCG AACGGCTATC TACAGCGAGG CTCGGGAAGC TAGTGTTCGC GACTATTTTG AAATCGTAGC CGTTTTGCAG GCAAAGGGAT TCTCTGATCC GCAACCATTT TTCAGAGTCC CTGAACTTAT TCGCCTAACC GAGGAGTTTC CCAGTCATTG GGCGGCTGTC GAATGGCTGT TTCGCCCTAT GGAAAATATC TCGGAAGGTA CGCAGCTGCG GCGGGTGCAG CGGTGGAGAC CCGCCGAGAT TCTAGAGTTC ACCGAACTCA TCGGTCTACG TGCATCTGGT CTTGATGATC TGAGGGCCGG CAGGAATGAG GCCACGACGA CCATAGAATC TCTAGTTAAA TGTGTTTGCT CATCCTATAG AATTTCGGAA TCTCATATCG CGGCTCAAGC GGCCTACATT GCAAGTATCG CATCGACTGG GGACCAGCAG ATTGTCCGAA TTATTCTGGG GATTCCCCCT GCTCGCGAAG TGCGTTTCGT AGTGCCAAGT ACGGAATTTC GGCTCGAGGG CCTTGAGTCT TTGCTTGCTG GGTACGCAAG TACATGCCAG ATACTGAGCC TTTTCAATCT CTGTTTGTCT GATTCTTTCG GCCGTTGA
|
Protein sequence | MSGQGNEAGS LHRAGVAAWL VSHGINGNPI PLDDTANSGT IQSVDFETND AIDDIHCKLT DGRTLLVQAK RACGDDRHLK DTVYQWARQR PEAGVIFGLA VAEPKGPVRH LRTALNRRRR SIPGNPSSGE KSALAALAKA FPVDISTSRR EEILDSAYVL VVDAHEPGLP HFQAAASWLV AAGVPHERGA EAFEALRSRL QADAAKARGS DLDDWLEILA KAGYPLAEEY RGSRGQRRQF DLGVLARYCD RFQADRGMLS FALLADDLPP MKVEGLAKSF TVSWSSTFEN RKYASEPLLD VARRWGRLVV TGLPGSGKST AIRQLAAEWA ATSDAPIPIL VSLKVVAENP PEPASFTPAM LVDAALPKLG GDERRALHAT LLAKIGDGDA ALLLDGLDEC GTATSTIADG ITRLLEQLHP DTDVIVTARG SALPAASKTG LPTVELTEPQ GLTSQLDNLI RHAAKIRANG QQEAEWAAAR ISWLESVRRA DGHGGGPLLV RSNSLWQVPL LATMVTLLAT LRPTEKIPTN RSLLFMAAVE ESVTKWESRR AIQRVPWNVS DPRMLVDGFA VIAHELAHRS GRLSVEIARK SIRDRAVQYW GMSRGPAESA SADIVRFWDE VIGAFPRNSD GMLLSRAQYL VDIGDAMWAA HVLEDEEERK AWVNRALEDP NRREAFILAV LYAPSILHSI ETFHGSGNDR GRRALVWAAD AIREGGRSLM NVVTPALMAA LADAAGKGLA LPDIGSNSAG QRSGRDPKEW EYVLRLAGLP VVPQLRKLRD DLLAGLRLNK EHRLVSQALT SLTDAVVDGH TEIPNSSLAA LKEMLAVPAP RRVNRPPRRN SRGVMSFTLP ERRRPIGYAD VLKLAAEYAG SLDSSTRTAI YSEAREASVR DYFEIVAVLQ AKGFSDPQPF FRVPELIRLT EEFPSHWAAV EWLFRPMENI SEGTQLRRVQ RWRPAEILEF TELIGLRASG LDDLRAGRNE ATTTIESLVK CVCSSYRISE SHIAAQAAYI ASIASTGDQQ IVRIILGIPP AREVRFVVPS TEFRLEGLES LLAGYASTCQ ILSLFNLCLS DSFGR
|
| |