Gene Information |
Locus tag | Franean1_4351 |
Symbol | |
ID | 5672706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5194317 |
End bp | 5197163 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243224 |
Product | cyclic nucleotide-binding protein |
Protein accession | YP_001508641 |
Protein GI | 158316133 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1225] Peroxiredoxin |
TIGRFAM ID | |
|
|
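The coordinates and lengths in the table above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 5194317
END_BP = 5197163


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 2847 bp
print(length_aa, "aa")  # 948 aa
```

Both results agree with the Gene Length and Protein Length fields reported in the record.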
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTCA GCACGCAGCC CCAGCAGCGG CAGGCCGCGC CGGCACCCGC GGCCACGGAG ACAGCGACAG TCACGGTCTG GGACCGGCTT GCCGACGCCG CCAACCCGGC CCGCTACCGG CCGAAGCGGC AGGACGGGCT GATCTGGCGC GAGTTGCGAT CCGCGCGGGG CGAGGAGTAC GTCATCGTCC AGAACCCCGA CGCCGCGACG TACCGCAAGA TCACGATGGC CGAGTTCTAC CTGTTCGAGC TGATGGACGG CAGCCGCAGC GTCCAGGACC TGGTCGTCGC CTACATGATG AAGTACCACC GGTTCGCCCT GCCGCTGGTC CTTCGCCTCG TGCGCAACCT GAAGGTCGGG CAGATGCTGA CCGACCCGCC CCGGTTCGTG TTCGGGCCGT TGGGCGAACA GCTGCGCCGC CGGCCAGTGA GCTCCTTCGC CACCGGGTTC GCGAAGTCCT TCGTCCGGCG CGAGTTTCCG CTGCACGGGC TCGACGGCCT GGTCGGCCGC GCCTACGACC GGGGCGTGTG GGTGCTTTTC ACCCGGCCGG CAAAGATCGT CATGTTCGCG GTGGCGATGC TCGGCGTACC GCTGCTGGCC TGGTCGCTGT CGTCGTCGGC GTCGGGGCTG GCCGACCAGT CCCTCGTCGT CACAGTTCCG ACAATGTACG GCTGCCTGCT GGTCGTCGCC GTGCTGCACG AACTGGCGCA CGCCTTCGCG GCCAAGTCCT ACGGCCGGGT GGTTCGGCGC GGCGGACTAT CGATCTTCTA CGGCTCGCCT GGGATGTTCG TCGACACCCA GGACATGTGG ATGGAGCCGC GCGGGCCGCG GATGGTCTCG GCCTGGGCCG GGCCGTTCTC CGGGTTCGTG CTCGCCGGGC TCTCCGGCAT CGTCCTGGTG GCCACGCCGG ACGCGCCGTG GGCGGCAGTG GTGGCGATCT TCGGCACCGC TGCCCTGGCC GTGAACCTGG CCCAGCTCAC GCCGCTCATC CAGCTCGACG GCTACTACAT GCTGATGGAC TGGCTGGAAC TGCCGAACCT GCGAGCCAGG GCGCTGGGGT TCATCCGGGG GGAGCTCCCC GGCAAGGTCC GTCGCCGGGA GCGGTTCGAC CGCACCGAAC GCGTTTTGAC GATCTTCGGA CTGGCCGCCG CGGCGTACAC CGGCTACGTG CTCGTCCTGG CAGCCGGGTT CGCCTGGGTC CGGGCGAAGA CGATGGTGAG CGACGCGGTC GCCGTGCCGA GCCTTGGCCG GGTCCTTGCC GCCACGGTGG TTCTGCTCCT GACCCTTCCG CTGCTGTACG TGCTCGGGCA GCGGCTGTGG CGGGGTGGCT CGTCGCTGGT GGTGGCGAGC CGGCGGCTGC GCCGGCTGGC CGCGGAGCGC CGCTACCGGG AGCGGGTCGC GGTACTCGCG AACGTACCGC CGGTCGCCGA ACTGGGAAAG GCGTACGTCC AGTGGATGGC GCGGCGGACC ACCGAGAAGG TGTTCCGGGC CGGCACCACG GTGGTCGCTG CGGACGAGGT CGCCGAGGTC TTCTGCCTGG TGCTGTCCGG CGAGGCCGAG GTCGTCGAAT CGGGCGCGAA TGCCGGGACG GGGCCGGGCG CGGGAAGTGC GCTGCGCACG CTCGGCCCGG GCGACTACTT CCCGCCGCCC GGATTGCCAA GGTCGCCCTT GACCGTGCGG GCGCTCACCG ACGTGCACGT GTTGCGGCTG GCCGGTGCCG ATTTCTCCGA CCGCCTCGCT CCGCTGCTGG CCCGACGGGC CGAGAACGAC ACACGGGCCG ACGAACGGGC GGAACTCGAG 
GGCTTCGCAC TGTTCGACGG CCTCCGCACC CGCGACAAGG ACACGCTGCT GGCCCACCTG CGGGCACGGA GCTGTGTGGA CGGCGAGGTC GTCGTCGCCG AAGGCGACCC CGGTTCGGCC TTCTACCTGG TTCGTAGCGG GGCCGTCGCG CTGGCACAGA CCTGCGTCGA CGGGCCGCCG CGAACCCTGG GCGCCGGGGA ATTCTTCGGC GAGAGCGCAC TGCTGCACGA CGAGCCCCAC GCCGCGACGG CCACGTCGGT CGGCGAGACC AGGCTGTGGG AGCTGGACCG GACGACGTTC GACGACGTGG TGTGCCGCTA TTTCGGGCTG TCCGACGCCG TACGCGAATC CGCCGAAGCG GGCGAAGCGA CGCGTGCCGC TGAGGCGCTG GCGGGCACGG TGGCCGGCCG CTGGCTTGAG ATCGAGGTCG GCGACCCGGC ACCCGGGTTC ACCCTCGACA CCGCCAGCGG GCCGTCGCCG GTGTCGTTGT CGGACTACCG GGGCCAGGTG GTGCTGCTCT GGTTCTCCCG CGGCTACAAC TGCCCGTTCT GCCGGGAGTA CATGGCCCGG TTGGCGCCGG CGGTCGGCGA TTTCGAGCGG GCCGGCGTGC AGATCCTGCA GCTGGCGCCC AACCTCGTCG ATTCCGCCCG CGAGTTCTGG CGCGGCAAGG ACCTGCCGTT CCCGTTCCTG TGCGACCCGG AGAAATCCGC CTACCGGCTG TGCGGCCTGC AGGACATCGG CGCCGGCGAA GCGCAACGCA ACTCGGTGCG GGGCTTCACG CGGGCGTTCA CCACCGGCCA GGGCCGTACG ACGATGCACG CGCTCTGGCT TGACGTGGTG AACCCGTCGA TCGGCGAACG GCTCGGCCAC CACACGATGA CGGCGATGCA GCAGGGCGTC TTCCTGGTCG GTCCGGATGG CGTGCTGCGC CGCAAATACG TCTTCGGCCC ACTCGACGAA CCGCCGTCGA ACACCGAACT CCTCGAAGCG GCCGCCGAAC TGGGATCGAT CGAGCTGGCG ACAACACAAC TGGAGGCGGG CTCGTGA
|
Protein sequence | MTVSTQPQQR QAAPAPAATE TATVTVWDRL ADAANPARYR PKRQDGLIWR ELRSARGEEY VIVQNPDAAT YRKITMAEFY LFELMDGSRS VQDLVVAYMM KYHRFALPLV LRLVRNLKVG QMLTDPPRFV FGPLGEQLRR RPVSSFATGF AKSFVRREFP LHGLDGLVGR AYDRGVWVLF TRPAKIVMFA VAMLGVPLLA WSLSSSASGL ADQSLVVTVP TMYGCLLVVA VLHELAHAFA AKSYGRVVRR GGLSIFYGSP GMFVDTQDMW MEPRGPRMVS AWAGPFSGFV LAGLSGIVLV ATPDAPWAAV VAIFGTAALA VNLAQLTPLI QLDGYYMLMD WLELPNLRAR ALGFIRGELP GKVRRRERFD RTERVLTIFG LAAAAYTGYV LVLAAGFAWV RAKTMVSDAV AVPSLGRVLA ATVVLLLTLP LLYVLGQRLW RGGSSLVVAS RRLRRLAAER RYRERVAVLA NVPPVAELGK AYVQWMARRT TEKVFRAGTT VVAADEVAEV FCLVLSGEAE VVESGANAGT GPGAGSALRT LGPGDYFPPP GLPRSPLTVR ALTDVHVLRL AGADFSDRLA PLLARRAEND TRADERAELE GFALFDGLRT RDKDTLLAHL RARSCVDGEV VVAEGDPGSA FYLVRSGAVA LAQTCVDGPP RTLGAGEFFG ESALLHDEPH AATATSVGET RLWELDRTTF DDVVCRYFGL SDAVRESAEA GEATRAAEAL AGTVAGRWLE IEVGDPAPGF TLDTASGPSP VSLSDYRGQV VLLWFSRGYN CPFCREYMAR LAPAVGDFER AGVQILQLAP NLVDSAREFW RGKDLPFPFL CDPEKSAYRL CGLQDIGAGE AQRNSVRGFT RAFTTGQGRT TMHALWLDVV NPSIGERLGH HTMTAMQQGV FLVGPDGVLR RKYVFGPLDE PPSNTELLEA AAELGSIELA TTQLEAGS
|
| |
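The sequence fields above are displayed in space-separated blocks of ten characters. The following is a minimal Python sketch of how such a field might be normalized and a simple property such as GC content recomputed. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def normalize_sequence(field: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(field.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the last three bases form a stop codon (TAA, TAG or TGA)."""
    return dna[-3:] in {"TAA", "TAG", "TGA"}


# First two ten-base blocks of the gene sequence from this record.
fragment = normalize_sequence("ATGACGGTCA GCACGCAGCC")
print(len(fragment))         # 20
print(gc_content(fragment))  # 0.65 (13 of 20 bases are G or C)
```

Applied to the full Gene sequence field, the same functions could be used to compare the recomputed GC fraction against the GC content field and to confirm that the sequence terminates in a stop codon.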