Gene Information |
Locus tag | Franean1_4759 |
Symbol | |
ID | 5673101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5683830 |
End bp | 5685503 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243616 |
Product | diguanylate cyclase |
Protein accession | YP_001509032 |
Protein GI | 158316524 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
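The coordinates and lengths listed above are mutually consistent, and a short sketch can confirm this. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5683830, 5685503)
print(length)                  # 1674
print(protein_length(length))  # 557
```

Both values agree with the "Gene Length" and "Protein Length" rows.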
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.805923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGAAC TCCGTGGCGT CACGGGCGCC ACACACGGTT TGCGGCGTTC GACCGTTTCG GGCCGTACGC CGCAGTTGTT GTGGTGCGCG GCCGTCGCCG CCGTGGTGCT TTTTGCGCTG GAGGTCGCCC TCGCGGTGCT GTTGGGCCCG CGGCCGGGCG TCGTCGCCAC GATGTCGCTC AACGCCGCCG CCCAGGTGGC CGCCGCGGCG GCCTGCTTCT GGACCGCTCG GCGCACCCGC CCCGGTGACC GGCGCTGGCG GCGGCTCATC GGCGTCATGG CGGGCAGCTC GGCGCTCGCC AGCGTCGCGA CGGCGGCAGC CGTGCTGGGT GGTGAATCTC CCCGGGTGTC CTCGGGGTGG TACGCCGTCC TGCTCGTTTT CCACGGTGTG GCCCTGGCCG GGCTGCTGTC GCTGCCCACC GACCCGGTGG ACGGCCGGGC GGGGAGGCAG GGCCGCGGCT CGTCCCGCTG GTACGTGATC ACCATGCTGG ACTGTGTACT GATCGTCGGT TCGATCGTCC TGCTGGAATG GGGGACGGTG CTCGCCGCGG TCGTCCGGAC AGGTGCGCCC AACCTCAGAG AGTTCCTGCT CTCCCTGGTC CACGCGGTCT CCGTACTGAT CCTGGTCACG GCGGTGGTGC TGATCGCGAG CTTCCGCCGG CCGCGCTCCC CGACGACGTT GGCACTGCTC GGCACAGGTC TGCTCGCCTA CGGTCTCACC AGCAACGTCT TCGTCTACCG CGTCGCGCAG CACCAGCTCG ACATGCCGCC GTTGAGCGTG GTCGCATTCG GCCTCGCCTT CCTGCTGGTC TTCCTTGCCG CGCTGCTGCC GTTCCCGGCT CGCGCACCGC CGGACGGCCC GGCGCTGGAC GGCCCGGCGC CGGCCGGGCC GCAGGCGACG TGGGCGTACG TGGTGCTGCC CTACGCGGTG CTCGGTGCGG CGGGCCTGCT GGTGGTCGGC AAGCTGTCGA TCGGTACGCA GGTCGACCGG TTCGAGACGT ACGGCATCGC CGGGCTGCTG CTGGTCGCGC TGGTCCGCCA GATGGTCACG CTGGTCGAGA ACGCCCGGCT ACTCACCGGA ATCCGGGACC AGGAACGGGA GCTGCACCAC CAGGCCTTCC ACGACCCGCT GACCGGCCTG GCGAACCGGG CCCTGTTCAC CCGCCGACTA CAGCGAGCAC TCACGCGCGC CACCGACGAC GCCGGTGACG CCAGCGACGC CGGTGGCGGC ACGTCGCTGT CCGTGCTGTT TCTGGACCTT GACGAGTTCA AGCACATCAA CGACACCTTC GGGCATGCCG CCGGGGACGA GCTCCTCCGG ATCAGCGCGG AGCGGCTGCG GGCGGGAACA CGGACCGTGG ACACCGTCGC CCGGCTCGGC GGCGACGAGT TCGCCGTCAT CCTCGACGGC GACGGACTCC GCAATCCCCG CCGTGTCGGG GAGCGGCTCG CGGCAGCGAT ACAGGCACCG TGTCTGCTGG CGGGACGGCC TTACACCCCG CGAGCCAGCC TCGGACTGGT CACCCTCGAC TCCCCCGCCC GCCCGGCCGA CCCCGACGTC CTGCTCCACC AGGCCGACCT TGCGATGTAC GCGGCCAAAC GCGAACGGGC CGGCAGGCTG AAGGTCTACC GGTCGGACAT GAGCCCTCCG ATCGCCGCAC CGCCGCCCGG CTGA
Protein sequence | MSELRGVTGA THGLRRSTVS GRTPQLLWCA AVAAVVLFAL EVALAVLLGP RPGVVATMSL NAAAQVAAAA ACFWTARRTR PGDRRWRRLI GVMAGSSALA SVATAAAVLG GESPRVSSGW YAVLLVFHGV ALAGLLSLPT DPVDGRAGRQ GRGSSRWYVI TMLDCVLIVG SIVLLEWGTV LAAVVRTGAP NLREFLLSLV HAVSVLILVT AVVLIASFRR PRSPTTLALL GTGLLAYGLT SNVFVYRVAQ HQLDMPPLSV VAFGLAFLLV FLAALLPFPA RAPPDGPALD GPAPAGPQAT WAYVVLPYAV LGAAGLLVVG KLSIGTQVDR FETYGIAGLL LVALVRQMVT LVENARLLTG IRDQERELHH QAFHDPLTGL ANRALFTRRL QRALTRATDD AGDASDAGGG TSLSVLFLDL DEFKHINDTF GHAAGDELLR ISAERLRAGT RTVDTVARLG GDEFAVILDG DGLRNPRRVG ERLAAAIQAP CLLAGRPYTP RASLGLVTLD SPARPADPDV LLHQADLAMY AAKRERAGRL KVYRSDMSPP IAAPPPG