Gene Information |
Locus tag | Franean1_0369 |
Symbol | |
ID | 5668793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 438959 |
End bp | 440494 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239301 |
Product | protein serine/threonine phosphatase |
Protein accession | YP_001504741 |
Protein GI | 158312233 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0631] Serine/threonine protein phosphatase |
TIGRFAM ID | |
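The derived fields in this table can be cross-checked against the coordinates. The sketch below is a minimal illustration, assuming that start and end are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between sequence blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    length = gene_length_bp(438959, 440494)
    print(length, protein_length_aa(length))
```

With the values listed above, the coordinates give 1536 bp and 511 aa, which agrees with the Gene Length and Protein Length rows. The strand does not affect the length calculation.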
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0013966 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00252542 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGACCTCA ACCCCCGGCA CGGGCAGGGC TACCCCGGGC ACGGCTATCC CCCCGAACGC CCGGACTCGC CGCACCCGAA TGCCGGCTAC GCCGAGAACT CCACCCACCC CGACGGATAC CCAGGTGACG ATCTTCCGAC CGAGAGCTAC CGGGCGGAGG ACTTCCGAGC CGATGATCAG CCTACCGAGA GCTACACGCC GGACCAGTTC ACCGGGTCGG CCTACCCGGA GAGCGGCTAC GCCGACAACG GCTATGCGGA GAACTCCTAT CCGGACGGCG CCTATCCCGG CGCCGGCTAC CCGGCGGGCA CGGGTGACTA CCGGGAAGGC GACTATCACC ACGAGTACCC CGAATCCGCG TACTCCGAAC CTGGCTACCA GGGTGAGACG TACTCGGACG ACACCTATTC GGGCAGCGGA TTCAACGAGG ACGGATACGC GGAGGACACC GATCCGGCCA CGATGCTCCT GGGCCAGCGC GCCGTCCCGG GCGAGGCGGA CGACGACGCG CACGACGAGG TGATGCGCTG CCCGGTGTGC GCGGCACCCA ACTACCAGGA CGCCCGGTAC TGCGAGGCCT GCGGTTTCGC CCTGACCGGC GTCCCGGTGG CCGGCGGCGA CGAGGACCAC CGCGAGATCG ACCTCGGCCC GCTCGCCGGC GTGTGTGACC GCGGCGTGCG GCATACCACC AACGAGGACT CGATGGGCGT GGCGATCGTC CACGGCACGC TGATCGCGGT GGTCTGCGAC GGGGTCTCGA CGACCCCGGG CTCCGGGGAG GCGTCCGCCG CCGCCGCGGC GACCGCGACG GCCATCCTCG CCGACGCGGT GCGCGCCCAC GGGCCCGGCG AGGCCGTCCC GTATGACGAG CGGCCGGCGA GCCGGGCGAA CGACGTCCTC GACGTTCTGG AGGAGTACTC CGAGTACGAC GAGCCGCGCA CCGCTCCGCG GCGGATCTCC GGTGGTTTCG GCCCGGACGA GGCCGACATC GCCCTGCAGC GGGCCGTGGA GGCCGCCCAG GCCACGATCG CCCAGCTCAG CGCGGCGGAG GGCCGGATGG CGCCGTCCTG CACCTTCGCG GCGGCGATCA TCACGCCGCC GTCGGTGGAC GGTCCCGGCA TGGTGACCGT CGGCTGGGTC GGCGACAGCC GCGTCTACCT GCTCGGCCCG CGCTGGTGCG AGCGGCTCAC CGCGGACGAC ACCTGGGCGG CCGAGGCGGC CCGGGCCGGG CTCATCCCGG CGTCGGAGGC GGAGACCCAC CGGCGCGCGC ACACACTGAC GCGCTGGCTC GGCGGCGACG CCGAGGACGT CACCCCGCAC ACCGCCGCCT TCCCGATCGA GACCCCGGCC ACCGTGCTCG TCTGCAGTGA CGGCCTGTGG AACTACTCCT CGCGCCCGGA CGTCCTCGCC GCGCTGGTGA ACCAGCTCGA GCCGAGCGCG TCCGCGCTGG ACGTCTCCCG CCACCTCATC GACTTCGCCA TCGACCAGGG CGGGCACGAC AACATCACCG TGGTCGCGGC GAGGGTGACG AGCTGA
Protein sequence | MDLNPRHGQG YPGHGYPPER PDSPHPNAGY AENSTHPDGY PGDDLPTESY RAEDFRADDQ PTESYTPDQF TGSAYPESGY ADNGYAENSY PDGAYPGAGY PAGTGDYREG DYHHEYPESA YSEPGYQGET YSDDTYSGSG FNEDGYAEDT DPATMLLGQR AVPGEADDDA HDEVMRCPVC AAPNYQDARY CEACGFALTG VPVAGGDEDH REIDLGPLAG VCDRGVRHTT NEDSMGVAIV HGTLIAVVCD GVSTTPGSGE ASAAAAATAT AILADAVRAH GPGEAVPYDE RPASRANDVL DVLEEYSEYD EPRTAPRRIS GGFGPDEADI ALQRAVEAAQ ATIAQLSAAE GRMAPSCTFA AAIITPPSVD GPGMVTVGWV GDSRVYLLGP RWCERLTADD TWAAEAARAG LIPASEAETH RRAHTLTRWL GGDAEDVTPH TAAFPIETPA TVLVCSDGLW NYSSRPDVLA ALVNQLEPSA SALDVSRHLI DFAIDQGGHD NITVVAARVT S
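The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which GTG is an accepted initiation codon and the initiating residue is methionine. The following is a minimal sketch of that translation, assuming the listed gene sequence is already given 5' to 3' in the coding orientation. The function name `translate_cds` is hypothetical, and the codon table is written out by hand rather than taken from a library.

```python
# Codon table in the conventional TCAG ordering; amino-acid assignments
# for translation table 11 are the same as in the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"               # initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence listed above.
    print(translate_cds("GTGGACCTCA ACCCCCGGCA CGGGCAGGGC"))
```

Applied to the first 30 bases of the gene sequence, this yields MDLNPRHGQG, matching the first block of the protein sequence.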