Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0179 |
Symbol | |
ID | 5668604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 217922 |
End bp | 218809 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641239108 |
Product | HAD family hydrolase |
Protein accession | YP_001504552 |
Protein GI | 158312044 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1877] Trehalose-6-phosphatase |
TIGRFAM ID | [TIGR00685] trehalose-phosphatase [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.183714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.508891 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACT CCTCCGCGCC CTCGCCGCCC GCTCCCGCGG GTGCCCACGC CACCGCCGAG CCTGCCCCCG CCCCGCGCAC CCTGGCCGGC GGCGCCGGCC TGGCCGCCCT GCTGACGACG CCGGACAAGG CGCTCGTCGC CCTCGACTAC GACGGCACCC TCGCCCCGAT CGTCTCCCGG CCCTCCGACG CGGTGCCCGC GCCCGGCGCG ATGGCCGCGC TGGGGCGGAT CTCCCGCCGC GTCGGCACCG TCGCGATCAT CACCGGCCGG CCGGTGGACG CCGTCCTCGA GCTCACCGGG GCGGAGCGGT TCACCGATCT CGGGCACCTG CTCGTCCTCG GCCAGTACGG GCTGCAGCGG TGGGACGCCG AGACCCGGCA GACCACCGCC CCGGAACCAC TGCCCGGGGT GGAGGCGCTG CGCTCGGCCC TGCCGGACGC ACTGCACGAC GCCCCGGCCG GGACGTCGGT CGAGGACAAG CGGCACGCGC TGGTCGTCCA CGTGCGGCGG ACCGCCGACC CCGACGCCGC GCTGGCGGCG CTCACCCCGG CGCTGACCAG GCTCGCCGAG GAGTACGGGC TCGAGGCGGC GCCGGGCAAG CGGGTCCTGG AGCTGCGCCC GCCCGGTCAC GACAAGGGCC GTGCCCTGCG CGGGCTCGTC GCCGAGCGGG CGGCCCGGTC CGTCCTCGTC GCCGGCGACG ACTACGGGGA TCTCCCCGCC TTCGAGGCGG TCGACGAGCT GCGGGCCGGC GGCCTGGGCG CGATCACCGT GTGCAGCGAC AGCCCGGAGG TGCCCGACGT GCTGCGCGAG CGGGCGGATC TGGTCGTCAG CGGCCCGACC GGCATGGTCA CCCTGCTGGA GGTACTCGCC GACCGCCTCG GCGCCTGA
|
Protein sequence | MTDSSAPSPP APAGAHATAE PAPAPRTLAG GAGLAALLTT PDKALVALDY DGTLAPIVSR PSDAVPAPGA MAALGRISRR VGTVAIITGR PVDAVLELTG AERFTDLGHL LVLGQYGLQR WDAETRQTTA PEPLPGVEAL RSALPDALHD APAGTSVEDK RHALVVHVRR TADPDAALAA LTPALTRLAE EYGLEAAPGK RVLELRPPGH DKGRALRGLV AERAARSVLV AGDDYGDLPA FEAVDELRAG GLGAITVCSD SPEVPDVLRE RADLVVSGPT GMVTLLEVLA DRLGA
|
| |