Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1749 |
Symbol | |
ID | 5670151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2098217 |
End bp | 2099317 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240670 |
Product | HAD family hydrolase |
Protein accession | YP_001506093 |
Protein GI | 158313585 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01457] HAD-superfamily subfamily IIA hydrolase, TIGR01457 [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0343675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000628636 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGGTCC CCCGAGAGGC GGCGGGCACG GCGTGGTCCG TGCTCGCCGG AACGAGCCGG CCACTGGTGG CGATGTTCGA CGTCGCGTTG ATGGATCTCG ACGGGGTGGT GAACCGCGGC GAGCGGGCTG TGCCGCACGC CGCCGCGGCC ATCGAGGCCG CGGGCCGGCA GGGGATGCGC ACCGTCTACG TGACGAACAA CGCGCTGCGG ACCCCGGAGA CCGTCGCCGC GCGGCTGACG GGTTTCGGCG TGCCAGCCGA ACCGCCGGAG GTCGTCACCT CGGCGCAGGC GGCGGCGCAC GTGCTCGCCG AACGGCTACC GGCCGGGGCG GTGGTCCTGG TCGCCGGAGG TGTCGGGCTC CGGGAGGCGG TGCGCGCGGA GGGCCTGGTC CCGACCGGGT CGGCCGCGGA CGAGCCGGCC GCCGTGGTCC AGGGCTTCGA TCCGGAGATC AACTATGCCC GGCTGGCCGA GGCGGTGCTG GCGATCCGGG CGGGGGCGTG GTGGGTCGCG AGCAACACCG ACCTGACGGT GCCGACGGAG CGTGGCCTGG CGCCCGGTAA CGGGGCGCTG GTGGCCTTCG TTCGGGCCGC GACGGGCGCG GAGCCCGAGG TGACCGGGAA ACCGGAGTTC GCGATGCACG CGGAGTCGGT GCGGCGCAGC GGCGCGCGTG ATCCGATCAT CGTCGGCGAC CGGCTGGACA CCGACATCGA GGCGGGTTTC CGTGCCGGCA CGCCGACTCT GCTGGTGTTC ACCGGTGTCA CCGGGCCCGC GGAGCTGCTC GGCGCGCCCG CGCGGCACCG GCCGACCTTC CTCGCCGCCG ACCTGCGCGG GCTGCTGCGC CCCCAGCCCG CCGCGCTCGC CCGGGACGGC TCATCCCGGT GCGGCGGGTG GACGTGCGAC CTGGACGGCG GCACCCTGCG CTGGCACCAG GCCGACCCCG GGAGCGCCGG GCTGGACGAC GGGCCGGACG ACGGGCTGGA CGCGCTGCGG GCCGCGTGCG CGCTGGTCTG GGCGGCAGCG GACGAGGGCC GTCCGGTCGA GGCCCTGGCC ACTGATCGGC CGCCGGGCTG CGAGGACCTG CGCGCGCCGG CCGCGCGCTG A
|
Protein sequence | MTVPREAAGT AWSVLAGTSR PLVAMFDVAL MDLDGVVNRG ERAVPHAAAA IEAAGRQGMR TVYVTNNALR TPETVAARLT GFGVPAEPPE VVTSAQAAAH VLAERLPAGA VVLVAGGVGL REAVRAEGLV PTGSAADEPA AVVQGFDPEI NYARLAEAVL AIRAGAWWVA SNTDLTVPTE RGLAPGNGAL VAFVRAATGA EPEVTGKPEF AMHAESVRRS GARDPIIVGD RLDTDIEAGF RAGTPTLLVF TGVTGPAELL GAPARHRPTF LAADLRGLLR PQPAALARDG SSRCGGWTCD LDGGTLRWHQ ADPGSAGLDD GPDDGLDALR AACALVWAAA DEGRPVEALA TDRPPGCEDL RAPAAR
|
| |