Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1176 |
Symbol | |
ID | 5669589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1400042 |
End bp | 1401055 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240108 |
Product | HAD family hydrolase |
Protein accession | YP_001505536 |
Protein GI | 158313028 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00246579 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.129694 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCTTCT CCTGGTTCGC CCCGGACGCG GTGCCGGCCG TCGACGGGAC ACCCCTCCCG GACGACCGCA AGCCGCTCGC GGCCGTGGTG TTCGACTGGG GCGGGACGCT CACCCTCTTC CACGACGTCG ACCTGCTCGA CCTGTGGCGG ATGACCGCGC TGGAGATCTC GTCCGCACAC GCCGACGAGA TCACCGCGAT CCTGGCTGGC CTGGAGACGA CCTGGTGGCA TGCCCGGGAC GGCAGCGACG GCCTGGCCGG CAACGGCGCG GCGGGCGGTA TGACCGGGTC GGTCACCGAA CTGCTTGCGG CCGCCTCCGC GGCCCTCGGC GTGGACGTGG CCGCGGCCGT GCTGCGCGCG GTCAGCCGGC GCGACCTCGC AGCCTGGACC CCGCACACGG TGTGCGATCC CGAGGCACTG CTGCTCGTGC ACCTCGTGCG TGCCCGCGGC CTGCTCGTCG GCCTGCTCAC CAACACCCGC TGGCCACGGG CCTGGCACGA GCGCCTACTC GAACGCGACG GCCTGCTCGA CCTCTTCCAC GCCAGGATCT ACACGAGCGA CCTCCCTTTC GACAAGCCAC ACCCGATCGC CTTCCAGGCG GTGCTGGCCG CACTGGGCGT CACCGACCCG TCCCGGGTGC TCTTCGTCGG CGACCGGCTG CGGACGGACA TCCGCGGCGC GCGGGCGGCC GGCATGCGGG CTGTGCTGGT CGCGGACGGT GGCTTCGCGG ACGGCGGATT GGCGGACGGC GCCGCGGCGG GCCTCGGCGG TGGTCCCGGC GGCGTGGTCC TCGATCCGCG TGATCCCCTT GGGCTGGCGG TGGCGGGCCG CGATCTCGCA GGGCGGGCCG ACCTGGGCCG AGACCCCGCC GCGCGGGATC TGCCCGGTGC GGACGGGGCC GGGGCGTCCG CGTCGATGCT CGCGGATGCG GTGATCGGCC GCCTCGGCCA CCTGCTCGAG GTCCTCGACC TGCTCGGCGC CCCCGGGCCT GGGTCACCCG CCCCACGTCG CTGA
|
Protein sequence | MGFSWFAPDA VPAVDGTPLP DDRKPLAAVV FDWGGTLTLF HDVDLLDLWR MTALEISSAH ADEITAILAG LETTWWHARD GSDGLAGNGA AGGMTGSVTE LLAAASAALG VDVAAAVLRA VSRRDLAAWT PHTVCDPEAL LLVHLVRARG LLVGLLTNTR WPRAWHERLL ERDGLLDLFH ARIYTSDLPF DKPHPIAFQA VLAALGVTDP SRVLFVGDRL RTDIRGARAA GMRAVLVADG GFADGGLADG AAAGLGGGPG GVVLDPRDPL GLAVAGRDLA GRADLGRDPA ARDLPGADGA GASASMLADA VIGRLGHLLE VLDLLGAPGP GSPAPRR
|
| |