Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2832 |
Symbol | |
ID | 5671221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3349571 |
End bp | 3350317 |
Gene Length | 747 bp |
Protein Length | 248 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241741 |
Product | HAD family hydrolase |
Protein accession | YP_001507161 |
Protein GI | 158314653 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.106168 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGTCA CGGTCGTCAG CGTGGACCTG TGGGGAACGC TGATCAGCTA CGGCGACCGG GACGCCGAAG CCGCCTGGCG GGTCCGCGAG TTCCACCTCG CCCTGACTGC GTTCGGCCAC ACCCTCCCCG CCGAGCACCT GGACACCACG GTCCGAGCGG TCCGTGCCGA ACTACTCGAC GAGCAGCGAC AGCACGGACG GCAGATCCCG GTTCCCGAGC AGGTCCGCAC GATCGTCCGC CGGCTCGGTA TCGACGCCGA CGACGCCCTC GTCCAGGTGC TCACGGTCGC GCACACCCAC GCCGTGCTGC GGGCCTGCCC CCAGCTGATC CCCGGCGCGC ACGCCTGCCT CGCCGCGCTG TCCGCCGCCG GCTACCGCCT TGTCCTCGCC TCCAACACCC TGGCGACCCC AGGAGCCGTC ACCCGGCAGA TCCTCGACCG CCACCGGCTC ACCGACCACT TCGACCGGCT GTTCTTCTCC AGCGAACTCG GTGTCGCCAA ACCCCAGGCC GCGATGTTCC ACGCCATCGC CGAACAGACC GACACCACCG TCGACCAGGT CGTGCACGTC GGCAACGACT GGCGCACCGA CGTCCGCGGA GCCCTCGCCG CGGGCTGCCG CGCGGTCTGG TTCAACCCCG GGCGCAAGCC GTCGCGACCC GAAGCGCCCG ACGCCGCACT GCTCGTCGAC ATCCCCGTAC TGGTCCGGCG ACTCCGCCCC CCAGACCACG CCCACCGCGG CACCTGA
|
Protein sequence | MSVTVVSVDL WGTLISYGDR DAEAAWRVRE FHLALTAFGH TLPAEHLDTT VRAVRAELLD EQRQHGRQIP VPEQVRTIVR RLGIDADDAL VQVLTVAHTH AVLRACPQLI PGAHACLAAL SAAGYRLVLA SNTLATPGAV TRQILDRHRL TDHFDRLFFS SELGVAKPQA AMFHAIAEQT DTTVDQVVHV GNDWRTDVRG ALAAGCRAVW FNPGRKPSRP EAPDAALLVD IPVLVRRLRP PDHAHRGT
|
| |