Gene Information |
Locus tag | Franean1_4152 |
Symbol | |
ID | 5672507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4932356 |
End bp | 4933591 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243025 |
Product | epoxide hydrolase domain-containing protein |
Protein accession | YP_001508442 |
Protein GI | 158315934 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
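The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the gene ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4932356, 4933591)
print(length)                     # 1236
print(protein_length_aa(length))  # 411
```

Both results agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields of this record.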
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0493973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.916687 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCCC TGCATGCCTT CCCCCTGGAG CCCGCCCCGA TCCATGTGTC CGACGACGTC CTCGATGACC TGCGCGCCCG TCTCACCCTG ACCCGTCCGC CGCTGGACGA GGGGAATGAG GACTGGTCCT ACGGCGTCCC GGGCAGCTAT CTGCGTGAGC TGGTCGTCTA CTGGCGGGAC GGCTACGACT GGCGCAAGGC CGAGGCCGCC ATCAACGCCT ACGAGCACTA CCAGGTGAGC GTCGCCGGTG TCCCGGTGCA CTTCATGCGC AGGCCCGGCC GCGGCCCCCG CCCGATTCCG TTGATCCTCA CCCACGGCTG GCCGTGGACG TTCTGGCACT GGTCGAAGGT GATCGACCCG CTCGCCGACC CGGCCACGTT CGGCGGTGAC CCCACCGACG CGTTCGACGT CCTCGTGCCG TCCCTGCCCG GCTTCGGTTT CCCTGGCCCG CTCACCAGCT TTTCCGACGT CAACTTCTGG AAGGTCTCCG ACCTCTGGCA CACCCTGATG ACCGAGACCC TGGGATACGA GAAGTACGCC GCCGGGGGCT GCGACATCGG CGGGATCGTC TCCAGCCAGC TCGGCCACAA GTACGCCGAC GAGCTGTACG GCATCCACAT CGGCTCCGGG CTGCCGCTCG ACTTCTTCAC CGGTCCCCGC GCCTGGGACT TCGCCCGGAA CCGGCCCCTC ACCGACGACC AGCCCGCCGA CGTCCGCGCC CGGATCATCG AGCTGGACCA CCGCTCGGCG TCCCACCTCG CCGTGCACAT GCTCGACGGC GCCACCCTGG CCCACGGGCT GAGCGACTCA CCCGCCGGAC TGCTCGCCTG GCTGCTGGAA CGCTGGAACG CCTGGAGCGA CAACGGCGGC GACGTCGAGT CCGTCTTCAC CAAGGACGAC CTGCTCACCC ACGCCACGAT CTACTGGGTG AACAACTCCA TCGCCACGTC GATGCGTTAC TACACCAACG CCAACCGCTA CCCCTGGGCT CCTGCCCACG ACCGCACCCC GGTCGTGCAG GCCCCGGTCG GCCTCACCTT CGTCACCTAC GAGAACCCGC CCGGCATCCA CACCGCAGAC GAGCGTGTCC AGGCGTTCAA GACCGGCCCG CAGGCCGACT GGTTCAACCA CGTCAACGTC AACGCCCACG ACCACGGCGG CCACTTCATC CCCTGGGAGA ATCCCGACGC CTGGGTGAGC GACCTGCGAC GCACCTTCCA CGGCCGCAGG CCCTGA
|
Protein sequence | MTPLHAFPLE PAPIHVSDDV LDDLRARLTL TRPPLDEGNE DWSYGVPGSY LRELVVYWRD GYDWRKAEAA INAYEHYQVS VAGVPVHFMR RPGRGPRPIP LILTHGWPWT FWHWSKVIDP LADPATFGGD PTDAFDVLVP SLPGFGFPGP LTSFSDVNFW KVSDLWHTLM TETLGYEKYA AGGCDIGGIV SSQLGHKYAD ELYGIHIGSG LPLDFFTGPR AWDFARNRPL TDDQPADVRA RIIELDHRSA SHLAVHMLDG ATLAHGLSDS PAGLLAWLLE RWNAWSDNGG DVESVFTKDD LLTHATIYWV NNSIATSMRY YTNANRYPWA PAHDRTPVVQ APVGLTFVTY ENPPGIHTAD ERVQAFKTGP QADWFNHVNV NAHDHGGHFI PWENPDAWVS DLRRTFHGRR P
|
| |
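The sequence fields above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be cleaned, checked for GC content, and translated. It assumes the gene sequence is given in coding orientation (it starts with ATG and ends with TGA as shown, even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for amino acids. All function names are hypothetical, and alternative start codons are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence field above.
print(translate("ATGACCCCCC TGCATGCC"))  # MTPLHA
```

Applied to the full Gene sequence field, the output of `translate` can be compared with the Protein sequence field, and `gc_fraction` with the GC content field.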