Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6766 |
Symbol | |
ID | 5675079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8229511 |
End bp | 8230491 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641245615 |
Product | endonuclease/exonuclease/phosphatase |
Protein accession | YP_001511006 |
Protein GI | 158318498 |
COG category | [R] General function prediction only |
COG ID | [COG2374] Predicted extracellular nuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACGA CCTACTACCT CGCCTGGTGG AACCTGGAGA ACCTCTTCGA CGAGGAGAAC TCACCCCGGC GGACCGAGAA ACTCACCCGC ACCCTCGGTG ACGACCTCGC CGGTTGGACG CCGCAGCTCC GCGACCGCAA GGTCTCCCAG CTGGCGTCGG TGATCGCGCA GATGAACGGC GGTGCGGGCC CGGATCTGCT CGGGGTGTGC GAGGTGGAGA ACCGGTTCGT CCTCGAGCTC CTCGCCGCCG CGGTCGCCGA CCGGCTCGGT GGGCGGCGCT ACGAGATCGT GCACGCCGAC ACCGACGATG CCCGCGGGAT CGACGTCGCG TTCCTCTACG ACCCGACCCT GCTGACCGCC CCGGCGGGGC AGGTCTTCTT CCACGTGGTG ATGCGCCGCA ACGCCACCCG CGAAATCGTC CAGGTCAACT TCCAGACCCA CACCGGGCGG ACGTGGGCGG TGTTCGGGAA CCACTGGCCG TCGCGGTCCG GCGGGCAGTA CGAGTCCGCC GGCTACCGGG CCATCGCCGG GGAGACCCTG GCCTACTTCC ACCAGCGGGT CCGTGAGGAA CACGGCCAGG ACACCGCCGC GCTGGCGATG GGTGACTTCA ACGACGAACC GTTCGACACC TCCCTCGTCG CCCACGCCCT CTCGACCCGG CAGGCCGTCC GGGTGATCAA CGCGGACACG CCAAGGTTCT GGAACCTGAT GTGGCCCGCC GCCGGCACCC CGGAGGGCAC GTTCTACTTC CAGAACGAAC CGAACCTGCT CGACCAGTTC CTCGTCAACG CGAACATGGC CCGCCCCACC AGCCCCCTGC ACGCCAACCC CGACAGCGTG CGGATCCTGC GGTTCCCCGA GCTGGTCCAC ACCGGCGACT ACCCCCGACC CCGCCCCTTC GGTGGGATGG GCGCCGCTGT CGATGAGACC GGCTACTCCG ACCACTTCCC CATCGGGATG ACCGTCACCG AAATCGACTG A
|
Protein sequence | MPTTYYLAWW NLENLFDEEN SPRRTEKLTR TLGDDLAGWT PQLRDRKVSQ LASVIAQMNG GAGPDLLGVC EVENRFVLEL LAAAVADRLG GRRYEIVHAD TDDARGIDVA FLYDPTLLTA PAGQVFFHVV MRRNATREIV QVNFQTHTGR TWAVFGNHWP SRSGGQYESA GYRAIAGETL AYFHQRVREE HGQDTAALAM GDFNDEPFDT SLVAHALSTR QAVRVINADT PRFWNLMWPA AGTPEGTFYF QNEPNLLDQF LVNANMARPT SPLHANPDSV RILRFPELVH TGDYPRPRPF GGMGAAVDET GYSDHFPIGM TVTEID
|
| |