Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0784 |
Symbol | |
ID | 5669200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 909433 |
End bp | 910659 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239712 |
Product | hypothetical protein |
Protein accession | YP_001505148 |
Protein GI | 158312640 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.246481 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACT CCGAGCGCTG GATCGTCGTC GTCGAGGAGC CCTTCCTCCC CGCGGATGCC GGCGGGCGGG TGGAGACCTT CAGCTTCCTC ACGGCGGCCT CGGCGGCCGG TATCCGGATG CAGGTCCTGG TGCCGTCCCG CACCGACCTG GACATCGCGG CCTACGAGGA CGCCGTCCCC GGCGCGGCGG TCATCCGCCT ACCCCGGGAC GACAGCCCGC TGGCGCACCT CTCGCCGCGG CCGTTCACGC ACGCGTCCCG CCCGGTCGGC CCGCTGCGCC GGGCGCTGGA GAACACCCCG CCGCGCGCCG ACTCCGTCAT CAGCTACAGC TGCCGGACGT CGCATCTCGG CGAGGAGATC GCGCGGATCT GGCGGCTTCC CCACCTGGTG CGGGCGCACA ACATCGACTC GGAGTTCTTC CGCGTCCTGG CCCGGAACTC CACGGGCCCC CGCGCGGTCG CCTACGAGCT CGAGTACCAC CGCCTGCGGC TCGCCGAGCA GGCCATGCAC CACTCACCGC TGGTGAACGC CATCGCGGAC ATCTCCGTGG AGGACCACGA GTGGCGCCGC GGGCGGGCGA GCGTCCCGAC GTTCCACCTG CCGCCGTTCC TGCCCGCCAG CACGGTCGCC GAGGCCCGCG CGGCCGGCGG CGTCGCGGAC ACGGAGCGGG CCGGCGAGCG CCTGGTCTTC GTCGGCTCGC TGGACACGCC CACCAACATC GAGGCGCTGC GCTGGTTCCT GGGCGGCTGC TGGCCCACGA TCCGGGTGCG CCACCCCGCG GCCGTCCTGC AGGTCGTCGG CCGCCGTCCG GAGGACGGCC TGGCCGAGTG GCTGGCCGGC TTCGACAGCG TTGAGCTGCA CACCGACGTG CCGAGCGTGC TCGGCTACGT GGCCGGGGCG ACCGTGTCGG TGAACCCGAT GCGCTCCGGA TCCGGGGTCA ACATCAAGGC GATCGAGGCG ATGTCCGCCG GGACGCCTGT CGTCAGCACC CCGACCGGCA GCCGCGGCCT GGGCTGGCGC CCGGGCGAGC ACCTGCTGGT CGCCGACGAT CCGGGCGCGT TCGCGGACGC CGTCTGCGGG CTGCTGGACA ACCCCTGGCT CGCCGCCGAG GTCGGGACGG CCGGGCGCGA GTTCGTCCTG CGCGAGCTCG ATCACGCGAC GCTCATCGAC CGGGTCCGGG GCATGCTGGC CGGGCGCACC GAGGAGACCA CAGCTCAGAC CGCTTGA
|
Protein sequence | MSDSERWIVV VEEPFLPADA GGRVETFSFL TAASAAGIRM QVLVPSRTDL DIAAYEDAVP GAAVIRLPRD DSPLAHLSPR PFTHASRPVG PLRRALENTP PRADSVISYS CRTSHLGEEI ARIWRLPHLV RAHNIDSEFF RVLARNSTGP RAVAYELEYH RLRLAEQAMH HSPLVNAIAD ISVEDHEWRR GRASVPTFHL PPFLPASTVA EARAAGGVAD TERAGERLVF VGSLDTPTNI EALRWFLGGC WPTIRVRHPA AVLQVVGRRP EDGLAEWLAG FDSVELHTDV PSVLGYVAGA TVSVNPMRSG SGVNIKAIEA MSAGTPVVST PTGSRGLGWR PGEHLLVADD PGAFADAVCG LLDNPWLAAE VGTAGREFVL RELDHATLID RVRGMLAGRT EETTAQTA
|
| |