Gene Information |
Locus tag | Franean1_0801 |
Symbol | |
ID | 5669217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 934204 |
End bp | 936909 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239729 |
Product | hypothetical protein |
Protein accession | YP_001505165 |
Protein GI | 158312657 |
COG category | |
COG ID | |
TIGRFAM ID | |
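
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(934204, 936909)
print(length)                     # 2706
print(protein_length_aa(length))  # 901
```

Both results match the Gene Length (2706 bp) and Protein Length (901 aa) fields listed in the record.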
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0459133 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCCTG ACGAGTGGCA TGGCCACGGC CTCGGCCGCC CCGATCTGAC CCGTGAGAGC CGGGCCGACG TGCCGACCGG TGAATGGGTC GTCGACGACG GCCCCGCGGG CACCCGGCCG GCGTCCGACT GGTCGATCGG GGTCTATCGG CGCGGTGACG AGGTTCCGGG GCCCGGCCCG GCTACCGGAG CGACGGCCGC CACCGGCGCC ACCGGCGCCA CCTCCGCCAC CTCCGCCTCG CCGCCCGCTG GAGCGGCCGC GGTTCCCGCC CCTCCCGCCC AGCCGGCCAC CGGCAGGGCT CCCGGCTCAC CTTTTGACCC CGAGATGCTC TCCTCCCGGC GCCGGCCGGC GCGCCGCCGT TCGCCGGCCG CGCCGATCGC CCCCTACCCG GAGGGCCCGG CCTACCCGGG ATCCGCCACG CGCGGCGAGT CCGGGGCGCG CGGCGAGTCC GGGGCGGAAT GGCCCGATGA CCACGAGTCC GGCCAACGCG GACGGGGCCA CAGCGGGTCC CGGCGCGATC GCGACTCCAG ATGGGACGTG CCCGGATGGG ACGCCCCCGG ATCCGACCGT CCTGGCCCCG GCGAGGGCGA ACAGCGCCAG CGTCAGCCTG AACGGCACCG GTCCGGACCC GGCCGGCCCG CGGGTCGCGG CGCCGGTGAG CCCACGTCGG GGTACGGCGA ACCCGGCCAC GGCCACGGGC GCGATCCGTA CCGTGGCGCG GAGGGGCGTC CCGGCGCGGC GGGGCACTGG GATGAGCCGG ACTGGGCCGG CGAGCCCTCC CGTCCCGCCC CGTGGACCCC ACCGCCCTTC GAGGCACCGT CATCCTCGGC ACCGCCCGCA CGGGTCCCGC CGCCGGCGGC ACCGCCGTTC TCGGCACCGT CGTTCTCGGC ACCGAGCTCG CGGGTGCCGC CGCCGCCGTC GGAGTCACCG TCCTCGACGT GGCCGCCCGG TGATCTCGCC ACCCAGCGTT TCCAGTGGCG AGACTGGGAA GGTGAGCCGC GGCCAGGAGG ACCGAGGCCC ACCCGGCCGG GCGGGGAACG CGCAGACGAG ACGATGATCC TCCCGCCGTC CCGGCCGGAG CAGGGCGCGC GCGAACGCGG GGACCGCGAC CGGCAGAGCC GACGGTCGGG CCGTGACCAG CGGAACCATC AGCCGGGCCG TGCCCCCCGC GACCAGCCGG ACCCTGATCG GCGCGGCGAC CAGCCGGGCC TTGACCGACG TGACCGGCCC GGCCGGGATC GGCGGGGCGA TCAGCCGGAG CGGTCGCGGC GGGGGCAGGA GCCGGCCGGG CGGGCGCAGG AACATCCGGA GCGGCCGCGG CAGCGGCGTT CGGCCACCGA CCGCGCCCGG TCCGATCGCG CCCGGTCCGA TCGCGCCCGG CCTGACCGCG CCCGGTCCGA GCGCTCGCGG CCCGGGTCCG CCGGCGGGCA CGTCACCGGG GTGCTGCCCA TTGCCGACCC CGTCCGGGAC GGGCGTGGTT CCGCGCAGCC GCCGCGGACC GAGCCGGCGG CCGACGCATC GTTGCCACAC CGGGTCACCG ACTGGATCAT CGAGCACTGG GGCGCGAAGC CTGGGCGGCA CCCGTACCTG CCAGTGGGCC TGGTCGCCCT GGCGGCGGTG CTGGCCCTGG TCGCCTGGAT GCTCGGGCCG GCCCAGAACA GCCCGGCGAG CGTGGACGCG GCCCCGGTCA CGCCGACTCC GTCCGCCGCA CCCTCGCCTG CCGCCGTGCC GCCGCCGGCA CCCGCCTCGG TCGAGCCCGC CCCCGGATCC GGCGCGGTCG CCCCGCCATC GGGGCAGGTG 
ACCCGCGTGG CGCGGGCGGC GACCGCGGGG AGTTTCCCCA GCGGGGTGGC CGCGCACACG GTGGCGGAGG CGACGGCGTG GGCGCAGTTC CGCGGCCGGC CGGTGGACGT GGTGGTCACC TACACGGACC GGAACAGCTG GGACGCGATC GTGAACCCCT GGATCGGCCG CAGCGCGTCG ACGTTCTCAA ACTTCGCCGG CACCCTGGTC ATCAGCGTTC CCCTTTTTCC GGATGAGGGC CCGGAGCTGG GAAATCTCAC CGACTGTGCC GCTGGTGACT ACGACGCGAA ATGGCGCCAG TTCGGCCGGT GGCTGGTCAG CGAGGGCCGT GGGGACTCGT TCGTCCGCCT CGGCTGGGAG TTCAACGGCG ACTGGTTCGC CTGGCGGGCC TCAGCGAGCC CGACGTCCTA CGTGCAGTGC TTCCGCAACG CCTCGGCGTC GATCAAGGAG ACGAGCCCGA AGGTCCGCAT CGACTGGAAC ATCAACGCCC ACGGGCCGCG CAGCGCCTTC GCCGTCTACC CGGGCGACCA GTACGTCGAC GTCATCGGCA TCGACAGCTA CGACCAGTAC CCGCCGAGCC CGACCCTCAG CGCCTTCGAC GCCCAGTGCG ACGCCACCGA AGGCCTGTGC CAGGTGATCA GTTTCGCCCG CCGGCACAAC AAGCTGTTCT CGGTGCCCGA GTGGGGTGTG GTCAGCCAGC AGAACACCAA GGCCGGCGCC GTCGGCCAGG CGGGCGGGGA CAACCCGGTC TACATCGAGC GGATGTACAG CATCTTCGAG CGCAACGCGG ACATCCTCGC CTACGAGGCG TACTTCAGCG ACGACGTCCC GGGCAACGTC CACTCGTCCC TGCTCAGCCC CAACCGCCAC CCACGCTCGG CGGACACCTA CAAACGACTC TGGTAG
|
Protein sequence | MPPDEWHGHG LGRPDLTRES RADVPTGEWV VDDGPAGTRP ASDWSIGVYR RGDEVPGPGP ATGATAATGA TGATSATSAS PPAGAAAVPA PPAQPATGRA PGSPFDPEML SSRRRPARRR SPAAPIAPYP EGPAYPGSAT RGESGARGES GAEWPDDHES GQRGRGHSGS RRDRDSRWDV PGWDAPGSDR PGPGEGEQRQ RQPERHRSGP GRPAGRGAGE PTSGYGEPGH GHGRDPYRGA EGRPGAAGHW DEPDWAGEPS RPAPWTPPPF EAPSSSAPPA RVPPPAAPPF SAPSFSAPSS RVPPPPSESP SSTWPPGDLA TQRFQWRDWE GEPRPGGPRP TRPGGERADE TMILPPSRPE QGARERGDRD RQSRRSGRDQ RNHQPGRAPR DQPDPDRRGD QPGLDRRDRP GRDRRGDQPE RSRRGQEPAG RAQEHPERPR QRRSATDRAR SDRARSDRAR PDRARSERSR PGSAGGHVTG VLPIADPVRD GRGSAQPPRT EPAADASLPH RVTDWIIEHW GAKPGRHPYL PVGLVALAAV LALVAWMLGP AQNSPASVDA APVTPTPSAA PSPAAVPPPA PASVEPAPGS GAVAPPSGQV TRVARAATAG SFPSGVAAHT VAEATAWAQF RGRPVDVVVT YTDRNSWDAI VNPWIGRSAS TFSNFAGTLV ISVPLFPDEG PELGNLTDCA AGDYDAKWRQ FGRWLVSEGR GDSFVRLGWE FNGDWFAWRA SASPTSYVQC FRNASASIKE TSPKVRIDWN INAHGPRSAF AVYPGDQYVD VIGIDSYDQY PPSPTLSAFD AQCDATEGLC QVISFARRHN KLFSVPEWGV VSQQNTKAGA VGQAGGDNPV YIERMYSIFE RNADILAYEA YFSDDVPGNV HSSLLSPNRH PRSADTYKRL W
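
The GC content field can in principle be recomputed from the gene sequence shown above. The following is a minimal sketch, assuming the sequence is pasted as whitespace-separated blocks of 10 bases as in this record; `gc_content` is a hypothetical helper, and the demonstration uses only the first two blocks of the gene sequence rather than the full 2706 bp.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring any whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence in this record.
first_blocks = "ATGCCGCCTG ACGAGTGGCA"
print(f"{gc_content(first_blocks):.0%}")  # 65%
```

Passing the complete gene sequence to the same function would give the value to compare against the GC content field of the record.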