Gene Information |
Locus tag | Franean1_7005 |
Symbol | |
ID | 5675316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8540033 |
End bp | 8541691 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641245851 |
Product | hypothetical protein |
Protein accession | YP_001511242 |
Protein GI | 158318734 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
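The coordinate and length fields above agree with each other, and the relationship can be sketched in a few lines. This sketch assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1659 bp) and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(8540033, 8541691)
print(length, protein_length_aa(length))
```

With the values from this record, the computed gene length and protein length match the reported 1659 bp and 552 aa.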
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0904061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.234272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCACG CGCTTCACCA CGCCCTTGAG CCGAGTGCCG AATACAATGT CGACCTGACG GTGACTGCCG GGCGCTGGCC CGACGGGCTG GCCGGGAACA TCTTCGTCAT CGGGCCGTCG CAGCCGACCG CCGTCGACTT CATGTTCGCC GGGCCGGGCC TGCTCACCCA TGTGGACCTC GAGACCAGGC ACTGGCGGAC GAAGCGGGTC GTCACGCCCG ACCTGGCTCT TCTCGGCGGC CTCAGCCGGG CGTTGCCACC GGCCGAGCTG GCGGGACTGA CCGTCGGCGG ACGGCCGTCC CTGACGAACG TGTCGCCGCA CTTCTTCGGC GACCGGCTAC TGCTCACCGG GCTGAGCCAG CGGCCGGTCG AGTTCGACCC GGCGACTTTG GAGTTCAAGA CGTTTCTCGG CGGGGTCAGC GAATACCCCG AGGTCGTCGC GCATCCCCTG TTCCCCGGTG TGCGGACGGC GGCGCACCCG GTCGAGGACC TCGACGACGG TTGCATGTGG TGGTGCAACA CGAACCTCCG TCCCCGGGGT TCGTCCACCT CCGACATGGA GGGACCCATG TGGGTGGTGC GCTGGGACGG GCACGGCGAC GTCGAGACGT GGCACGTACC CGGCGCGCAC CTGACCCAGG GCGTGCACGA GATGACAGTG ACCCAGGACT ACGTGATCTT CACGGAGATC GGGTTCCAGC CCGAGCCCGG CACCGTCGCC GGACGCGGCC GCACCAAACC GCATCTGCCC TTCACCGACA TCTATCTGGT GGGCAAGCGC GACCTCACCG TCGCCCGGCG GGGCCGCAGC GTGCCGGTGG CCCACGCCCG CGTCCCACGC GAGTCGTTCC ACCACTTCGC CGACTACCGC CAGGACGGCG ACGACGTCAC GATGTACCTC GCTCATTCGA ACGGCTGGGA TCTCAACTAC GTGCTCACCG ATGCCGACAG CGTCTGGGGA ACCGGCTCCG GCCTCGCCAA GGGCCTGCAC GGCTTCGTCT CCGCCCCGGT GGACGCTTCA CCGGTCGGCC GCTACGTGAT CGACGGCCGA ACCGGTGAGG TCAGGGACAG CCATGTCTTC CTCGACCCGG AACGCCACTG GGCCACGCTT CTCTATGGCC GTGACATGCG GCGGCCGGGG CTCGAGCGCG GCCGCTACCT GTGGCAGTCC TACTGGGGAT GCGACACCGA GATGCTGGCG ACCCGGATCG TCGAGATGTA CCGGGATCAC CCCTATCGCG TTGTCCCGGT GGACCAGTTG CCGTCACGGG AGATCCCGTC GTCCCTGGTG TGCATTGACC TGGAGACGAT GACCGAACAG TCGGCCTGGT CATTCCCCGC CGGCACCACC AGCGAGTCAC CCGTGTTCGT TCCCGACCCC GCGGGCGGCC CCGGCTGGGC GGTGATTTTT GTCCACTACT CCGACCGGAC CGAACTTCAG GTGTTCGACG CCCTGGCTCT CGGCGCGGGG CCCGTCGCCG TCGCCACCGC CGAGGGCCTC AAACTGTCCG TCCAGTTCCA CTCCGCTTAT CTGCCCAGCA TCCGTCTACG TGACACGGGC TACGAACGTT CGTTCGCGGC CGACCTCGGC GACGGCTGGC GGGATTTCTC GCCGTCCGCC CGCGGTGTGA TCAGTAAGGT GCTCGAGCGG TACGGCTGA
|
Protein sequence | MSHALHHALE PSAEYNVDLT VTAGRWPDGL AGNIFVIGPS QPTAVDFMFA GPGLLTHVDL ETRHWRTKRV VTPDLALLGG LSRALPPAEL AGLTVGGRPS LTNVSPHFFG DRLLLTGLSQ RPVEFDPATL EFKTFLGGVS EYPEVVAHPL FPGVRTAAHP VEDLDDGCMW WCNTNLRPRG SSTSDMEGPM WVVRWDGHGD VETWHVPGAH LTQGVHEMTV TQDYVIFTEI GFQPEPGTVA GRGRTKPHLP FTDIYLVGKR DLTVARRGRS VPVAHARVPR ESFHHFADYR QDGDDVTMYL AHSNGWDLNY VLTDADSVWG TGSGLAKGLH GFVSAPVDAS PVGRYVIDGR TGEVRDSHVF LDPERHWATL LYGRDMRRPG LERGRYLWQS YWGCDTEMLA TRIVEMYRDH PYRVVPVDQL PSREIPSSLV CIDLETMTEQ SAWSFPAGTT SESPVFVPDP AGGPGWAVIF VHYSDRTELQ VFDALALGAG PVAVATAEGL KLSVQFHSAY LPSIRLRDTG YERSFAADLG DGWRDFSPSA RGVISKVLER YG
|
| |
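The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the snippet below shows one way to strip that spacing, compute the GC fraction, and translate the gene sequence so the result can be compared with the listed protein. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons; this sketch simply translates every codon literally, which is enough here because the gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGAGTCACG CGCTTCACCA CGCCCTTGAG"
print(translate(first_blocks))  # MSHALHHALE
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence is a quick consistency check, and `gc_fraction` on the full gene sequence can be compared with the reported GC content.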