Gene Information |
Locus tag | Franean1_6995 |
Symbol | |
ID | 5675306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8521318 |
End bp | 8523081 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641245841 |
Product | hypothetical protein |
Protein accession | YP_001511232 |
Protein GI | 158318724 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
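The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (both are assumptions, though they agree with the numbers reported in this record). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 8521318
END_BP = 8523081


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # compare with the "Gene Length" field
print(protein_length(length_bp), "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values match the 1764 bp and 587 aa listed in the table.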
Plasmid Coverage Information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.511073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.817264 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCGCG GCGGACAGTG GCGAATTCGA CAATGCCGTG CTGCGGCTGG ATCTCGCGGT GCGCAAGCGC GCCAGCCGGC TCGGCGAGGA GCCGGTTGTT CCCATGACGC CACCGGTCCG GGAACGGGTG AACAGAATGA TTCATGTTCT CACCGAGGTG TCGGGCCGCG ACACGTCGGC AGTGGGAGGC CGCGACGCCG TCGGCCCCGG GAACCGTCGG CGCCCCGGGT CGTCGGAGAC CCCGGCCCCG GCCGACACCG GCCACTGAGG GTGGCGGGCC GCTCGGGGTC GGTCCCACCG GCCGGCCGGA CCGCGGGTGC CGGGCCTGAA CCAGCGGCGC ACGAGCTCTG GGGGGCCGAA CTCGTCGTCC ACCTGTTTGC TCCGGTGGAT GGGCCCGAAG CGGCGGACGC GTACGGGTTT CTCCGGGACA TCTGGTCGGC GTGCCGCCGC ACTCTGAAGA TGGGATCGGC TCTCGACCGG TTGGGAGTGC CCGCCGACCT GCCGGGTTCG GTCGGCGACC TGCCTTCCCG AGGTCTGCTC GCCGCCCAGC TCAACCTGGG CGACGGGACG CACCAGGCGC TGTTCCGGCG GGACCATGAC CTGCTCTGTC TGTCCGTGAT GCTCGCCCCG GCGGTGGACG ACGGTGCCCT CGACTGGTCC GCGCCGGAGG ACCAGTGGAG CGGCGTGGCG GATGAGCTGC CCGCCTCGCT GGTCGGCGCC GTACGTCTTT ACCAGGGGCA TCTCGGCCGC TCGGAGGCGG GTCCGCGGGC GGTGGTGGCC GCGGCTCCCG AGCTCGCCCG CACCTGCGCC AACGCCTGGT CCGGGTGGCA CCAGGCGCCC GGATGGGAGG AACGCGGTGC CACCACCGGC CTGGGGTTCG CGGTGTGGGA GCCGGCCGCT CCGGCGGACG ACCAGATCGA GCGGAGGCTG CTTATCCTGG TTCCGGGCTC ACGGGACGGA GAACTGGTCG CCTGGACGTG GAACCGCCGG GACGCCGCCG CGCCCGCTTT CCAACGCTAT CTGGCCGGGG CGGCGAAGGT CCGCTACCAG TGGCGCGTTC GGGACGGCGG GGACGCGGTC CACGGCCTGC GCGACCGGCT CGACGAACGG GGAGCGCGGC TGCGGGAGTT CCTGCGCGAC CCGGTCCCGT TCCCGGACCG CGTCGCCGCC TGTGTGCACC AGCTCCACGC CGACCAGGTG GACCTGACGA CCGTCACCAC GAGGATGACG GAGATGCGGC GCAGCGTGGA GATCTCCGGG GTCAACATGG CGGCGATCCT GGCGGCCGAT GGGACCGTGC CTCGCGACGG CGACCCGTTC GCCACTGACC GGCGGCTGGT GGAGCACGTG GTCGGCCAGC TCGACGACGA CCTGGTCTTC CTCGGCGCGG CGGACACGCG CGTCGAGGCG GTCCTGGCCC TCGCGGCCCG GTCGACCCAG CACGTCCGGT CTGCCGAGCA CGTCCGGTCG GCACCGGACA TCCGGCTCGA TCCGGAGGAG ATGACCCAAC TGCGCGCGGA GCTCGCGGCG GCCTTCGGCG CCGGGGTACG GGCGTCTCAA CTGCTCGAGG AGATCGGCCT GCCCCGCGCG CGCCAGTTCG TCCAGGGCGG TGCGACTCCG CTGGAATGGT GGACCGAGAT GCTCCGGGAG CTCGGCAACG GCGCGGTCGA CCGGCCGTAC CGTAAAACGC TCGAGGCCGC GCTGCGGGAG TACGGCTACA ACGACGTTTT CGTCCGGCTG GCCCGTCGGC ACGGGCTGCG CTGA
|
Protein sequence | MVRGGQWRIR QCRAAAGSRG AQARQPARRG AGCSHDATGP GTGEQNDSCS HRGVGPRHVG SGRPRRRRPR EPSAPRVVGD PGPGRHRPLR VAGRSGSVPP AGRTAGAGPE PAAHELWGAE LVVHLFAPVD GPEAADAYGF LRDIWSACRR TLKMGSALDR LGVPADLPGS VGDLPSRGLL AAQLNLGDGT HQALFRRDHD LLCLSVMLAP AVDDGALDWS APEDQWSGVA DELPASLVGA VRLYQGHLGR SEAGPRAVVA AAPELARTCA NAWSGWHQAP GWEERGATTG LGFAVWEPAA PADDQIERRL LILVPGSRDG ELVAWTWNRR DAAAPAFQRY LAGAAKVRYQ WRVRDGGDAV HGLRDRLDER GARLREFLRD PVPFPDRVAA CVHQLHADQV DLTTVTTRMT EMRRSVEISG VNMAAILAAD GTVPRDGDPF ATDRRLVEHV VGQLDDDLVF LGAADTRVEA VLALAARSTQ HVRSAEHVRS APDIRLDPEE MTQLRAELAA AFGAGVRASQ LLEEIGLPRA RQFVQGGATP LEWWTEMLRE LGNGAVDRPY RKTLEAALRE YGYNDVFVRL ARRHGLR
|
| |
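The protein sequence above can be derived from the gene sequence by codon-wise translation. The sketch below illustrates this on short fragments of the gene sequence, assuming the sequence is already given in coding orientation (the record lists the gene on the minus strand, but the displayed sequence starts with ATG). The `translate` helper and the codon table construction are illustrative and not part of the source database. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation; it differs in which start codons are allowed, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGTCCGCG GCGGACAGTG GCGAATTCGA"))  # MVRGGQWRIR
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CTGCGCTGA"))  # LR
```

The two fragments reproduce the first ten residues (MVRGGQWRIR) and the last two residues (LR) of the protein sequence listed above.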