Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6615 |
Symbol | |
ID | 5674930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8049714 |
End bp | 8051399 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641245466 |
Product | plasmid pRiA4b ORF-3 family protein |
Protein accession | YP_001510858 |
Protein GI | 158318350 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCCG TGAGCCGCGG TCGCAACGAC AGGAAGAAGC CCTCTTCCAA GGGCCCGGCG CCAGCCAGAC GCGCGTCGCG GCCCGCGCCG TCCTCCGTGC CGGAGATCCG GAACCAGTCT CCGCGGCCTC ATGACGACCG CGAGGCGGAG CTGCGCGCGT TCACCACCGA CATGCTCAGC ATGGGCGGGG TACTCCGCGA GAGCGACGAC CCGCTGATGG CGGAGGTGGT CGGTGCGACG TTCGCCGTGC TCGATGACCT CACCGATGGC TCGGTCGGGG AGGCATTCCT TGAGGAGATC GTGCCGCGGC TGGAGACCGC GGCGACCGGG GACGCTCTTG CGGTGCTGCT GGCGATAGGT GCCGTCGTGC CAGGGCCGGT CAGTGCCACG GCGGATGCCG CCGCCGGCCG GCTGACCGCG GCCGGGGTCT CCCTGCCGCA GTGGGCGGAC GAGCTTACGG TGCCGCCGCT GGCCGGCGAC TTCCAGCGCC GGTGCGACGA CGAAGGTATC GCCGTCTTCC TGAGCTGCAC GGTCGAGCGG GCTGGTCGGC GCCATGCGTT CCTGGTGTGC GTCGACCCAT TGGCCTGCGG TGAGGCCGGA GACCTCCTCG TCCTCCCGGC CGAGGATCTT CCGAGGGTTC TGGAGGGGGT CACCGCGGAG GCTCGCAAGA ACAAGGTCAG GCTCCGAACC GAGACGCTAG ACGCCGAGGA GTTCCGCTGG CAGGTCGAGA ACGCCCTGAA TGCGCGGGCG GTGCACGACG ACGAGCTCTC GTTCGACGGG GACGACGGGG AAGCTGCCTC CTTTCTCCGG TTCGGCGACG AGGACGAGAT CCCCTTCGGG GAGGAGGACG GCGAGGGCGG GCCGCCGTAC GAGGCCGTGG CGCTGGTGCT GCGGTCGCGC CTCGCGACGC TACCGACCCC GCGGAGGCCA CCCGCGCCGC ACGCGGATGA CGACGGGGAC GAGGGAATCG ACGCGCTCGG CGGTATGGCC GACCTCGTAA AGATGCTGAA CGAGCGGGGG GCGAACCCCG CCGAGCTCGC CCTGGCGAAC CTGACCGGAT GGCGCGGCCT GCCGGCGACG GGCCGTCCGC CGACGCCACC GCTCCCAAAG AAGCCGAGGC GGGGCAAGGG GCAGCAGGCC CCGGTCTACC AGGTCAAGGT CGGGCTGCGC GGCACGAAAC CACCGATCTG GCGACGGCTG GAGGTGCCAG CCGACACCAA CCTGGCACGC CTGCACACGA TCATCCAACT GGCGTTCGAC TGGGAAGACA GCCATCTGCA CGTCTTCGAG ACGCCTTACG GCAGATTTGG CACTCCGGAC GTGGATCTTG GCCACCGCGA CGAGAAGTCG GTGTCGCTGG AGCAGGTGCT TCCCGACGTC AAAGCAAAGA TTAGCTATAC CTACGATTTC GGCGACTCCT GGGAACACGA GATCGCTCTG GAGAAGATCC TCGAACGCAG CCCATCTGTC CGGTATCCGC GCTGCACCGG CGGGCGTCGC GCGGCCCCGC CGGAGGACTG CGGCGGTATC TGGGGCTACG AGGCGCTCCT GCAGATCCTG GACGATCCCA GTCATCCCGA GCACCACGAG CGGCTGGAAT GGCTGGGCCT CGACGACCCC GCCGACCTCG ACCCCACCGA GTTCTACGGA GCCGGAGTGA CGGCCGCACT GTCCCGGCTC CGCTGA
|
Protein sequence | MSPVSRGRND RKKPSSKGPA PARRASRPAP SSVPEIRNQS PRPHDDREAE LRAFTTDMLS MGGVLRESDD PLMAEVVGAT FAVLDDLTDG SVGEAFLEEI VPRLETAATG DALAVLLAIG AVVPGPVSAT ADAAAGRLTA AGVSLPQWAD ELTVPPLAGD FQRRCDDEGI AVFLSCTVER AGRRHAFLVC VDPLACGEAG DLLVLPAEDL PRVLEGVTAE ARKNKVRLRT ETLDAEEFRW QVENALNARA VHDDELSFDG DDGEAASFLR FGDEDEIPFG EEDGEGGPPY EAVALVLRSR LATLPTPRRP PAPHADDDGD EGIDALGGMA DLVKMLNERG ANPAELALAN LTGWRGLPAT GRPPTPPLPK KPRRGKGQQA PVYQVKVGLR GTKPPIWRRL EVPADTNLAR LHTIIQLAFD WEDSHLHVFE TPYGRFGTPD VDLGHRDEKS VSLEQVLPDV KAKISYTYDF GDSWEHEIAL EKILERSPSV RYPRCTGGRR AAPPEDCGGI WGYEALLQIL DDPSHPEHHE RLEWLGLDDP ADLDPTEFYG AGVTAALSRL R
|
| |