Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1708 |
Symbol | |
ID | 5670110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2039887 |
End bp | 2041662 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641240626 |
Product | hypothetical protein |
Protein accession | YP_001506052 |
Protein GI | 158313544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.864094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0229836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCCG CCTCACGATC CACATCCCTG CCGGGGATTA CCCCGTTCGC GCTGGCCGGC CCCGTGCTGG CCGCCGGTGG CAAGGATCAG ACGGGCCTGC ACCTGCTCCT CGCCTTCGGC CTGTTGCTGC TGGTCGTGGC AGTCGTGGGC GCGATGCGAC GGGCCTGGCG CGGCCGTGCG CAACAGCAGG AGGAGAACCT GCCGGACCTG CCCGAGCCGC CGGAGCAGAC CGGGAACGTG CTGGCCGCCC CGCTGCGTGG GCGCTACCTC GGCACGGTGG ACGCCGGCCA CTGGCGGGAG TGGATCGCCG CGCGCGGCCT GGCCGGGCAC GACGGCGACT ACATCGCCGT CTACGAGCTT GGAGTGCGGG TCGACCGGGA CGGCGAGGCG TTCTGGATCC CCCGGGAGGC CGTGCGGGGC GCCCGCCTCG AGCGGGCGCA CGCCGGGAAG GTCGCCGCGC CGAGCCGGCT GATCGTGGTC GCGTGGTCGT TCGAGGGCCG GGAGCTGGAG GCCGGCTTCC GCGGCGAGGA CCGGGCCCGC CAGCCGAAGG TCGTCCGCTC CGTGCACGAC CTCATCGGGC CGGCGCCCGC CCAGCCGATG TCCGGCGACA TCACCTCGCC GCACGCCCTG CCCCGGCCGC GCAACCGGCT GCGGCCCCGC GTGCCGGCGC CGGCACGCCC GGCCGAGCCC GGCGCGCCGG CCGCCGCGGC CCCCGCCCGC GGGCAGCGTC ATGATCTCGC CGCGGTGGCC CCGGGCGGGC CCGCGACGAT GCCGATCCCG GTCAACGGCC GCCAGCCCCG GTCGGAGCGG GCCGGGTGGC GCCGCGGTGG CGCCGCGGCC AGTCAGGGCG CCGCCGCCGA GACCCACGCC GGCCAGCGCG GATACAGCGC GGACGCGCCC GGCCCGGTCG CCTCCGGCGG CTACGACACC GCTGCCCACG GGACGGGTGC CCTTAGGACG GGTGCCCAGG ACACGGGCGC CTACGACATC CGCGCCCACG TCACCGGCGC GCACAGCATC AGTACCGACA GTACCGGCGT CCACGGCACC GGCGCCCACG GCGCCAGTGC GTACGACACC GGTGCGTACG ACACCGGCTC CCACGGTGCT GGCGGGTACG GGACGGGCTC GCACCGGACG GGTGCGTACG ACACCGGCGG TTACCGGCCA GGTGCCCACG ACGTGGGCGC CCAGGGCTCC CAGGCCGTGA GCGGCGGGAC GGCCGGCTAC GACACCGCCG GCTACAGCAC AGCCGGCTAT GACACGGCCG GGTACGGGAC TGACAGCTAC GACCTGCGTC GCCAGGGCAC CGGCGGGCTG GACGCGCGCG GCCACGCCAC CGGCGGGTAC GGGCGCCCCG GCGCCCCCGG GCCGGCCGCT CCCGACCAGG GGGGGCTCGA TTCGGCCGCG TACCACCTGG GTGCCGGCGA CACCGGCGTC CACGGCTCGG GTGCCTACGG CTCGGGTGCC TACGGCTCGG GTGCCTACGG CTCCGGTGCC AACGACACGG GTGCCCACGA CTCGCGTGGA TACGGCCAGG GCGCATACGG CCAGGGCCGG CGCGACCCGG GCGGACGTGA CCAGGGTGGA TACGACCGGG AGGCGTACGC CTCCCGCGGC CGGGCCGGCA CGCCGCCGGC CGTCGGCGCG CCGGGTGACC AAGGGAACTA CTGGCGGACC GGCGCCGACC CGGCGGAGCG GCCGCGAGAC CAGGGGACCG ACGCGTTCAC CGCACCGCCC GGTGAGGCCT CGTACCGACG GGAGGAGTAC CCGTGA
|
Protein sequence | MTSASRSTSL PGITPFALAG PVLAAGGKDQ TGLHLLLAFG LLLLVVAVVG AMRRAWRGRA QQQEENLPDL PEPPEQTGNV LAAPLRGRYL GTVDAGHWRE WIAARGLAGH DGDYIAVYEL GVRVDRDGEA FWIPREAVRG ARLERAHAGK VAAPSRLIVV AWSFEGRELE AGFRGEDRAR QPKVVRSVHD LIGPAPAQPM SGDITSPHAL PRPRNRLRPR VPAPARPAEP GAPAAAAPAR GQRHDLAAVA PGGPATMPIP VNGRQPRSER AGWRRGGAAA SQGAAAETHA GQRGYSADAP GPVASGGYDT AAHGTGALRT GAQDTGAYDI RAHVTGAHSI STDSTGVHGT GAHGASAYDT GAYDTGSHGA GGYGTGSHRT GAYDTGGYRP GAHDVGAQGS QAVSGGTAGY DTAGYSTAGY DTAGYGTDSY DLRRQGTGGL DARGHATGGY GRPGAPGPAA PDQGGLDSAA YHLGAGDTGV HGSGAYGSGA YGSGAYGSGA NDTGAHDSRG YGQGAYGQGR RDPGGRDQGG YDREAYASRG RAGTPPAVGA PGDQGNYWRT GADPAERPRD QGTDAFTAPP GEASYRREEY P
|
| |