Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4063 |
Symbol | |
ID | 5672421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4843622 |
End bp | 4845349 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242939 |
Product | hypothetical protein |
Protein accession | YP_001508356 |
Protein GI | 158315848 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.824829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGTGC AGAAGGACGC CTTTCCAGGG CAGGACACCG AGGTCCGCCC CGCGCCGTCG GAAATCCCGG GCCCTGCGCC CGTCCGGGAG CAGGCGGCCC GGGAAATCCG TCTCGACAGC CGCGAGACCG TGCAGACTCC CACCGCCGTG GTCTTTCTCA CCGAGGACCG CGCGTACAAG CTGCGCCGGG CGGTCAACCA CGGTTTCGTG GACTACCGCT CCCGCCGGGC CCGGCTGATC GCCTGCGAGG ACGAGGTACG GCTCAACCGG CGCCTCGCCC CGGACGTGTA CCTCGGCGTG GCCGACATCC GGGACGAGAC CGGGGCACTG CGCGACCACA TGGTCGTCAT GCGACGGCTG CCGGCCGACC GCCGGCTCTC CGCGCTGATG ACAGCCGACG TCTCCGGCGA GCTGCGTGAG CTCGCCCAGC GGATCGCGGC GTTCCACGAA GGGTGCGAGA CCACGCCCGA GATCACCCGC ACCGGTGGTC TGTGCGCGTT GGAGGCACTC TGGCTGGAGG CGATGGACGG CCTCGCGCCG TTCCGCGGCC GGATCCTCGA CGCCGCCACC GTCGACGAGA TCGGCCGGCT CGCGCTGCGC TACCTGACCG GCCGGGGCCC ACTACTCGCG GAGCGCCAGG CCGCCGGCCG GATCCGCGAC GGCCACGGCG ACCTGCTCGC CGACGACATC TACTGCCTGA ACGACGGCCC GCGGGTCCTC AACTGCGTCA ACGTCGACCC CGCGCTGCGG GCCGGTGACG TCCTCGGCGA CGCAGCCTCC CTCGCGATGG ACCTCGAACG GCTCGGCAAC GCCACCGCGG CCCGGACGTT CCTCGACGCC TACCGTGAGT TCTCGGGCGA GACCCATCCA ACGTCGCTGG AGGACCTCTA CATCGCCTAC CGGGCGGTCG TCCGCGCCAA GACCGCCTGC GTCCGCGACC ACCAGGGTGA CCCGGCCGCC GCCGACGAAG CCCGCCGGCT CACCGACCTC GCGCTACGCC ACCTACGGCG CGGCCGTCCC CGGCTCATCC TGGTCGGCGG CCTGCCCGGC ACCGGTAAGT CGACACTGGC CAGCCATCTC GTCTCCGGCG AGGATGACTG GGTGCTGCTG AGCTCGGCCG CCGTCCGCGG CGAGCCCGTC GGAGCGGGCG CGACCGCCCC CGAGTCCGCC TCCACTTCCG CCTCCGATTC GGCCGGGACG GAGCCCGCGG CGGGGTGCTA CGGCGCGGAC GCGACCGAGC ACAGCTACGT GGAGGTGCTC ACGCGGGCCC GCCACGCGCT CGAAAGAGGG AGGAGCGTCG TGATCGATGC CTCCTGGTCC TCGAGGCGGA TGCGCGCACG GGCCGCCGAG CTGGCGGCGG AGTGCGACGC CGACCTGATG CAGCTGCGGT GCGTGGTCCC GCCCCGGGTC GCGGTCGCCC GCATAGCCGA CCGCGCGACC GTCCCCATCG CACTCGGCTC CACCACGGAC CGGTCCGGTC CGGACCACTC GACCGCGACC CGTTCCGCTC CGGCGGGCGT CGTTCCGATC GGTGCCCTTC CGATCGGCGC CGTCGCGGAT GGTGCCACCA CCGGGCACCG CGCCGACCTG GTCAACGCCC TGACTCCCAA CGAGTGGATC TACCTGGACG TGGCCACCCG CACCGATCCG TGGCCCGACG CACGCGACAT CGACACCTCC GCCCCAGCCG AACACGCAGT CACCGCCGCG TACCTTCTGA TCAACTAG
|
Protein sequence | MFVQKDAFPG QDTEVRPAPS EIPGPAPVRE QAAREIRLDS RETVQTPTAV VFLTEDRAYK LRRAVNHGFV DYRSRRARLI ACEDEVRLNR RLAPDVYLGV ADIRDETGAL RDHMVVMRRL PADRRLSALM TADVSGELRE LAQRIAAFHE GCETTPEITR TGGLCALEAL WLEAMDGLAP FRGRILDAAT VDEIGRLALR YLTGRGPLLA ERQAAGRIRD GHGDLLADDI YCLNDGPRVL NCVNVDPALR AGDVLGDAAS LAMDLERLGN ATAARTFLDA YREFSGETHP TSLEDLYIAY RAVVRAKTAC VRDHQGDPAA ADEARRLTDL ALRHLRRGRP RLILVGGLPG TGKSTLASHL VSGEDDWVLL SSAAVRGEPV GAGATAPESA STSASDSAGT EPAAGCYGAD ATEHSYVEVL TRARHALERG RSVVIDASWS SRRMRARAAE LAAECDADLM QLRCVVPPRV AVARIADRAT VPIALGSTTD RSGPDHSTAT RSAPAGVVPI GALPIGAVAD GATTGHRADL VNALTPNEWI YLDVATRTDP WPDARDIDTS APAEHAVTAA YLLIN
|
| |