Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0699 |
Symbol | |
ID | 5669116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 818127 |
End bp | 819197 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239627 |
Product | hypothetical protein |
Protein accession | YP_001505064 |
Protein GI | 158312556 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.72272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCGGTC TCGATCTTTC TCGGCGGTTT TACGAGCAGG CGGTTCGGCC GCTGGTCCAG GGCGTGCCTC ACGCTGCGGC GTTGCTCGGT GAGGGTTCCG AGGTACTCGG CTTCGACGAC GAGGTCTCCG CGGACCACGA CTTCGGTCCC CGAGTGCAGC TGTTCGTCCT GCCGGATCTC GACACCACGC CGATCGACAT CGCTCTGGAG CGGCTGCCGG CCTGGTTCGA GGGTTTCCCG GTGGTCTATC CCGACAGCGA CCGACACGAC GGCCGGCCGC ACCACCAGGT GGAGGTGACC ACCGCGCGGG CGTTCGTCGT CGACCGGCTC GGCGCCGACC CCGCCGACGG AATGGAGCTG GTGGACTGGC TGCTGGCCCC GACCCAGATC CTGGCCAGCC TCACGGGCGG CGTGGTCCTG CACGACCCGC TCGGTCTGCT TACCGCGCGG CGTCGGGCGC TGGCCTGGTA CCCCGACGAC ATCTGGCGCT ATGTCCTGGC AGCCGGCTGG CTGCGGATCA GCCAGGAAGA AGCATTCGTC GGTCGCGCCG GTGCCCGGGG CGATGACCTC GGCTCGCGAA TAATCGCTGC TCGGATTGCC CGCGACCTCG TCCGGATCGG ATTCCTCGTC GAACGTCGCT GGGCCCCGTA CAGCAAATGG CTGATGACCG CGTTCGCGCG GTCGACCCTC GCGGACCAGG TCGGTCGGCA CCTACGCCAC GCGCTCGGCG CGACGCGGTG GCAGGAACGC GAAGCGGCGC TGTGCGCCGC GGCCAGTGAT CTCGCCGCTG CCACGAACCG ACTTGGTCTG GCTGAGCCCG TCGACCCCGC GCCGCGCCGC TTTCACACCC GGGACATCCA CGTACTCGGC GCGCAGCGCC TGACCCGTGC GTTGACCGAC GCGATCCGCG ACCCGCGGCT ACGAGCCCTC CTCGCCCGAC TAGGGAACCG ACCCGACGGC CCGCTCGGCC AGCTCCCCGG CGCCATCGAC CAGGCAGTCG ACAGCGTCGA GATCCTCACC CGACCGAGCC GCCGCCGCGA CTACGCACCC GTCCTCGGCC TGCGGGCCTG A
|
Protein sequence | MFGLDLSRRF YEQAVRPLVQ GVPHAAALLG EGSEVLGFDD EVSADHDFGP RVQLFVLPDL DTTPIDIALE RLPAWFEGFP VVYPDSDRHD GRPHHQVEVT TARAFVVDRL GADPADGMEL VDWLLAPTQI LASLTGGVVL HDPLGLLTAR RRALAWYPDD IWRYVLAAGW LRISQEEAFV GRAGARGDDL GSRIIAARIA RDLVRIGFLV ERRWAPYSKW LMTAFARSTL ADQVGRHLRH ALGATRWQER EAALCAAASD LAAATNRLGL AEPVDPAPRR FHTRDIHVLG AQRLTRALTD AIRDPRLRAL LARLGNRPDG PLGQLPGAID QAVDSVEILT RPSRRRDYAP VLGLRA
|
| |