Gene Information |
Locus tag | Franean1_2516 |
Symbol | |
ID | 5670912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2995552 |
End bp | 2996532 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241433 |
Product | hypothetical protein |
Protein accession | YP_001506854 |
Protein GI | 158314346 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0235] Ribulose-5-phosphate 4-epimerase and related epimerases and aldolases |
TIGRFAM ID | |
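The coordinate and length fields above are mutually consistent, and the arithmetic can be checked directly. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one stop codon. Both conventions are assumptions that happen to reproduce the values in this record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2995552, 2996532)
print(length_bp)                  # 981
print(protein_length(length_bp))  # 326
```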
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGCTTT CCACCCATGA CGCGCCGGCA ACGGCAACCC CGTCGGTAAC GGCAACGCCG CCGGCGCCGG ACGTGGCCCC CGGGGCGTCC GACGGCGAGC GGAAGAGCCC GTGGGCCGGG CTGGTCGCGC CGGCCGAGGG CGGGGAGTGG CCTCGCCCGG TGCCGGTCAG GACGGTCAAG GAGGAGCGGC TGCACCGCAA GCGGAAACTC GCCGCCGCCT ACCGGCTCTT CGCCAAGCTC GGCATCGCCG AGGGACTGGC CGGCCACATC AGCGCCCGCG ACCCCGAGCT GACCGACCAT TTCTGGGTCA ACCGGCTCGG CCTCGACTTC GGCAGGATCA AGGTCTCCAA CCTGCTGCTG GTCGACGACA AAGGGGAGAT CGTCGAGGGT AAGCCGCCGC TGAACAGGGC GGCGTTCACC ATCCATTCGC AAATCCACGC CGCCCGGCCG GACGTTGTCG GTGCCGCGCA CACCCATGCC CTGTACGGAC GGGCACTCGC CGCGATCGGC GAGCCGCTGC ATCCGATCTC CCAGGACTCC CTCGCGTTCT ACCAGGACCA CGTGATCTTC GACGAGTACA ACGGGGTCGT GCTGGACGAG GAGGAGGGCC GGAAGATCGC CGCCGCGCTC GGCCCGCACA AGTTGGCGAT CCTGCGCAAC CACGGCCTGC TCACCGTCGG CACCAGCGTC GAGGCGGCGG CGTACTGGTA CATCGCCGCG GGCGGGCGGC GAGGACGCAG CTCGTCGCGG CGGCGGCCGG GACGCTGCGC CTCCTGGACC ACGAGATCGC CAGCGCCACC GCCAGTCAGG CGCGAGGTGA CGAGGGCGCC CGCTGGTCCT TCGAGGCTCT CTACGAGATC ATCGTCGAGG AACAGCCCGA CCTGCTCGAC TAGCCCGACC AGCCCGACCT CGACTAGCTC GCTGGAAGGG TGCCGCCCCG GCCCGGTCGG GGGATCAGCG GGGGCGACGC GTCGACGGTG A
Protein sequence | MTLSTHDAPA TATPSVTATP PAPDVAPGAS DGERKSPWAG LVAPAEGGEW PRPVPVRTVK EERLHRKRKL AAAYRLFAKL GIAEGLAGHI SARDPELTDH FWVNRLGLDF GRIKVSNLLL VDDKGEIVEG KPPLNRAAFT IHSQIHAARP DVVGAAHTHA LYGRALAAIG EPLHPISQDS LAFYQDHVIF DEYNGVVLDE EEGRKIAAAL GPHKLAILRN HGLLTVGTSV EAAAYWYIAA GGRRGRSSSR RRPGRCASWT TRSPAPPPVR REVTRAPAGP SRLSTRSSSR NSPTCSTSPT SPTSTSSLEG CRPGPVGGSA GATRRR
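The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any computation. As a minimal sketch, assuming plain A/C/G/T input, the helpers below (hypothetical names, not part of the source database) strip the spacing and compute a GC percentage. Applied to the full gene sequence, this is one way to compare against the GC content field reported in this record. The demo uses only the first three blocks.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence in this record, used as a short demo.
demo = clean_sequence("ATGACGCTTT CCACCCATGA CGCGCCGGCA")
print(len(demo))  # 30
print(round(gc_percent(demo), 1))
```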