Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5092 |
Symbol | |
ID | 5673427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6094379 |
End bp | 6095338 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243943 |
Product | PLP-binding domain-containing protein |
Protein accession | YP_001509357 |
Protein GI | 158316849 |
COG category | [R] General function prediction only |
COG ID | [COG0325] Predicted enzyme with a TIM-barrel fold |
TIGRFAM ID | [TIGR00044] pyridoxal phosphate enzyme, YggS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.340872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00296969 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGGCC TCGAGCGCGA GGTGGACCCG CGCCGGCTGG CCGAGTTGCG CGAGCGGCTG ACAGTCGTCC GTGACCGGAT CACCGTGGCG GCCCGGGACG CCGGCCGCGA CCCGGCGGAG CTGACCCTGA TCGCCGTCAG CAAGACCCGA CCCGCCGAGG ACGTCCTCGC GCTGGTCGCG CTGGGTGTCC GCCACTTCGC CGAGAACCGC GAGCAGGAGG CCGGGCCGAA GACCGAGGCC GTCCGGCTGG CGCTGGCCGC CGCCGCGGCG GGCGGCTCGG ATGAGCACAC GATGTGGTCC GATGATGACC GTTTCCGGGA ATCGAGTCCG TTGGCCGACA CGTCGGGGGC ATGGCGTCCA TCACGTGTGG ATCAGGGTCC GACCCTCGGG CCGGCCGCCG GGCCCGATGT GCCGGTCTGG CATTTCGTCG GCCAGCTCCA GCGCAACAAG GCCCGGGCCG TGGCGGGCTG GGTGCACTGC GTGCAGTCGG TGGACCGAGC GCGACTGGTC GAAGCCCTTT CCCGGGCAGC AACGGCACGT GGTCGTCAGA TTTCAGTGTG CATACAGGTC AATCTGGACA TTGCGGAGCC AGTCGGTGCA CCGGGTGCGC CCGCGGCGCG AAATGCCCTG TCTACCAGGG AAAACGAGCC CACCGCCGGG GCTGCGGACG GCGCTCGCGG CGGGGCCCTG CCCGCGGACG TACCGGCGCT CGCCGAGCTC GTCGCGGGCT CCGAAGGGCT TGTTCTGACG GGTGTGATGG CTGTCGCGCC ACGCGGAGCA CCCGCCAGGC CGGCCTTCGC TCGACTCCGT GAGGTAGCCG ACCGACTTCT TTCCGAACAT CCCGCGGCGA CGCTAGTCAG TGCAGGGATG TCCGGCGATC TCGAGGACGC GGTCGCGGAG GGCGCGACAC ACCTTCGGAT CGGCACCGCT TTGTTCGGTG AACGGCACGG TGTCCCTTAG
|
Protein sequence | MSGLEREVDP RRLAELRERL TVVRDRITVA ARDAGRDPAE LTLIAVSKTR PAEDVLALVA LGVRHFAENR EQEAGPKTEA VRLALAAAAA GGSDEHTMWS DDDRFRESSP LADTSGAWRP SRVDQGPTLG PAAGPDVPVW HFVGQLQRNK ARAVAGWVHC VQSVDRARLV EALSRAATAR GRQISVCIQV NLDIAEPVGA PGAPAARNAL STRENEPTAG AADGARGGAL PADVPALAEL VAGSEGLVLT GVMAVAPRGA PARPAFARLR EVADRLLSEH PAATLVSAGM SGDLEDAVAE GATHLRIGTA LFGERHGVP
|
| |