Gene Information |
Locus tag | Franean1_0754 |
Symbol | |
ID | 5669170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 879921 |
End bp | 881171 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239681 |
Product | hypothetical protein |
Protein accession | YP_001505118 |
Protein GI | 158312610 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
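
The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of three")
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(879921, 881171)
    print(length)                  # 1251
    print(protein_length(length))  # 416
```

Under these assumptions the values agree with the record: 1251 bp for the gene and 416 aa for the protein.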
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.200233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000521831 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCGAGC CCGACCCCCT CTCCGCCCCC GCGCGCGACG TCGTGTTCAC CCTGTCCAGG GAGACGCTGC GGGACATGGC GATCCGCTCC TACATGCGGC CGCCGGACCG CGTCCTGCTC ACCCTGATGC AGTCGCCGCG GGTGCGACGG CTGCTCGTCG CCGAGCCGTT CCGCAGCAGG GTGACAGCGT TGGCGAAGGG CGACCCGGTG GTGGCCATTC CACCGACGAG CAGGCCGGAC CGCTACGTCG TGTCCGCGCG CCGCTGGCGT CGCGAGGACC CGGTCTCGCC CGCGATGCTG CGTCACACCT ACCGGCGCTA CGACCGGGTG CTGCGACGGG CCGCGGAGCA GGCCCGCTGC GAGCGGCCCG TGGTCGTCAC GACGTACCCG CTGCTGGCCG GGCTCGCCGA GCTGGAATGG GCCGACTCGG TCGTCTACTT CGCCCGTGAC GACTGGGCGA CCTACCCCCC GCTGGAACGC TGGCACCCGG CGTTCCAGGA GGCCTACGCC GAGATCCGCC GCCGGCGGCG GCCGGTGGTC GCGGTGTCGG AGTCGCTGCG CCGGCGGCTC GCTCCCACCG GAGGCTCACT GGTCGTCCAC AACGGCGTCG ATCCCGCCGA GTGGGAGCGG CTGCCGCCGG CACCCGCGCT GATCGCGAGC CTGCCCCGGC CGTGGTGCGT CTACGCGGGC ACCGTCGACG ACCGGCTCGA CGTCGACATG GTCGCCCGGC TGGCGACGGA GAGCACGGTG ATACTCGCCG GGCCCGTCAA GGACGAGCGA CACGCCGCCC CGCTGCGGGC GTTGCCCTCG GTCGTCCTGC CAGGCCATCT CCCCCGGCCG GCCGTCACCG GGCTGATCGC CGCGGCGGAC GTGTGCCTGC TACCGCACCG GGTGACCTCG CTGACGGAGG CGATGGATCC GATCAAGCTG TACGAGTACC TGGCGGCGGG GCGGCCCGTC CTGGCGAGCG ACCTGACACC GGTCCGGGGC ATGGGACCGC GGGTCAGGCT GCTGGCGCCG GGCGACGATC CGGTGGCCGC GTTCAGGGAG GTCCGCGGCT GGCCGGAGGT GACCGAAGCG GAACGCCACC GGTTCGTCGC AGCGAACAGC TGGTCAGCCC GTCATATGGA GCTGCTGGAC TTCGCTCTTG GCGGCGATCC GGAGCGGCAG CGGCGGCCCG CGGCCCGGCC GGCCCCGCCG ATCGAAGGCC ACGCTAGGGC GACGTCAGCG AGGGCCGGCG AGGCGACGTG A
|
Protein sequence | MPEPDPLSAP ARDVVFTLSR ETLRDMAIRS YMRPPDRVLL TLMQSPRVRR LLVAEPFRSR VTALAKGDPV VAIPPTSRPD RYVVSARRWR REDPVSPAML RHTYRRYDRV LRRAAEQARC ERPVVVTTYP LLAGLAELEW ADSVVYFARD DWATYPPLER WHPAFQEAYA EIRRRRRPVV AVSESLRRRL APTGGSLVVH NGVDPAEWER LPPAPALIAS LPRPWCVYAG TVDDRLDVDM VARLATESTV ILAGPVKDER HAAPLRALPS VVLPGHLPRP AVTGLIAAAD VCLLPHRVTS LTEAMDPIKL YEYLAAGRPV LASDLTPVRG MGPRVRLLAP GDDPVAAFRE VRGWPEVTEA ERHRFVAANS WSARHMELLD FALGGDPERQ RRPAARPAPP IEGHARATSA RAGEAT
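
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned up and its GC content computed is shown below. It assumes GC content means the percentage of G and C among all bases; the database's own rounding and handling of ambiguous bases are not stated in this record, and the function names are hypothetical.

```python
def strip_blocks(display_sequence: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(display_sequence.split()).upper()


def gc_percent(display_sequence: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) DNA sequence."""
    sequence = strip_blocks(display_sequence)
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


if __name__ == "__main__":
    # Short illustrative input, not the gene sequence from this record.
    example = "ATGC GGCC"
    print(strip_blocks(example))  # ATGCGGCC
    print(gc_percent(example))    # 75.0
```

Applying `gc_percent` to the Gene sequence field and rounding to a whole number gives a value that can be compared with the GC content reported in the Gene Information table.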