Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1603 |
Symbol | |
ID | 5670006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1918421 |
End bp | 1919995 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240522 |
Product | hypothetical protein |
Protein accession | YP_001505948 |
Protein GI | 158313440 |
COG category | [R] General function prediction only |
COG ID | [COG4908] Uncharacterized protein containing a NRPS condensation (elongation) domain |
TIGRFAM ID | [TIGR02946] acyltransferase, WS/DGAT/MGAT |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTGC TCGACGCGAT CACGAACATG GGTCACACAC CGGCGCACGT CCGCCCGCTG ACCCCGGGCG ACCGCGCCTA TCTCGCCTTC GTCCGGCGCA ATCCCGGTGA GCACCAGGAC ATCGGCGCGC TGCTGCACTT CGACGGCCCC CCGCTGGACC TGCCGGCCCT GCGGGCCCAC GTGGCCGAGC GGCTGCGCGA CCCGCGGGCC CGGATGCTGA CCGACCGTCT CGACACAGTG CGGGTCTGGT CCCCCGAGCG GGGCGCGTCG GCCGAGGAGA CCCGCTGGGT CTCCGACCCC GACATGAACC TCGACGACCA TGTCGTCGCC TTCGACCTGC CCCCCGCAGA CGGGGGCGCC GCGGCGGGGC AGTCCGCCGA CGCGTCCCAC GACGCCCGGC TGCGGGCCGC CGTCGACGCG ATCGTCGCCC GGCCGATCGA CCTCACCCGG CCGCCGTGGA TGCTCTACCT CCTGCGTGAC CCGACCCCGG GGGCGACTGG TACCGCGCTG GTCTACCGCT CCAGCCACGT CCAGCAGGAC GGCTTCGCGC TCTACCGGGT GATGTACCTG CTGTTCGGTG AGAGCGACGA GGTCGATCTC GGGCTGGCGC CGACGATCCG CCGCCCCCGC CCGGCCGACT ACGCCCGGTT CGTCGGCCGC GGGATCTCCT GCCTGCTGCC GACCCGGCGC CTCGAGTCCT GGGGCGGCCC ACCGAGCGGC CCGGCGAGAC TTACCTGGGT GACCACGGAG CTCGCCACGC TGCGCGCGGT GGCCCGCCAG CACGGCGTCA CCGTGAACGA CGTCTACCTG GCGGCGCTCG CCGGGGCGCT GCGCGCCTGG TCGCTGCCGG AGTGGGAGCG CAGCGGCCGT CAGCTGCACG CGCTGATGCC CGTCAGCATC CGCTCCGCGG CCGAACAGGA CGTCCTGTCG AACCACAGCA CCGGGGCGCG CGTCCCGCTG TTCTGCGGTG AGCCCGACCC GGCCCGGCGG GTGGCCATGA TCGCGGCGGA GACCCGCCGG ATGAAGCAGG GCGGGCTGGG CCTGGTGGAG CGCCAGCACT TCCCGCTCAT GGCGGCGAAG GCCTCCCAAC GCATGCTCGC GAACGTCGGC AGCTATCCGG CCCAGATCAA CAAGATGGCG CTGGTGGCGA CCAACGCCCG GTCGATCCGC GGCCCGCTCT CCATCGCCGG CCGCCGGATG ACCGGGCTGA TCGGCATGGG CCCGCTGCTC GTCGGACGTC AGCACCTGGC CGTGGCGATG TTCGGCGTGG ACGACCGGGT CGGGGTCACG TTCGTGGCCA GCGAGAGCGT CCCGGACCAC GCCCGGCTTG CCGACCTGTG GCTGGCCGAG CTGGCCGCGC TGGGCCGGTC CGACTCGCCC GTCGGGGTGA GCGTGCCGAC CCAGCGCCTG TCCTCGGCAT CGGCCGCGGT GACGGCCGTC ATGGCCGGCG CGGGGCCCGG CGCTGTCTCG GGTGCCGTCT CAGGTGCGGT GACCGGTGCC GTCCGGACCG AGGTCGGCGC CCTCATCCGC CCCTGGCGCC GCCGCCCCAC CACCCCGGCA GCCCCCGCCA TGTAA
|
Protein sequence | MGLLDAITNM GHTPAHVRPL TPGDRAYLAF VRRNPGEHQD IGALLHFDGP PLDLPALRAH VAERLRDPRA RMLTDRLDTV RVWSPERGAS AEETRWVSDP DMNLDDHVVA FDLPPADGGA AAGQSADASH DARLRAAVDA IVARPIDLTR PPWMLYLLRD PTPGATGTAL VYRSSHVQQD GFALYRVMYL LFGESDEVDL GLAPTIRRPR PADYARFVGR GISCLLPTRR LESWGGPPSG PARLTWVTTE LATLRAVARQ HGVTVNDVYL AALAGALRAW SLPEWERSGR QLHALMPVSI RSAAEQDVLS NHSTGARVPL FCGEPDPARR VAMIAAETRR MKQGGLGLVE RQHFPLMAAK ASQRMLANVG SYPAQINKMA LVATNARSIR GPLSIAGRRM TGLIGMGPLL VGRQHLAVAM FGVDDRVGVT FVASESVPDH ARLADLWLAE LAALGRSDSP VGVSVPTQRL SSASAAVTAV MAGAGPGAVS GAVSGAVTGA VRTEVGALIR PWRRRPTTPA APAM
|
| |