Gene Information |
Locus tag | Franean1_0611 |
Symbol | |
ID | 5669028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 709944 |
End bp | 711356 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239538 |
Product | hypothetical protein |
Protein accession | YP_001504976 |
Protein GI | 158312468 |
COG category | [S] Function unknown |
COG ID | [COG4222] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
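
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so, but the numbers are consistent with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(709944, 711356)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1413 470
```

Both values agree with the Gene Length and Protein Length rows of this record.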
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGGGTA CTGGGGGGAA GACCGCGGCG CGCAGAGCGA CCCGCGTGTC GGCGGTCGTG GGTTCGGCGG TCATGGGCAT CGCGCTGGCG GTGCCGTTCG CGGTCGCCGT CGGACCGGCG GCGTCGGCGG CGTCGGCGGC GGCACCGGTG GTGTCCGCGG GCGGGGACGC GGCCAGGGGG GCCGGATCGA CCGTGCCGAC GGTGCTGTCC ACCGCGACGC TGCCCGACAT CCCGTTGGCC GACTTCTCCA ACGGGCTGAT CCGGGGATCG GTCGACACCG ACCGCGGGGT CGACCTCGGC GGGATCGGCA GTGACCTGTT CCCGGCCGGC CGCCCGAACG AGTTCTGGAC GATCACCGAC CGGGGCCCGA ACGGGCAGAT CAAGATCGAC GGTAAGAACC GGCGGACCTT CCCGGTGCCC GGCTTCGACC CGGCCATCGT GCGGGTCCGG GCCGACGGTT CGACGATCAA GGTGCTGGAC GCGCTGCCGA TCACCACCGC GCACGGCAAG CCGGTCACCG GGCTGTCGAA CATCAACGGC TTCGACGAGA CCCCTTACAC CTGGGACGCC CAGACCCCGC TCCCGTTCGA CCCGAACGGT CTGGACACCG AGGGGCTGAT CCGCACCCGG TCCGGCGAGT TCTGGCTGGT GGACGAGTAC AGCCCGTCGC TGCTGCGGGT GAGTGCGCGC GGCCAGGTCC TGGCCCGTTA CATCCCCGCC GGCGTCAATC TGACCGGGGC TGACTACCCG GTGGTCGCCT CGCTGCCGGG CGTCCTCGGC GGGCGCAAGA TCAACCGTGG CTTCGAGGGC ATCGCGCTGG CTCCGGACGG GCGCACGCTG TACCTCGCAG TGCAGAGCCC GCTGCAGCTG CCCGACGCCG GCACCGGCAA CGCCTCGCGC AACGTGCGGA TCTTCCGGTT CGACACCGCC AGCAGCAAGG TGACCGGCGA GTACGTCTAC CGCTTCGAGG ACGTCGCCAC CTTCGACCCG GACGCCGACG GTGACCCGTC CGAGATGAAG ATCTCCTCGC TCGCCGCCGT GGACGGTAAC ACCCTGCTGG TCAACGAGCG GACCGACGCC GTCTCCCGGC TCTACAGCGT CGACCTGCGC CAGGCGACGA ACATCCTCGG CAGCCGGTGG GACCACACGG CGACCGCTCC GTCCCTCGAG TCGCTGGCCG ACCCGGCGAA GGCTGGCGTC ACCGTGTTGC CGAAGCGTCT CGCTGTCGAT CTCGAAGGCG TTGCGGGCAT GCCGGACAAA ATTGAGGGCA TCGCGATCGT TGATCGACGG ACGATCGCCG TCGCGAACGA CAACGACTTC GGGCTCGGAT CGTTCAACGA GGCCGGCCAG CTCGTGGATT CGGGTGTCGA GAGCAAGATT CTCCAGCTGC GGCTGAACCG ACCGCTCGGC TGA
Protein sequence | MSGTGGKTAA RRATRVSAVV GSAVMGIALA VPFAVAVGPA ASAASAAAPV VSAGGDAARG AGSTVPTVLS TATLPDIPLA DFSNGLIRGS VDTDRGVDLG GIGSDLFPAG RPNEFWTITD RGPNGQIKID GKNRRTFPVP GFDPAIVRVR ADGSTIKVLD ALPITTAHGK PVTGLSNING FDETPYTWDA QTPLPFDPNG LDTEGLIRTR SGEFWLVDEY SPSLLRVSAR GQVLARYIPA GVNLTGADYP VVASLPGVLG GRKINRGFEG IALAPDGRTL YLAVQSPLQL PDAGTGNASR NVRIFRFDTA SSKVTGEYVY RFEDVATFDP DADGDPSEMK ISSLAAVDGN TLLVNERTDA VSRLYSVDLR QATNILGSRW DHTATAPSLE SLADPAKAGV TVLPKRLAVD LEGVAGMPDK IEGIAIVDRR TIAVANDNDF GLGSFNEAGQ LVDSGVESKI LQLRLNRPLG
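
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It builds the codon table from the NCBI-ordered amino acid string, which translation table 11 shares with the standard code (the two differ only in their permitted start codons). It does not special-case alternative start codons such as GTG or TTG, which is harmless here because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues in NCBI codon-table order: first base varies slowest, third fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTCGGGTA CTGGGGGGAA GACCGCGGCG"
print(translate(first_blocks))  # prints: MSGTGGKTAA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.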