Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4899 |
Symbol | |
ID | 5673239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5882087 |
End bp | 5883853 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243754 |
Product | hypothetical protein |
Protein accession | YP_001509170 |
Protein GI | 158316662 |
COG category | [S] Function unknown |
COG ID | [COG5305] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0731796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0477636 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACG TCGCGGTGCG GCCGGCGGAA CACACCCCCG AGCCGGTCGA CAGGGTCGGA GCTCGACCCG ACCGCGAGCA CCGGATCTTC CGGGGCGCGC TGGTGCTGAT CATCGCTGCG GCGGTGGTGG TGCGCTTCAT CGCACGCCAG CCGCTGTGGC TCGACGAGGC ACAGAGCGTG GCGATCGCCC GCCTGCCGCT GTCGGGTGCG GCGCCCACCA TGTGGGACGG CCTCCTCCAG GACGGCTCGC CGCCCCTCTA CTACATCCTG CTGCACCTGT GGATCAGCGT CTTCGGCGAC GGCACCGCGT CGGTGCGCGG GATGTCCGCC GTCATCAACC TGGGCTCCGC GGTGCCGGTG TTCTATCTCG GACGCCGGCT CGTCGGTGAC CGCGGCGCGA AGGTCGCGGT GGTGCTGTAC CTCACCTCGC CGTTCGCGCT GTACTTCGGC ACCGAGACCC GGATGTACAG CCTGATCGTG CTGCTCACGG CGCTCGGCGG GCTGGCCCTG GAGCGGGTGC TGCGGGTGCC GTCGGTGAAG AACGTGGTGC TGCTGGCCGT GGCGTCGGGA TGCCTGGCCC TGACGCACTA CTGGTGCCTC TACCTGCTGA TGACCGTGGG TGCGTGGCTG GTCGGGCTCG TCTTCGTCCG GCCGCGGGTG GCGGCGGCGC GCGCCGCCCG CCGCGGTCGC GCCGGGCGTT CCGGGGCGCA CTCGCGCGGC GGGCGCCGTC CGGGCGGGCC GGCGCCGACG CCGGCTGGCG CATCCGAGCA GCCCACGGCC GCGCCCGAGC TGCTCGCCGA CGCCGCCGGC GGCGCACGCC GGGCCGTGCC CACCTGGCAC CCCCGCGGCC CGCTCGCCGG GATCATCGGG ATCATCGCCG GCGGCCTGGT GTTCGCTCCG TGGCTGCCGA ACTTCCGCTC CCAGCTCGCC CACACCGGGA CGCCCTGGGG CGAGCCGGCG AGCTTCGCCG CCGTCAGCCA CGCGTACGGG CAGTGGGCCG GTGGGCCGAC CACACTCGGC CGGCTGTTGC TGTTCCTGAT CACCGGCCTG GTCGCCGCCG GGATCGCCGG CCGTCCACTG GGCGGGCGGT TCGTCCTGCT CGACCTGAAG GGCCTCGAGC CGGGCCGCAC GCTGTTCTTC CTGGCCACCG GGACGCTGAT CGTCGCCGTG GCGGCCGGCA AGCTGGTGGG CAACGCGTGG GCCGACCGGT ACACCGCGAC GGCGTTCGTC CCGTTCCTGC TCGTGGTGGG ACTGGGCGCG ACAATGCTGG CCGACCGCCG GGTGTTCCAC GGCGTGGTCG CGGTCGCGGC GCTCGTCGGT GTCATCGCCG GGACGAGCGA CGTGCGGCGG GAACGGTCCC AGGCCGGCGA GGCCGCGGCC GTCCTCACCC GGCTGTCCAG GCCGGGCGAC ATCCTGCTGG TGTGCCCGGA CCAGCTGGGG CCGGGCCTCG CGCGCACCGT GCCGTCCTGG CTCAAGGTGT ACGTGGTGCC GACCTACGCC GCCCCGGACC GGGTCGACTG GGTCGACTAC GAGGAGCGCA ACGAGTCCGC CAACGGCGTG GCGATCGCCC AGCGGGCGAT CGCGGAGGCC GGGACGAACA CCGTGTTCAT GGCCGGCTCC GGCGCCTACC GGACCTACGA GGTGCTGTGC ACCCAGGTCC GGGCGACGCT GCAGACGCAG CGGCCGGTGG CCGACGAGGT GATGGAGCAG GGTCTGCCGG CACGGGTCTA CGAGAACTAC GCCCTGCTGC GGTTCCGGGC GTCGTGA
|
Protein sequence | MTDVAVRPAE HTPEPVDRVG ARPDREHRIF RGALVLIIAA AVVVRFIARQ PLWLDEAQSV AIARLPLSGA APTMWDGLLQ DGSPPLYYIL LHLWISVFGD GTASVRGMSA VINLGSAVPV FYLGRRLVGD RGAKVAVVLY LTSPFALYFG TETRMYSLIV LLTALGGLAL ERVLRVPSVK NVVLLAVASG CLALTHYWCL YLLMTVGAWL VGLVFVRPRV AAARAARRGR AGRSGAHSRG GRRPGGPAPT PAGASEQPTA APELLADAAG GARRAVPTWH PRGPLAGIIG IIAGGLVFAP WLPNFRSQLA HTGTPWGEPA SFAAVSHAYG QWAGGPTTLG RLLLFLITGL VAAGIAGRPL GGRFVLLDLK GLEPGRTLFF LATGTLIVAV AAGKLVGNAW ADRYTATAFV PFLLVVGLGA TMLADRRVFH GVVAVAALVG VIAGTSDVRR ERSQAGEAAA VLTRLSRPGD ILLVCPDQLG PGLARTVPSW LKVYVVPTYA APDRVDWVDY EERNESANGV AIAQRAIAEA GTNTVFMAGS GAYRTYEVLC TQVRATLQTQ RPVADEVMEQ GLPARVYENY ALLRFRAS
|
| |