Gene Information |
Locus tag | Franean1_1747 |
Symbol | |
ID | 5670149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2096065 |
End bp | 2097396 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240668 |
Product | hypothetical protein |
Protein accession | YP_001506091 |
Protein GI | 158313583 |
COG category | [S] Function unknown |
COG ID | [COG4198] Uncharacterized conserved protein |
TIGRFAM ID | |
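The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2096065
end_bp = 2097396
reported_gene_length = 1332      # bp
reported_protein_length = 443    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 1332 0 443
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length, and the gene length is a whole number of codons.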
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.506846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000552489 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCCTCA TGGTTGCCCC TGACGCCGCC GTGCCCGTTC CAGCCGGGCT CGTCCTCGCG CCGTTCCGCG CGGCGCGGTT CACCGCCGCC GGCCCGGATC TCGCCGCGCT GACCTCCCCG CCCTACGACG TCATCGACGA GGACGGCCGC ACCGCCCTCG AGGCGGCCGC GGAGCACAAC GTCGTCCGGC TGATCCTGCC GCGGGACCTC TCCCCCGCAG AGTCCCCCGG CGAGCCGGAC AGCCGGTACG ACCGGGCGGC GCGCACCCTG CGGGAATGGC TGGACGCCGG GATCCTCGCC CGGGACGAGG CCCCCGCGCT CTACGTCTAC GAGCAGGAGC AGGACGGCCA CCTCCAGCGC GGACTCGTCG GCGCCCTCGC CCTGGCCGAC CCGGACGCGG GCATCGTCCT GCCCCACGAG AACACCATGG CCGGCCCGGT CTCCGACCGG CTCGCGCTGA CTCGGGCGAC CGCCGCGAAC CTGGAACCGA TCTTCCTGCT CTACGACGGC GGCGGCCCGG CCAGCCAGGT CGTCGCCGCC GCGGTGACCA CCCCGCCGAT CGTGGACGCC CACACCGACG ACGGGGTGAC CCACCGGATC TGGGCGATCG ACGACCCCGC CCAGCTGGAG ACCGTCGCGG CGGATCTCCT CCCCCGCCGC GCGGTCATCG CCGACGGCCA CCACCGGTAC GCCACATACC GCCATTACCA GGCGGAACGG CACGCCGCGG GCGACGGTGC CGGCGCCTGG GACTTCGGGC TCACCTTCCT GGTGGACGCG ACTGCCAACG GCCCGCAGGT GCACGCGATC CACCGCGCCG TGCTGGGACT CACCCTCGCC GACGCCGTAG CGCGGGCCGA GGGCGCCTTC ACCGTCCGCC GGCTGACCGA GGCCGCCGGC GCCGGCGCAC CGGTCGACCC CGCGGCGCTG CTCGACGAGC TTGCCAAGGC GGGCCACGAC GGCCACGCCT TCGTGATCAG CGACGACTCG GACGCCTATC TGCTGACGGC GCCCGCCGCC GATCTCCTCG CTCGCGCGCT GCCCGCCGAC CGGTCGGCGG CGTTCCGCGG CCTCGACGTG ACCGTCGCGC ATCTGGCGCT GATCACGAAC GTGTGGGGTC TGGAGGACAA GGTCGGCGTC GTCGACTACT ACCACGACGC CCCCGCCGCG CTCGCCGCCG CACGCGCCAC CGGCGGTGTC GCGCTGCTAC TCAACCCGAC CCCGGTCGCC GACGTGACCG CCGTCGCCGG GGCCGCGGAG CGGATGCCCC GCAAGTCGAC CCTGTTCACC CCGAAGCCAC GCACCGGCCT GCTGATACGC CCGTTGGACT GA
|
Protein sequence | MRLMVAPDAA VPVPAGLVLA PFRAARFTAA GPDLAALTSP PYDVIDEDGR TALEAAAEHN VVRLILPRDL SPAESPGEPD SRYDRAARTL REWLDAGILA RDEAPALYVY EQEQDGHLQR GLVGALALAD PDAGIVLPHE NTMAGPVSDR LALTRATAAN LEPIFLLYDG GGPASQVVAA AVTTPPIVDA HTDDGVTHRI WAIDDPAQLE TVAADLLPRR AVIADGHHRY ATYRHYQAER HAAGDGAGAW DFGLTFLVDA TANGPQVHAI HRAVLGLTLA DAVARAEGAF TVRRLTEAAG AGAPVDPAAL LDELAKAGHD GHAFVISDDS DAYLLTAPAA DLLARALPAD RSAAFRGLDV TVAHLALITN VWGLEDKVGV VDYYHDAPAA LAAARATGGV ALLLNPTPVA DVTAVAGAAE RMPRKSTLFT PKPRTGLLIR PLD
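The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so the protein sequence can be spot-checked by direct translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table in the NCBI base order (T, C, A, G), relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first three and last two sequence blocks are used.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)

# First three blocks and the final two blocks of the gene sequence above.
# The final blocks start at base 1321, which is the first base of a codon.
first_blocks = "ATGCGCCTCA TGGTTGCCCC TGACGCCGCC"
last_blocks = "CCGTTGGACT GA"

print(translate(first_blocks))  # prints: MRLMVAPDAA
print(translate(last_blocks))   # prints: PLD*
```

The two translated fragments match the start (MRLMVAPDAA) and end (PLD) of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the reported GC content.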