Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1056 |
Symbol | |
ID | 5669470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1239228 |
End bp | 1240496 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641239985 |
Product | hypothetical protein |
Protein accession | YP_001505418 |
Protein GI | 158312910 |
COG category | [S] Function unknown |
COG ID | [COG2899] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.969674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGGGTT CCGTACGGAC CGCGGAGCTC GTCCGTAAAC GCCTGTCGGA GCACTGTGAG CCGGTCCGGG AGGTGTTCCA AGGAGTGCAC GCACTTCGAA TTTTCGGCTG GTCCCTCGCC ATCACGGTGA TCGGGGTCGC GGGGGCTGGT GTGATCGGCG GCGGGGACGC CGCGGCGATC GTCGCGATCC TCGCCGTCCT CGAGATCAGC CTCTCGTTCG ACAACGCGGT CATCAACGCG ACGATCCTGC GCCGGATGAG CGAGTTCTGG CAGCGCATCT TCCTCAGCGT CGGCGTCATC ATCGCCGTGT TCGGGATGCG GCTGCTGTTC CCGATCGTGA TCGTGGCGCT GACGGCGCAT CTCAGCCCGG TGGACGTCTT CGACCTGGCG CTGAACCACG AGGAGGAGTA CGGGGCCCGG CTGCACGACG CGCACCCCTC GATCGCCGCC TTCGGCGGGA TCTTCCTTTT TATGATCTTC CTTGACTTCA TGTTCGACCC GGAGCGCGAG ATCCAGTGGA TCAAGCGCAT CGAGGAGCCG TTCCGCCGGG CCGGCCAGCT CGATGTCGTG TCCGTCGTGC TGGGGCTCGT CGCGTTGCTG GTCGTGGGGG AGGCCTTCTC CGGCGACCAC ACCCAGCAGG TGCTGACGGC GGGGGTCGCC GGTCTCGCCA CCTATCTGGG TGTCCGCGGG CTGGGCGAGT TCTTCGAGGC CCGCGGAATC GGCGCGGACG ACGACGAGGA CGACGAGGAC GAGAAGGCCG GGGCCAACGG CTCGGCACCC GGTCGCACCA CGGGAACCTC GGACGTGGTC CTTGCGACCG GCCGGGCCGC CTTCTTCCTG TTCCTCTACC TCGAGGTGAT CGACGCGTCG TTCTCGTTCG ACGGCGTCGT CGGGGCGTTC GCCATCTCGC AGAACATCTT CATCATCGCG GCCGGCCTGG GTATCGGCGC CATGTACATC AGGTCGACCA CGGTGTACCT CGTCCGGCGC GGGACACTGG GCGAGTACAT CTACCTGGAA CACGGAGCGC ACTACGCGAT CGGCGCGCTC GCCGTCATCC TGGCGGTCTC GATCGAGACC GAGGTGCACG AGATCGTCAC CGGGCTGATC GGTGTGGCGT TCATCGGGCT GGCCCTGCTG TCCTCGATCC GCCACCGCTC GAAGGAGCGG CAGGGGAACC TCGACGGCGG GGACGCCGCG GCCGCGGGTG ACCAGCCCGG GGACGCCGGC GATCCCGAGG ACGCGCCGGT CGTCGGCACC CGGAGCTGA
|
Protein sequence | MVGSVRTAEL VRKRLSEHCE PVREVFQGVH ALRIFGWSLA ITVIGVAGAG VIGGGDAAAI VAILAVLEIS LSFDNAVINA TILRRMSEFW QRIFLSVGVI IAVFGMRLLF PIVIVALTAH LSPVDVFDLA LNHEEEYGAR LHDAHPSIAA FGGIFLFMIF LDFMFDPERE IQWIKRIEEP FRRAGQLDVV SVVLGLVALL VVGEAFSGDH TQQVLTAGVA GLATYLGVRG LGEFFEARGI GADDDEDDED EKAGANGSAP GRTTGTSDVV LATGRAAFFL FLYLEVIDAS FSFDGVVGAF AISQNIFIIA AGLGIGAMYI RSTTVYLVRR GTLGEYIYLE HGAHYAIGAL AVILAVSIET EVHEIVTGLI GVAFIGLALL SSIRHRSKER QGNLDGGDAA AAGDQPGDAG DPEDAPVVGT RS
|
| |