Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0940 |
Symbol | |
ID | 5669354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1100652 |
End bp | 1102010 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239867 |
Product | hypothetical protein |
Protein accession | YP_001505302 |
Protein GI | 158312794 |
COG category | [S] Function unknown |
COG ID | [COG5282] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03624] putative hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.685586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG GCGGCTTCCC CTTCGGCTTC GGCCCCACTC CGGGTGGCGA CCCGGAGCGC CCCGCTGGCG GGGCCCCGTT CTTCGCCGAG CTCGAACGGC TGCTCTCCTG GCAGGGCGGT CCGGTCAACT GGGAGCTCGC CCGCCAGGTG GCGGTGCGGA CCCTGGGCGG CGACGACCGC GCGGTGCGGG CCGCCGAGAC CGGTGAGGTC GAGAAGGCGC TGCGCATCGC CGACGTGTGG CTCGACCCGA TGACCGCGCT GCCGGCCGGC GCGACCACCG CCGCGGCGTG GTCCCGCGAG CAGTGGATCG AGGCCACGCT GCCCGTCTGG CGGACGCTGT GCGACCCGGT GGCGGGCAAG GTCGTCGAGG CCATGCGCAC CGGGATCTCG TCCGGCCTGA GCCAGCTCGG CGGCGGCGAC CTGCCGCCCG AGCTCGCGGG CGCGCTGCCG CCCGGCGTCG ACCTCGGCCG GCTGATGGGC GCCGGCGGGC CGGTCGTGCA GATGATGAAC CAGGTCGGCG GCATGCTCTT CGGCGCGCAG GTCGGCCAGG CCATCGGCAG CCTGGCGGCC GAGGTGGTCA GCTCGACCGA GGTCGGGCTG CCGCTGGGCC CCGCGGGCAC GGCCGCCCTG CTGCCGGCGA ACGTGGCCGC CTTCGGGCAG GGTCTGGGCG TCGAGGACGA AGAGGTCCGT ATCTACCTGG CGCTGCGCGA GGCGGCGTCG AACCGGCTGT ACGCGCACGT CCCCTGGCTG CGGGCGCACG TGCTGGGCGC GGTCGAGGAG TACGCGCGTG GCATCGCCGT CGACCCGGAG GCGGTCGGCC GCGTGATGCG GATGATCGAC CCCACCGCGC TGATGAACCC CGAGCGGCTC ACCGAGGCGC TCGGGGAAGA TGTCTTCGCC GACGCGGACA CCCCCGAGCA GAAGGCGGCA CTGGCCCGCC TGGAGCTGAT CCTGGCGCTG ATCGAGGGCT GGGTGGACCA CGTCGCGGAC ACCGCCGCGT CCGAGCACCT GCCGTCCGCG GCGAAGCTGC GCGAGATGGT CCGTCGGCGC CGCGCCGAGG GCGGCCCGGG AGAGCAGATC TTCTCGACCC TCGTCGGCCT GTCGCTGCGC CCGCGCCGGC TGCGCGAGGC CGCCGCGCTG TGGGAGGCGT TGCGCGAGGC CCGCGGGCAC GACGGGCGGG ACGCGGTATG GGCGCATCCG GACCTGTTGC CCGGCGGGGA GGACCTCGCG GACCCGTCCG CGTTCGTCTC CGGCGCCGGA GCGGGCTCCG ACGTCGACAT CATGGCGGAA ATCGAGAAAC TCGACGACAC GGCGCCCGGA GAAGACACTC CGCCGGCCTC GGGTGACTCG CCTTCCTGA
|
Protein sequence | MSGGGFPFGF GPTPGGDPER PAGGAPFFAE LERLLSWQGG PVNWELARQV AVRTLGGDDR AVRAAETGEV EKALRIADVW LDPMTALPAG ATTAAAWSRE QWIEATLPVW RTLCDPVAGK VVEAMRTGIS SGLSQLGGGD LPPELAGALP PGVDLGRLMG AGGPVVQMMN QVGGMLFGAQ VGQAIGSLAA EVVSSTEVGL PLGPAGTAAL LPANVAAFGQ GLGVEDEEVR IYLALREAAS NRLYAHVPWL RAHVLGAVEE YARGIAVDPE AVGRVMRMID PTALMNPERL TEALGEDVFA DADTPEQKAA LARLELILAL IEGWVDHVAD TAASEHLPSA AKLREMVRRR RAEGGPGEQI FSTLVGLSLR PRRLREAAAL WEALREARGH DGRDAVWAHP DLLPGGEDLA DPSAFVSGAG AGSDVDIMAE IEKLDDTAPG EDTPPASGDS PS
|
| |