Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0053 |
Symbol | |
ID | 5668479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 66165 |
End bp | 67283 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641238982 |
Product | hypothetical protein |
Protein accession | YP_001504427 |
Protein GI | 158311919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCCG TAGCCCTGGC TTTGGCCTGC TTCTTCCCCT TGATCGGCCG TCCCCGCCTG GCCTGTCAGC CTCTGCCCGA TCGAGTTGCG GAGATCGCCG AGATCGCCCA AGCGGCGGCC CGGGACGGTG CGGACGGACT GGCCGAAGGA GCACATGCCC TGAACAAGGC GGCGCTGCTC GCCAGCGACT GCGGTCTGGC GCCCCTCGCC CGTGACCTGT GCTGGCAGCA CATCAACATC TACTGCGCTG TCCCGCGCCC GCTCACCGTC CACGAGGCCC GATACATGCT CGAACCGGCC CTCAACCTCG CCCGGCTCCA GATCCGCGCC AGCGACGGGG AACAGGCGCT CGGGCTACTC ACCGCGATGT TCCAGGCCGT CTCGTCGAAC ACCGACCTGG TCGTCGACGG CCGGGTCCTT CCGCTGACTG ACCTCATCGG CACCCGCGAC GAGCGCCACA AGCTGCGCGA ATGGGTGTGG CTGCACCTCG TCGGGGACGG CGTCCGTGCC CTGGCACTCG CCGGCCGCTG GGACGATGCG GTCATCCACG CAGACACGTA CCGAGGGATC GGACTGCATC TCCTGGAAGG CCGCCAGGCG AAGATCCTCG CTCACTGCCT GACCGGGACG TCAGCCGAGG CCCGCGCGGC CCTGGCGGAG AGCACGCCGA TGTACCCGTG GGAGCTCCAG GTCGCCTCGT GCCTGGAGGT GATGTGCACC GAGGACACAT CCACAGCACA CGGTGTCACC ACCATGATCG GGCAGTTCCT GGGACAACGA CCGATGCCCG GCTACGCGGT CTTCCGTGCC CACCTCGGCA TGACCGTAGC CGCTCTCGCC GCCACCACCG ACCCAGACGC CGCCACCCGC GTTCTCACCC AGACAGTCGA GGAAGTGATC GAAGCCGAGG ACGGGTACGC GGCACGGGAC GTTCTCCGGC TTCGCCCCAC ACAAGCGGTC GACCTGCCAG CCAGGCACGA AAAGGCGCTC GCCGACCTAC TCAACGCCTC CGGCCTACGA GCAGAAACAC CGCCGGAACC GGTCCTGGAG TCTGTTCTCG GCTCCGCCCG GACCGCCGAA GCCGCGATCG TCGCGGCGAC ACACCCCCAG CGACGATGA
|
Protein sequence | MNPVALALAC FFPLIGRPRL ACQPLPDRVA EIAEIAQAAA RDGADGLAEG AHALNKAALL ASDCGLAPLA RDLCWQHINI YCAVPRPLTV HEARYMLEPA LNLARLQIRA SDGEQALGLL TAMFQAVSSN TDLVVDGRVL PLTDLIGTRD ERHKLREWVW LHLVGDGVRA LALAGRWDDA VIHADTYRGI GLHLLEGRQA KILAHCLTGT SAEARAALAE STPMYPWELQ VASCLEVMCT EDTSTAHGVT TMIGQFLGQR PMPGYAVFRA HLGMTVAALA ATTDPDAATR VLTQTVEEVI EAEDGYAARD VLRLRPTQAV DLPARHEKAL ADLLNASGLR AETPPEPVLE SVLGSARTAE AAIVAATHPQ RR
|
| |