Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5242 |
Symbol | |
ID | 5673576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6303469 |
End bp | 6304755 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641244096 |
Product | phosphoesterase PA-phosphatase related |
Protein accession | YP_001509506 |
Protein GI | 158316998 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGTT GGGCGCGCGC CCGCCGTGGA GTCGTGTGTA CGGCCGTCGT CGGTGTCCTC GCCGCGGCGT TACCGGTCGC GGCCGGGCTC CCGGCCGCGC GGGCGGCCGG TGAGAACACA TCCACCAATG TCGTCATCAT CTGGGACCGC AATGCGCAGA CCGCGATCTG GGACGTCGCC GGCCAGCAGC CGCAGGTCCA GGCGCGCAGC TTCGCGATGG TGCACGGGGC CGTTTACGAC GCGGTGAACG CCATCGCCGG GCGGCCTTAC CAGCCTTATC TGCTCGCCCC GCGGGCCAGC GGGCGCGAGT CGACGGACGC CGCCGTCGCG ACCGCCGCCT TCCAGGTACT CAGCTCCCTG TTCCCGGCCC AGCGGCCGCG GCTGCAGACG CAGTACGACG AGTGGATGGC GAACCTTCCC GACGACGCGG CGAAGCGGAG CGGGACCGCC GTGGGTGGCC AGACCGCCGC AGCGATGATC AGTGCTCGGC AGAACGACGG GGCCTTCGGT AATCCGACCT GGCCGGTGGG CACCCAGCCC GGCCAGTGGC GGCCGACTCC GCCGACCTTC GCCTCCGACA CGGCCTGGGT GGCGAACCTC AGGCCGTTCC TGATCCCGAG CGCGTCGATG TTCCGCTCGG CCGGGCCGCC GGCGCTGACC TCCGAGCGCT ACGCCCGGGA CCTCAACGAG GTCAAAACGA TCGGCGCCGT CAACAGCACG ACCAGGACGC TCGACCAGAC CCAGGCGGCG ATCTGGTGGC ACGACCGGCA CCTGGGTGAA TGGGAGATCA AGCGCCAGCT CGCCACGGGC CGCCGTCTGA GCACCCTGCA GACGGCCCGC ATGTTCGCGA TGGTCGACCT CACCGAGGCC GACGCGACGA CCGCCTGCTT CAACGAGAAG GCGGCCTGGA CGTCCTGGCG GCCAGTCACC GCGATCCAGC TGGCCGACAC CGACGGCAAC CCGGCAACCA CCGCCGACCC GACCTGGGCA CCGCTGCTCG TCACCCCGCC ACACCCCGAC TTCACGTCCG GGCACACCTG CTTCACGACG GCGAGCATGT CGACGCTGGC GTTCTTCTTC GGCCGGGACG ACATCCCGTT CAGTGCGTAC AGTGCCGATT CGGGTACCAC ACGCTATTTC CGTGGTTTCT CCCATGCCAT CGCCGAGGTG ATCGAGGCTC GCGTCTGGGG TGGCATCCAC ACTCGGTCGG CCGACACCGA GGGCGCGAAG ATCGGCGCCA AGGTGACCGC CTACGCGACC AGGAACTATT TCCGCCCGCG GCGTTGA
|
Protein sequence | MARWARARRG VVCTAVVGVL AAALPVAAGL PAARAAGENT STNVVIIWDR NAQTAIWDVA GQQPQVQARS FAMVHGAVYD AVNAIAGRPY QPYLLAPRAS GRESTDAAVA TAAFQVLSSL FPAQRPRLQT QYDEWMANLP DDAAKRSGTA VGGQTAAAMI SARQNDGAFG NPTWPVGTQP GQWRPTPPTF ASDTAWVANL RPFLIPSASM FRSAGPPALT SERYARDLNE VKTIGAVNST TRTLDQTQAA IWWHDRHLGE WEIKRQLATG RRLSTLQTAR MFAMVDLTEA DATTACFNEK AAWTSWRPVT AIQLADTDGN PATTADPTWA PLLVTPPHPD FTSGHTCFTT ASMSTLAFFF GRDDIPFSAY SADSGTTRYF RGFSHAIAEV IEARVWGGIH TRSADTEGAK IGAKVTAYAT RNYFRPRR
|
| |