Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3520 |
Symbol | |
ID | 5671890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4181376 |
End bp | 4182716 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242407 |
Product | hypothetical protein |
Protein accession | YP_001507827 |
Protein GI | 158315319 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.80961 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACACTG GACCGGGCCC TGAACCGCGA CCCGCTCTTG ACCAGCTCAT CGCGCACATC GCCGCACTCA AACAGCGGCG AAACGCGAGC TGGTCGGCCT TGGAACAGCA GACCGGTATC ACCAGCCAGG CCCTCTCCGC CGCAGCCCAG GGTCACCAGC CCGGCGGCCG GGAGTGGCGA ATCCCCTCCG ACTCCGTGAT CATCGCCCTG GACCGCGGCC TTCGCGCCGA TGGGAGCCTG TTCGAGCGCT GGGTACAGGT CAAACGCGAA GACGAGGAAA TCCGCCTGGG ACGCATCGCG GCGAAAGCGT TTCCCGCAAT GGATGGACTA CCCCCACCGC CAGGGCAGAC AGTGGACAGG GCAGGGGAGG CGAGAACGAC GGACAGAAGA CTTTTCGGTG TCGGAGGCCT CAGCGTGGCG GCGGTGCTGG CATTCGCTGG CGACGTGGAC GACCAGCTCC AGTCACACCG ACCCAAGATC ACGGACGTGG CGGACGCGGA GACCGCGCTC GCCAATCTCG AACGAGACCG CGACGCCGCG GACCCGGCGG ATCTGTTCCC GCCCGCCTAC GAAGCCTGGA CGGCGGTTGA GGGGATCCTG CCGAGGCGAG TCCATCCCGC GTACGTCCCG AAATTGACCC TGCTGGCAGG GACTCTCGCG GCCGGCCTGT CGACGGTCGC CTCGTTCGCC GGGCACGAGC GGTTCGGCCG GGTCTTCGCC GGCATCGCCG AAGTACACGC CAACGCCGCC GGAGAACCAG CACTTCGGGC CCGCGTCGCC GGAATCCAGT CATGGCAGGC GCTCGACGCG GGTCTCGCGC TCGACGCCGC CGACATCGCC GCGCGGGGAC GCCAGCACGC GGATCCGGCA GACCGGGCTC GCCTCGCCGC CTACGAGGCG GAGGCCGCCG CCGCAGGCCT GTACAGCCGC GCCGACGAGG CAGTCGCGGC GATGCGAACC AGCATGCGCG CCGCCGCTAC CGGCCGGCCC GCGATCGCCT GGGGCGACGC CAACGAACAG CTCTTCACCG CGCTCACCGC GGCACAGACA CCCGGCCGCG CCACAGTGGC GATCACCCTC GGCACCAGAG CCGCCGAATC GTTCGACCGT CCGTGCCAGG GCATGGCTCT CGCTCATCTG GCTGTCGCCA CTGGGTACGT CCGTAAGGAT CGTCCGGCCC CGGACGTGGC CGCCGCCTCA GCCGTCGCCG CACTCGACAT CGTCGAACAC GCCCCGAACG CAGAGGTCCA TGACCGTGCT CGCCGGCTCG CCACCGAGCT GTCCGCGTGG AGCTCCGAGT CACTGGTGCA GGAACTCGAC CAGCGGACCG CCGCCCTCTG A
|
Protein sequence | MHTGPGPEPR PALDQLIAHI AALKQRRNAS WSALEQQTGI TSQALSAAAQ GHQPGGREWR IPSDSVIIAL DRGLRADGSL FERWVQVKRE DEEIRLGRIA AKAFPAMDGL PPPPGQTVDR AGEARTTDRR LFGVGGLSVA AVLAFAGDVD DQLQSHRPKI TDVADAETAL ANLERDRDAA DPADLFPPAY EAWTAVEGIL PRRVHPAYVP KLTLLAGTLA AGLSTVASFA GHERFGRVFA GIAEVHANAA GEPALRARVA GIQSWQALDA GLALDAADIA ARGRQHADPA DRARLAAYEA EAAAAGLYSR ADEAVAAMRT SMRAAATGRP AIAWGDANEQ LFTALTAAQT PGRATVAITL GTRAAESFDR PCQGMALAHL AVATGYVRKD RPAPDVAAAS AVAALDIVEH APNAEVHDRA RRLATELSAW SSESLVQELD QRTAAL
|
| |