Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4171 |
Symbol | |
ID | 5672526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4958615 |
End bp | 4959571 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243044 |
Product | 5'-3' exonuclease |
Protein accession | YP_001508461 |
Protein GI | 158315953 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000586174 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.269504 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATCGGC TGCTGCTCGC GGACACCCCG TCGTTGTATT TCCGGGCTTT TCACGGGGTG CCCCGGTCGG TGCGGGCCCC CGACGGGATG GCGGTGAACG CGGTGCGGGG TCTGCTGGAC GTGCTGGCCC GGCAGATCGT CGAGGTGCGG CCGCGGCGGT TGGTGTGCTG TTTCGACGCC GACTGGCGGC CGGCGTTCCG GGTGGCGCTG ATCGGCTCGT ACAAGGCGCA TCGGGTCGCC GGCCCCACCC CCGGCCCCGC GGCCGGGGCT GGTGGGGTGG AGGAGGAGGT GCCGGACGAG TTGGAGGCGC AGCTACCGGT GATCGACGCG GTGCTCGACG CGTTCGGGAT CGCCCGGGTG GAGGCGGCTG GCTTCGAGGC CGACGACGTG ATCGGCACGC TGGCGACCCG GCACAGCGGC CGGCCCGGTG GCGGGCCGGT GGACATCCTG ACCGGGGACC GGGATCTGTT CCAGCTGGTG TGTGACGACA CCGGCGTGGT GGTGCGGTAC GCGGTGGAAC GGTTCGCGGT GGTCGACGAG GCGTCGATCA GCGCCCGCTA CGGGATCCCT GGCCGGGCCT ATGCCGATTT CGCGGTGCTG CGCGGTGATC CCAGCGACGG TCTGCCGGGG GTGGCGGGGA TCGGCGCGAA GACGGCGGCG GCGCTGCTGG GCCGCTTCGG GTCGCTGCGG GCGATCCTCA CCGCGTTGGA CGCCGGGGGT GAGGACGGGT TCCCGGCCGG GGCGCGCCGC CGGCTGACCG CGGCCCGTGA CTATGTGGAC GCCGCGGTCA CCGTGGTCGG GGTGGTCCGC GATGTGCCGC TGGGCCCGGT CGCCGACGTG CTGCCCGTTC GGCCGGTGGA TGAGGCGGCG CTGGCGGCGC TGGCGGGCCG GTACGGGCTG GGCGGGTCGG TGGACCGGCT GACCATGGCA TTGGCCACCC TCGCCGAGGG CCCCTGA
|
Protein sequence | MDRLLLADTP SLYFRAFHGV PRSVRAPDGM AVNAVRGLLD VLARQIVEVR PRRLVCCFDA DWRPAFRVAL IGSYKAHRVA GPTPGPAAGA GGVEEEVPDE LEAQLPVIDA VLDAFGIARV EAAGFEADDV IGTLATRHSG RPGGGPVDIL TGDRDLFQLV CDDTGVVVRY AVERFAVVDE ASISARYGIP GRAYADFAVL RGDPSDGLPG VAGIGAKTAA ALLGRFGSLR AILTALDAGG EDGFPAGARR RLTAARDYVD AAVTVVGVVR DVPLGPVADV LPVRPVDEAA LAALAGRYGL GGSVDRLTMA LATLAEGP
|
| |