Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4886 |
Symbol | |
ID | 5673226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5861940 |
End bp | 5862944 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243741 |
Product | hypothetical protein |
Protein accession | YP_001509157 |
Protein GI | 158316649 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2887] RecB family exonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.335853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAA CGCCTGACCA GGCCCCGGGC TCGGCCGCCG AAGGTTGCCC GACCGGGCCG GTCCAGGTCG GGCCCGGCTG GCATGAGGGC CGCGACCCCG CGGCGGCCCC GGCCTCGCCG CCGACGCCAC CGGCCGCGGC GCCACAGGCG CCGCCGATAA CGGCGTCGTT ATCCCCCTCC CGGGCCGCCG ACTTCGTCAA CTGCCCGCTG CGCTACCGCT TCCGCGTGGT CGACCGGCTG CCGGAGGCGC CGAGCGAGGC GGCGACGCGG GGCACGGTGG TGCACGGCGT GCTCGAGCGG CTGTTCGACC TGCCCGCCCG GAGCCGGACC CAGCCGGCCG CGACCGAGCT CGTCGAGCCG GTCTGGGCCG ACCTGCTCGC CCGCGACCCG GCGCTCGGTG GCCTGTTCGA CGACGACGGC GCCCTGCGCT CCTGGCTGGA CAGCGCGCGT GACCTTCTCG CCGGCTACTT CACCCTCGAG GACCCGACCA GGCTGGCCCC GGCCGCCCGG GAGCTCTACG TCGAGCATGT CCTGGCCTCC GGGCTGCGGC TGCGCGGCTA CGTGGACCGC CTCGACTCGG CCGAGACCCC CCAGGGCACC GCGCTGCGCG TCATCGACTA CAAGACGGGC CGCTCCCCCG GCCCGGCGTT CGAAGGCGCG GCCATGTTCC AGATGCGGTT CTACGCGCTC GTGCTATGGC GTTCGCGTGG CGTCATCCCG CGCGAGCTCC GGCTCTATTA CCTGAGTGAC CGAACCTGGC TGCGTGCCAC GCCGGAGGAG TCGGAGCTGC GCGCCACCGA GCGCCGGATC GAGGCGCTGT GGGCGGCCAT CGCCCGGGCG CACCGGACGG GCGACTGGCG GGCCACGCCG GGCCGGCTGT GTGACTGGTG CGATCACAAG CCGAGGTGCC CGGCATTCGG CGGCACCCCT CCCCCGCTGC CGGAGAACCG CGCGGAGCTG CCCGTCGACG ACGCCGCCCC CACCGTCTGC GAGGCGGACG GCTGA
|
Protein sequence | MTTTPDQAPG SAAEGCPTGP VQVGPGWHEG RDPAAAPASP PTPPAAAPQA PPITASLSPS RAADFVNCPL RYRFRVVDRL PEAPSEAATR GTVVHGVLER LFDLPARSRT QPAATELVEP VWADLLARDP ALGGLFDDDG ALRSWLDSAR DLLAGYFTLE DPTRLAPAAR ELYVEHVLAS GLRLRGYVDR LDSAETPQGT ALRVIDYKTG RSPGPAFEGA AMFQMRFYAL VLWRSRGVIP RELRLYYLSD RTWLRATPEE SELRATERRI EALWAAIARA HRTGDWRATP GRLCDWCDHK PRCPAFGGTP PPLPENRAEL PVDDAAPTVC EADG
|
| |