Gene Information |
Locus tag | Franean1_6593 |
Symbol | |
ID | 5674908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8023418 |
End bp | 8024704 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641245444 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001510836 |
Protein GI | 158318328 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
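The length fields in this record can be cross-checked against the coordinates. Assuming the start and end positions are 1-based and inclusive, the gene length is End minus Start plus 1, and for an unspliced CDS that ends in a stop codon the protein length is the codon count minus one. A minimal sketch of that arithmetic (the function names are hypothetical and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(8023418, 8024704)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```

Both values agree with the Gene Length and Protein Length fields listed above.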
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCCGA ATCCCCCGCC GCGAAACGAC AACCTGCGGT TTCGGTTGGC CATCCTCACC GGTGACCGGA TACGGCGAGC CGGTGCCCTT CTCCTCGGTG ACGCGGGCCG CGGGTTCCGC CGCTGGTGCG CTCTGCCGGT GGCGGTGCTG GCCGTCGCGG CGACGGGCGC GCTGTTCGCC GTCGACGCCG GGAACACCAA CGCGGCGCAG GCTCACCCGG CGGGCGGCCC ACCCGGCGCG GCGACCGCGG CCGCGGACCT CTCGGACAGC CCCGCCACGC CGGAGCGCGC TCCTACCGGG AGAGGTGATG CGTCCGCGCC GGGTTCTCCC GGGACGTCGC GGCCCGGCTC GGCGTCGCCA TCTCCCCCAC TCACCTCGAC TCAGCCGCCG GCGCAGCCTC CGCCCTCGTC GGCGTCGCGC CCCGGCCCGG CCCGCCCCGA GGCCGGTTCC GCCGAACGGG TGATTCCGCC GCCGGCCGCC GGGCTCACCG CTGAGCCCGG TGACGGCCGA GCCCGGATCT GCTGGAATCC CGCGCCGAAC GCCACCGGCT ACGTCATGTA CCACCGGGAC GTCACCACCG GCGAAGGCTG GTACCGGGTC CCCTACCCCG TCACCGACAC CTGCAGCACC GTCACCCAGC TCACCAACAG CCACACCTAC GAGTTCAAGG TCCGATCGAG CAACGCCAAC GGCGAGGCCG ACTACTCAGG GACGGTGCAG GCACGGCCGG CCGGCACGAT CCCGCCACCG GCCACCGGGC TCACCGCGGC CCCCGGGAAC GGCTCCGCCC GGATCTGCTG GAATCCCGCG CCGAACGCCA CCGGCTACGT CATGTACCAC CGGGACGTCA CCACCGGCGA AGGCTGGTAC CGGGTCCCCT ACCCCGTCAC CGACACCTGC AGCACCGTCA CCCAGCTCAC CAACAGCCAC ACCTACGAGT TCAAGGTCCG ATCGAGCAAC GCCAACGGCG AGGCCGACTA CTCAGGGACG GTGCAGGCAC GGCCGGCCGG CACGATCCCG CCACCGGCCA CCGGGCTCAC CGCGGCCCCC GGGAACGGCT CCGCCCGGAT CTGCTGGAAT CCCGCGCCGA ACGCCACCGG CTACGTCATG TACCACCGGG ACGTCACCAC CGGCGAAGGC TGGTACCGGG TCCCCTACCC CGTCACCGAC ACCTGCAGCA CCGTCACCCA GCTCACCAAC AGCCACACCT ACGAGTTCAA GGTCCGATCG AGCAACGCCA ACGGCGAGGC CGACTACTCG GGCACCGCCG CCGCCACGCC CGGCTGA
|
Protein sequence | MEPNPPPRND NLRFRLAILT GDRIRRAGAL LLGDAGRGFR RWCALPVAVL AVAATGALFA VDAGNTNAAQ AHPAGGPPGA ATAAADLSDS PATPERAPTG RGDASAPGSP GTSRPGSASP SPPLTSTQPP AQPPPSSASR PGPARPEAGS AERVIPPPAA GLTAEPGDGR ARICWNPAPN ATGYVMYHRD VTTGEGWYRV PYPVTDTCST VTQLTNSHTY EFKVRSSNAN GEADYSGTVQ ARPAGTIPPP ATGLTAAPGN GSARICWNPA PNATGYVMYH RDVTTGEGWY RVPYPVTDTC STVTQLTNSH TYEFKVRSSN ANGEADYSGT VQARPAGTIP PPATGLTAAP GNGSARICWN PAPNATGYVM YHRDVTTGEG WYRVPYPVTD TCSTVTQLTN SHTYEFKVRS SNANGEADYS GTAAATPG
|
| |
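The sequences above are printed in blocks of ten characters separated by spaces. The following is a small sketch, under that layout assumption, for stripping the spacing and recomputing a GC fraction. The helper names are hypothetical, and the sample input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence, used here only as sample input.
sample = clean_sequence("ATGGAGCCGA ATCCCCCGCC GCGAAACGAC")
print(len(sample))
print(round(gc_fraction(sample), 3))
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content field of the record.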