Gene Information |
Locus tag | Franean1_1394 |
Symbol | |
ID | 5669801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1690153 |
End bp | 1691343 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641240319 |
Product | integrase family protein |
Protein accession | YP_001505746 |
Protein GI | 158313238 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
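The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp and End bp.
length = gene_length_bp(1690153, 1691343)
print(length, protein_length_aa(length))  # prints: 1191 396
```

Under these assumptions the computed values agree with the reported Gene Length (1191 bp) and Protein Length (396 aa).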
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0589061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCCGCA CCGGCAGACC ACCCCTCGGC CTCGGAACGT ACGGTGAGAT CCGGGTCTAC AAGATGGACT CCGGGCGTTA CAAGGCCCGC ACGCTCTACC GCGACTTCGA CGGCGTGACC CGCCCGGTCG CCCGGAACGG CGCGAGCAAG AACGCCGCGG AGACGGCCCT CAAGAACCAC CTCCGTGACC GCGTCCGGGA GGCCGGAGCC GAGGCAGAGA TCACCGCGGG GTCGACCGTC GAGGTGCTCG CCGAGGCGTG GTGGGCCGAG TTCTCCAAGC AGGACAAGTC CCCCGGCACC TTCCGCCTCT ACCGCGACCG GCTGGACAAC CAGATCATCC CGGCACTCGG AAAGATCCGG ATCCGGGAGC TCACGACTGG GGCCGCCAAC CGGCACATCA GTACGGTCAG CCAGAACAAC GGCGCTGGCG TGGCCAAGGC AACCCGCACG GTCCTCAGCA ACATGTGTGC CTTCGCCTGC CAGCGCGACG TGATGAAGAC CAACCCCATC CGCGAGGTGG CCCCGGTCCG GCCGAAGGCC AAGAAGGTGC CGAAGGCTCT CAGCGTCGCC GAGCTCCAGC AACTCCGCGC ATTGTTCACT TACGACCCCG CCGCGGTGCG CCGAGACATC CCCATGCTGT CGAGCATCCT GCTGGCGACC GGCGTGCGGA TCGGAGAGTG TCTGGCGTTC GTCGAGGACG CCCTCGACCC CAAGGAGGGC TCGATCGAGG TGCGCGGGAC GGTGATCTGG CTCAAGGGGG TCGGACCCAT CGTCAAGCCC GCACCGAAGA GCGCCGCGGG CTTCCGGCGG CTCCTGCTAC CAAAGTGGGC CGTCAACCTG CTTCGGTCCA GGTTCGAGGA GTCAGCCGTG ATCAGCAAGC CAGTGCCGGT GCTGAACGGC GAGGCATGGG ACTCCCCGCT GGCGTTCCCC ACATCCACAG GGCGGCTGCG GGACATCACC AATGTCGAGA GTTACTGGCG AGAAGCCGTC ACCACCGCAG GATTCGACTG GGTGGTGCCC CACACTTTCC GCAAGACCGT CGCCACCGAG ATGGACCGCG CGGGTCGGAC AGCACGCGAA ATCGCGGACC AGCTCGGCCA TTCTCAGATC ACACTGGTAC ACAATACTTA CCTAGGCCGC AAAGCCCGTG ACACCGGCGC CGCCGCAGCT CTTGAAGGGC TGGTCGCATG A
Protein sequence | MPRTGRPPLG LGTYGEIRVY KMDSGRYKAR TLYRDFDGVT RPVARNGASK NAAETALKNH LRDRVREAGA EAEITAGSTV EVLAEAWWAE FSKQDKSPGT FRLYRDRLDN QIIPALGKIR IRELTTGAAN RHISTVSQNN GAGVAKATRT VLSNMCAFAC QRDVMKTNPI REVAPVRPKA KKVPKALSVA ELQQLRALFT YDPAAVRRDI PMLSSILLAT GVRIGECLAF VEDALDPKEG SIEVRGTVIW LKGVGPIVKP APKSAAGFRR LLLPKWAVNL LRSRFEESAV ISKPVPVLNG EAWDSPLAFP TSTGRLRDIT NVESYWREAV TTAGFDWVVP HTFRKTVATE MDRAGRTARE IADQLGHSQI TLVHNTYLGR KARDTGAAAA LEGLVA
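The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned up and its GC content computed follows; the helper names are hypothetical, and the example string holds only the first two blocks of the gene sequence, not the full CDS.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
example = "ATGCCCCGCA CCGGCAGACC"
seq = clean_sequence(example)
print(len(seq), gc_percent(seq))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_percent` would let the reported GC content of 67% be checked.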