Gene Information |
Locus tag | Franean1_4467 |
Symbol | |
ID | 5672818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5332965 |
End bp | 5334365 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243335 |
Product | integrase catalytic region |
Protein accession | YP_001508751 |
Protein GI | 158316243 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2826] Transposase and inactivated derivatives, IS30 family |
TIGRFAM ID | |
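The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is an untranslated stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
gene_len = span_length(5332965, 5334365)
print(gene_len)                  # 1401, matching "Gene Length"
print(protein_length(gene_len))  # 466, matching "Protein Length"
```

The strand field does not affect these lengths; it only determines which strand of the replicon the listed gene sequence is read from.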
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACGGC CGGCGGATTG GACGCAGACG GTCACGGGGC GGGCGGGGAT GTGCTCGCCG GGGCGGCCGC CGGTGGCGCG GTGGGAGCAT CAGCAGCGGT TCTGGGCCGC GGTCGCGCGT GGGCTGAGCA GCGAGGATGC CGGTGTCGCG GCCGGCGTGT CGCCGGCGGT CGGGACCCGG TGGTTCCGCA ACTGTGGCGG GATGCCCCCT TCGGACTTCC CCGCCCCGTC GGGCCGATAC CTGTCGTTCG CGGAGCGGGA GGAGATCGCC CTGGGTCGTG CCCGCGGGGA CAGCATCCGC CGGATCGCGC GGGGTTTGGG CCGGCCTGCG TCGACGGTGT CACGGGAGCT GCGGCGGAAC GCCGGTACCC GCGGCGGGAC CCTGGTCTAC CGGGCCACGC TCGCGCAGTG GCATCGAGAC CGTCGGGCTG CCCGGCCCAA GACTGCCAAG CTCGCAGGCA ACGAGCGGCT GCGTGGCTAT GTGCAGGACC GGCTCGCCGG GCCGGTCCTG CGCGGCGACG GCACGGTGGT GCCCGGCCCG TGGACGGCGC CGTTCACCGG CAGGAACAAG CCGCACCGCC AGGACCGCCG TTGGGCCAGC GCGTGGAGCC CGGAGCAGAT CTCGCGGCGG CTGCGGGTCG ATTTTCCCGA CGACCCGGCG ATGCGGATCT CGCATGAGGC GATCTACCAG GCCCTGTACA TCGAGAGGCG GGGGGCGTTG CGCCGTGAGC TGGTCGCGTG CCTGCGTACG GGCCGCGCCC TGCGGGTGCC ACGCGCGCGG GCCGGACGCC GCCCCGACGG CATGGTCACC CCCGAGGTGC GGATCGGAGC CCGGCCTGTT GAGGCCACAG ACCGGGCGGT CGCGGGGCAC TGGGAAGGCG ACCTGATCAT CGGGTTGAAC CGGTCCGCGA TCGGCACGCT GGTCGAGCGC ACCACCCGGC TCACGGTCCT GCTGCACCTG CCCCGCATGG ACGGCTACGG CCATGAACCA CGGGTGAAGA ACGGTCCGGC GTTGGCAGGC CGCGGCGCCG ACGCTGTCCG GGACGCGATC ACACGAGCGT TCGCGGAGCT GCCCGAGCAG CTACGGCGGA CCCTGACCTG GGACCGCGGC AAGGAGATGG CCGGACACGC CGCGCTGACC GCCGACACGG GCCTGGGGGT CTACTTCGCC GACCCGCACA GCCCCTGGCA GCGCGGCACG AACGAGAACA CCAACGGGCT GCTACGCCAG TACTTCCCCA AGGGCACCGA CCTGTCCCGC TGGACCCGCC ACGAACTCGC CACCATCGCC GCGACCCTCA ACGACCGGCC CCGCAAGACC CTCGACTGGA AGACCCCCAC CGAAGCGATG AACAACCAGC TACTCTCACT TCAACAACCC GGTGTTGCGA GGACCGGTTG A
|
Protein sequence | MGRPADWTQT VTGRAGMCSP GRPPVARWEH QQRFWAAVAR GLSSEDAGVA AGVSPAVGTR WFRNCGGMPP SDFPAPSGRY LSFAEREEIA LGRARGDSIR RIARGLGRPA STVSRELRRN AGTRGGTLVY RATLAQWHRD RRAARPKTAK LAGNERLRGY VQDRLAGPVL RGDGTVVPGP WTAPFTGRNK PHRQDRRWAS AWSPEQISRR LRVDFPDDPA MRISHEAIYQ ALYIERRGAL RRELVACLRT GRALRVPRAR AGRRPDGMVT PEVRIGARPV EATDRAVAGH WEGDLIIGLN RSAIGTLVER TTRLTVLLHL PRMDGYGHEP RVKNGPALAG RGADAVRDAI TRAFAELPEQ LRRTLTWDRG KEMAGHAALT ADTGLGVYFA DPHSPWQRGT NENTNGLLRQ YFPKGTDLSR WTRHELATIA ATLNDRPRKT LDWKTPTEAM NNQLLSLQQP GVARTG
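The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It assumes the sequences are pasted as shown, in space-separated blocks of ten characters, and it uses the codon assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in which start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
# Codon table laid out in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises ValueError on characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGGACGGC CGGCGGATTG GACGCAGACG"
print(translate(prefix))  # MGRPADWTQT, the first block of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` should allow the GC content and protein sequence fields to be checked in the same way.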
|
| |