Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2542 |
Symbol | |
ID | 5670936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3021334 |
End bp | 3022998 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641241458 |
Product | integrase domain-containing protein |
Protein accession | YP_001506878 |
Protein GI | 158314370 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACTCC ACGACAGACC GCGCGCCAAC AGTGCCAAGA CCGCGACCAG TTGTCTGTCG TGCCTGGCCT GGGGGTTGCC GTGTCGGCGA GGGAAGTGCC GAGCCTGCTG CGACTTCGTC GCACCCCGCT ACGGCCATGC GGTCGGCGAG TGCGGTGCCT GCCGTCGGCG GGAGCCGCTG AAGAAGGGGT TCTGCCGGTT GTGCTGGTGT CAGGCACGCC TGGAACGCAC GACGGGCACC TACAAAATGC TGATGCCCCA CGTCCGTCTG GTCCGCCATC ATCAGCTGTT CTTCGCCGAC ATGGCCGGCT ACCGAGACAC CCACACGGCC CCGCGACGTT TCGACGAGAC CGGTCGAAAA CCCCCGCCGC CACCCGCTTA CCGGCCCGAG ACCCGCCACG TCCAGCCGGC TCTGTCCGAC GACGTCGTTC GCCGGCACTA CCGGTACGGA CGTTACGACC TGCGCCGAGG ACCGGCGCCT GACAACCCGT GGTTGGCCTG GGCGCTTTAC ATCGCCTATG ACCTCGCTGA GAAAAGGGGA TGGGGATCGT TCGTCCGAGG TGGAATGCAA CGGACCCTGG TCATGCTGCT GGCCGGCCAC ATCGACGGAG AACTGATCCG CGTCAGCGAC TTCTACGAGA CCGTGGCCGA GCACTCCACG AACATCGACG ACACCATCGA GATCCTCACC GTCATGGACG TCGTCCTCGA TGACCGGGAA CAGGCTTTCG ACCGGTGGCT GCGAGCTGAA CTCGACGGGC TCGCCCCAGC AATCCGACGG GACACCCATA CCTGGACCAC GTTGCTGCAC AATGGCGGCC CGCGGAACCG AGCCCGTGCG CCGGGCACCG CCGCTGACTA CCTGCGCACC ATCCGGCCCG CGCTGACGGC CTGGTCGACC CGCTACGGCC ACCTGCGGGA GGTCACCCGT GACGACGTCA TCGCCTACCT CGAACCGCTG ACCGGCCCGC CGCGGGAGAG GGCCACGACC GCCCTGCGGT CGCTGTTCGT ATGGGCGAAA CGCGCCAATG TCATCTTCCG TAACCCCGCC GTCCGCCTGC GCGTCCCACA GCGAGCAGAC CCGGTCTGGC AACCGCTGCG CCCGGACGAA CTCGCCCGCA CCGTCGCGAC GGCCACCACC GTGCACGCCC GCCTGTTCGT CGCCCTCGCC GCCGTCCACG CCGCCCGCGT CGGACAGATC CGCGCCCTGC AACTCGACGA CGTCGACCTC GGCAACCGAC GCATCACCAT CGCCGGCCAT GACCGGCCAC TGGACGACCT CACCCACCAG ACCCTGCTGG CGTGGCTGGA GCACCGCCGG ACCCGCTGGC CAGTCACCGC CAACCGCCAC CTGGTCATCA GCCCATGCAC CGCCGGAGGG CTCGGACCCG TCAGCTACCC CTGCCTCGCC CGCCCGCTGC GCGGGCTGCC CGCGACCCTC GACCGGCTCC GTGTTGACCG GCAGCTCGAG GAGGCACTTA CCTCCGGCGC CGACCCCCTG CACGTCGCCG CGGTCTTCGG CGTCAGCGAC GCCACCGCGA TCCGCTACGC GGACAATGCC CGTCAGCTCC TGGCGCGCCC TCACGAGAGC GGTCCCTCGG GATCGCCGCG AACCCAAGGG TCCAGACCCG GCAAAGAGCC TGATCGACAC TTCAGTTCCC GCTGA
|
Protein sequence | MTLHDRPRAN SAKTATSCLS CLAWGLPCRR GKCRACCDFV APRYGHAVGE CGACRRREPL KKGFCRLCWC QARLERTTGT YKMLMPHVRL VRHHQLFFAD MAGYRDTHTA PRRFDETGRK PPPPPAYRPE TRHVQPALSD DVVRRHYRYG RYDLRRGPAP DNPWLAWALY IAYDLAEKRG WGSFVRGGMQ RTLVMLLAGH IDGELIRVSD FYETVAEHST NIDDTIEILT VMDVVLDDRE QAFDRWLRAE LDGLAPAIRR DTHTWTTLLH NGGPRNRARA PGTAADYLRT IRPALTAWST RYGHLREVTR DDVIAYLEPL TGPPRERATT ALRSLFVWAK RANVIFRNPA VRLRVPQRAD PVWQPLRPDE LARTVATATT VHARLFVALA AVHAARVGQI RALQLDDVDL GNRRITIAGH DRPLDDLTHQ TLLAWLEHRR TRWPVTANRH LVISPCTAGG LGPVSYPCLA RPLRGLPATL DRLRVDRQLE EALTSGADPL HVAAVFGVSD ATAIRYADNA RQLLARPHES GPSGSPRTQG SRPGKEPDRH FSSR
|
| |