Gene Information |
Locus tag | Franean1_6718 |
Symbol | |
ID | 5675031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8161443 |
End bp | 8163413 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245566 |
Product | integrase family protein |
Protein accession | YP_001510958 |
Protein GI | 158318450 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
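The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return cds_length_bp // 3 - 1


# Values taken from the record for Franean1_6718.
length_bp = gene_length(8161443, 8163413)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (1971 bp) and Protein Length (656 aa).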
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGGTA CCCTGTTCAA AAAATACGGA TGCCGAGACC CCGAAACGGG ACGGCAACTC GGCCGCTCCT GCCCCCGGCT ACGTCGCGCT AACGGCGGAT GGCGTTCGGA CCACGGCACC TGGTCCTACC GCGTCGACCT ACCGCGCTAT CCCGACAGCA AACGCCGCCT CGTCAGCCGC GGCGGTTTCC CCACACAGGG AGAAGCCCGC GAAGAACTCG AACGCATCGA AGCACTTCTC GCCGTCGCCG ACAAGGGCGA CAAGGGCGAC GAGGGCGGGC TACAGAAAGT CGCGGACCTC ATCGGCAACG CCATCACCAC CGGAACACCC CTCCCGAACC CGGTCGACGT GAAAAACCGC TACCGGCGCA GCGCCGACCT CAACCCAGAC ATCACCGTCC AAGACTGGAT GACCCGCTGG CTCGCCAGCC GCAAAACCAT CCGCCCCACC ACCCGGCAAG GCTACGAAGC CTACATCAAG GTCCACATCG TCCCCGCGAT CGGAACCATC CGCCTCGACA AACTCACCGT CACCCACCTC GATGACATGT TCACCGCCAT CGACAACACC AACAACGACA TCCGAAAAGC CCGCGAGTCC GACGACCCCG ACATCCGGCG CGCCGCCCAA CGCAAACGCA TCACCGGCCC CGCCACCCAA CAGCTCGTCC GCGAGGTCCT CCGCGCCGCA CTCAACGACG CCATCCGACG CGGACTCATC ACCCACAACC CCGCGAAACA CCTCGAGCTC GCCTCCGGGA AACGCCCCAA AGCCCTCCTC TGGACCGACG AACGCGTCAG CCGCTGGCGC GAAACCGGGC TCAGACCCTC CCCCGTCATG GTCTGGACCC CCGCCCAGAC CGGACGCTTC CTCGACCACA CACAGACCGA CCCGCTCTAC CCCATCTACC ACCTCATCGC CTACCGCGGA CTCCGCTGCG GCGAATCCGT CGGCCTCCAC TGGGACGACA TCGACTTCAC CCACGCCACC CTCACCATCC GCTGGCAAGT CATCCAGATC GGCTACGCCA CCCAGCTCGG CCGACCCAAA TCCGACGCAG GCGACCGCGT CATCTCCCTC GACACCAACA CCCTCACCGT CCTGAGAATC GCCCGCACCC GCCAACACGC CGCCCGCCTC GCCGCCGGCC GCGACTGGCC CGACACCGGA CTCACCTTCA CCCACCCCGA CGGCCAGCCC ATCCACCCCG AACACCTCAC CAACCGCTTC CACACCCTCC TCGACGACGC CGCGCTACCC CCCATCAGCC TCCACGGCCT CCGCCACGGC GCCGCCACCC TCGCCCTCGC CGCCGGCGCC GACCTCAAAG CCGTCCAAGA ACTCCTCGAC CACTCCACCA TCACCCTCAC CGCCGACACC TGCACCCACA TCCTCCCCGA CCTCGCCAGC GAGATCGCCG AGAACACCGC CCGGCTCATC CCCCGCGCCC GCACCCCCAC GACTACCCGA AGACCATCAC CAAGCACTGG CCCCCACCAG CCCCGCCCGG GCCCGACCCG GGCGGGGCTG GCACCCACAC CACCCACCAA CCCCCAGAAA CACCCCCGAC GCGCCCACCC GCCGGAACCT CACGCCCCGC CCTCCCCCCG ACCGAGCAGT TGGGTACGAT GGTCCACAGC AGGGCCGTTA GCTCAATTGG CAGAGCAACC GGCTTTTAAC CGGCGAGTTT CCCGCCCGGC GGCCGTCGCG AACCTTGATC GACATCGCCT TGACCTGCAT CGTTCGAAAA AACGACCCTC CAACGCGAGC GCGTCGGGCC GTGTGTGTGC TCAATGTGTG 
CTCATTGTGT CGTCGCCCGC CAAACGATCT CCGGTTGCCG CGCTGCGCAG CGGGCGAACC CGGCCGGCCG AAAGAGGCGG GCTCGCGGCG ATACACCAGA TGAGTTTGAT CGGTATGGCG GGCCAGCACT GCTGCGTCGG AGATCACAAG CCGAAAAGTC CTGAGATCTG A
|
Protein sequence | MKGTLFKKYG CRDPETGRQL GRSCPRLRRA NGGWRSDHGT WSYRVDLPRY PDSKRRLVSR GGFPTQGEAR EELERIEALL AVADKGDKGD EGGLQKVADL IGNAITTGTP LPNPVDVKNR YRRSADLNPD ITVQDWMTRW LASRKTIRPT TRQGYEAYIK VHIVPAIGTI RLDKLTVTHL DDMFTAIDNT NNDIRKARES DDPDIRRAAQ RKRITGPATQ QLVREVLRAA LNDAIRRGLI THNPAKHLEL ASGKRPKALL WTDERVSRWR ETGLRPSPVM VWTPAQTGRF LDHTQTDPLY PIYHLIAYRG LRCGESVGLH WDDIDFTHAT LTIRWQVIQI GYATQLGRPK SDAGDRVISL DTNTLTVLRI ARTRQHAARL AAGRDWPDTG LTFTHPDGQP IHPEHLTNRF HTLLDDAALP PISLHGLRHG AATLALAAGA DLKAVQELLD HSTITLTADT CTHILPDLAS EIAENTARLI PRARTPTTTR RPSPSTGPHQ PRPGPTRAGL APTPPTNPQK HPRRAHPPEP HAPPSPRPSS WVRWSTAGPL AQLAEQPAFN RRVSRPAAVA NLDRHRLDLH RSKKRPSNAS ASGRVCAQCV LIVSSPAKRS PVAALRSGRT RPAERGGLAA IHQMSLIGMA GQHCCVGDHK PKSPEI