Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3186 |
Symbol | |
ID | 5671562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3753177 |
End bp | 3754928 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641242080 |
Product | integrase catalytic region |
Protein accession | YP_001507500 |
Protein GI | 158314992 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.605604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCCC CAGAGCACGA CGATTTGTGG CCGACACCAC AGGCCGACAG CCTGCCCTCC GGACAGCGGC CAATCCGTCC GGACCCAACC GACCACGACG CCGATTCGGC GGGCGCGACT GACCGCCCAG TAGCCGGTCC ACATTCCCAT GCCTCAGGCA CCCCAAAACC TCGCCGTCAC CGCCGGCGAC GAACCATCGC GATCATTCTT CTCGTTCTGC TCGCACTTTT CCCGGTCTTC GACCGGATCA CGGTCTCAAT GGTCGAGAAG CAGGTCGTCA AACAACTCGA AACCGCCGTC GCGGACACCC TGGACTGCAA TGCACCCCAA CCAGCCGTCA GGCGTGTCAA CATCGCCGGT TTCCCGTTCC TCACCCAGGT GCTGCTCGGG AAGCCCAGGG ATGTCAGTCT GTCCATCGAC GACCTGTCGA CACCCGGCCC CCGGATCTCC TCGCTGGACG CGAACGCCAA AGGCATCAAA ATTCCGATCT ACAACATGAT CACCGGCGGT GATGGAAAAC TTTCTGTCGA CGAGGTACGG GCCACGGTCA AGATGAGTTA CACAGACTTG AACGCCTATC TCGCCGGGAA GACGGGCCAC CTGCAGGTGA AACCGGCAGA CGGCGGACGG AGCCTCAACA TCTCTGGAAC AGTAGACCTA CCGCTGATCG GCTCCCAGCA GATTGACGGC GTCACCACCT TCCAGGTCCG CGACAATGAG ATCGAGTTGA CACCATCCCA CCTCACGTTA CGCGGAGCGA TCAACCTCGA CTTTCTGGTC CCGCTAGGCC AACTAATTCC CTCGATCCCG ATCCCGGTTG GGGAACTACC GTTCGAGGTA AAGGTGGAGT CAGTGTCCAC CGGCTTGCTG AGCCGGGTGT CCCCGGGGCG GCCACGGGAG GTCCCGGGGC GGGTACGGGC GCGGATCCTG GCGTTGACCA GGACCACTCC TCCACCGGAG ACCGGACTGA GCCACTGGAC GAGCACCGAG ATGGCGCGGT ACCTGAAGCG CCGCGAAGGA GTGTCGGTCT CGCACACCTT CGTGGCCCAG CTGTGGCGGG AGAACGATCT CCAGCCGCAC CGGCACCGAG TCTTCAAGCT CTCGGCGGAC CCGGATTTCG AGGCCAAGGT GGAGGACGTC GTCGGCCTCT ACCTTGATCC CCCCGAGGGC GCCGAGGTCC TGTCGATCGA CGAAAAGCCT GGGGTGCAGG CACGCGACCG GACGCAGCCA CCGCGGCCGG TCGCCTCCGG CCGGGTCGCC ACCCGCACGC ACGACTACCA GCGGAAGGGC ACGACCGACC TGTTCGCCGC CCTCGACGTC GGGACGGGGC GGGTCACCGC CAGGTGCTTC CCCAGCCACA CCAGGGCCGA TTTCCTCACG TTCATGGACC AGGTCATCGC GGAATACGGC GGTGCGGAGC TCCATGTCGT GGTCGACAAT CTGGCCACCC ACTACGGCCC CGACGTCGAC ACATGGCTAC GCAGACACAA GAACGTCACG TTCCATTTCA CCCCGTCCGG CGGTTCATGG CTCAACCAGG TCGAGAACTG GTTCGGTATT CTCACCCGGC ACGCACTCCA GCACGGGGCG TTCGTCTCGG TCCAGGACCT CGTCAACACC ATCAACAACT ATGTCAAGAA CTGGAACTGG GACGCCCATC CGTTCGAGTG GACAGCCACC GCAGAAGAGA TCGTAGCCAA GGTGGAGGTA CTCCACCGGG AATTCAGGAA GCTGCTCGCC AACAACTTGT GA
|
Protein sequence | MSPPEHDDLW PTPQADSLPS GQRPIRPDPT DHDADSAGAT DRPVAGPHSH ASGTPKPRRH RRRRTIAIIL LVLLALFPVF DRITVSMVEK QVVKQLETAV ADTLDCNAPQ PAVRRVNIAG FPFLTQVLLG KPRDVSLSID DLSTPGPRIS SLDANAKGIK IPIYNMITGG DGKLSVDEVR ATVKMSYTDL NAYLAGKTGH LQVKPADGGR SLNISGTVDL PLIGSQQIDG VTTFQVRDNE IELTPSHLTL RGAINLDFLV PLGQLIPSIP IPVGELPFEV KVESVSTGLL SRVSPGRPRE VPGRVRARIL ALTRTTPPPE TGLSHWTSTE MARYLKRREG VSVSHTFVAQ LWRENDLQPH RHRVFKLSAD PDFEAKVEDV VGLYLDPPEG AEVLSIDEKP GVQARDRTQP PRPVASGRVA TRTHDYQRKG TTDLFAALDV GTGRVTARCF PSHTRADFLT FMDQVIAEYG GAELHVVVDN LATHYGPDVD TWLRRHKNVT FHFTPSGGSW LNQVENWFGI LTRHALQHGA FVSVQDLVNT INNYVKNWNW DAHPFEWTAT AEEIVAKVEV LHREFRKLLA NNL
|
| |