Gene Information |
Locus tag | Francci3_0910 |
Symbol | |
ID | 3906285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1053297 |
End bp | 1055528 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637878243 |
Product | recombinase |
Protein accession | YP_480023 |
Protein GI | 86739623 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
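The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1053297
end_bp = 1055528

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2232
print(protein_length)  # 743
```

Both results agree with the Gene Length (2232 bp) and Protein Length (743 aa) fields.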
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCAACC AGAGACCACC AGCGATCCGC GGCGAGCAGG CCATCGCCTA CTACCGCGTC TCATCTTCCG GGCAGGTCAA TACCGACTAC GACCCGGAAG GCATCTCGCT GCCGGCTCAG CGGGTGGCCT GTAAGCAGCG CGCCCGAGAG CTGGGCGTCG TCCTGGTGGA CGAGTACATC GACCCGGGGA AGAGCGGCAA GACGATTGAC CAGCGGCCGG CCTTCCAGGA GATGATCGCC CGCATCAAGG CCGACCGGCA CATCAAGCAC GTCTTCGTCT ACGCCCTGTC GCGGTTCGCG CGGAACCGCT ACGACGATGC GATCATGATG ATGACGCTCG AACGGCTCGG CGTGCAGCTC CACTCGGCCA CCGAGAAGAA CCTCGACACC ACGCCAGCCG GCCAGGCGAT GCACGGCATG ATCGCCGTCT TCAACGAGTA CCAGGTGCGC GTCAGCGGCG AAGACATCAA GTACAAGATG GGCCAGAAGG CCAAGAAGGG CGGCACGCTC GGCGTGGCGC CCTTGGGCTA CCTCAACGTC CGCGAGCAGT TCGAGGGCCG CGAGGTACGC ACGGTGGCGC TTGATCCGGA GCGCGCGCCG TTCGTGGTCA TGGCCTTCGA GCTGTACGCC ACGGGCAAGT TCAACTTCCA TACCCTCCGG GACGCCCTGA CGGAGGCCGG CTTCAGGACC CGGCCCACGA AGCAGTGGGC GAGTCGCCCG ATCTCGATCA ACAAGATTGG CGAGATGCTG CGTGACCGCT ACTACCTCGG CTACGTCCGC TACGACGGCG AGGAGTACGA GGGGCGGCAC GAGCCGCTGA TCTCCCAGGA GCTCTTCGAC CGGGTACAGC GGGTGCTGTA CGCCGAGCGG CGGGCCGGCA CGCGGCACCG GACTCATGAT CACTACTTGA AGGGTCTGGT CTGGTGTGAC CGTTGCCGAC GGCGGCTGAT CATCATGCCT GGCAAGAGCA AGAGCGGCGT CCGGTACTTC TACTACATCT GCCAGGGCCG GCTCGATCAC CAGTGCGACT TGCCATACAT GGCCGTCAGC AGGGTCGAAC GGGCCATCGA GGACTTCTAC GTCAACGTCC GGCTGACGCC CGACTTCCGC GCGACGGTCC AGGCCCACCT CGACGAGATG ATGGCCAGCA CTTCGGACGC CAGTCGCCGC CTGCGGGCCC GGTACGAACG GCAGCTCAAG GAGCTGGACG TCCGGGAGGA CGGTCTGCTG GACCTCGTGG GCGATCCCGA CTGGCCCAAA GAGAAGTTGA CAGCCAAGAT CCGTGCCGTA CGCGAGGAGC GCACCCGGCT GGAAGATCGG CTGGCCGAGA GCGACCGGCC GCTGGACACC GGGCACGAGG TGCTGGCCAC CGTGCTCCGG CTGCTCGAAG ATCCGCAGAC ACTGTACCGG CGCGCGGGTG TCCGTGCCCG CAAGGTCCTC AACATGGCGA TCTTCACCAA GCTGCACGTC GATGTACAAG GCGAGCCGGT GGTGACCTCC GATGACCTCA AGGAGCCGTT CGCGGCCACG GTCTCGGCGC ACCGCGCCTG GAGTCTCACC GAAGCCGTGG ACGGCGTGCT GGCCGATCGG GAGCGCCAGG TTGCCCCGGC ACGACAAAGC GGCGCTCCCC AGGGGGATGA CGCCGCTCTC GATGATCTTT CTGACCGTGA CCTCTTGATC ACTGCTCTTT CGGGCGGGTG TTCAAGTACG GGGGTTCTGG TGCGGGAGGG GGGACTTGAA CCCCCACGCC CGAAGGCGGC AGATCCTAAG TCTGCTGCGT CTGCCATTCC GCCACTCCCG 
CGTCACCTCC AGAGTAGCGA TCGCCTCACC TGGGGGGCGC TCCGCCTCCC GCCGGGTTAT GCCCCGCATC CTCACCAGGA CCGCGGACAA CCGCGGACAA CAGTGGACAA CCGCGGACAA CAGTGGGCAG CCGTGGACGA CCGTGGGCAG CCGCGGGTAG CCATGGAGGC TGCCCCAGTC CGGCCACGAA CCTTCGTAGT GAGCCATTCA CGAGGTTTAC CGGTATCGGC CGGGTTTGGT GGACGCCAGG GCGGACAGAT ATCGACAGTC CAGAGGGCTC ACGCCGCCGG TCGCCGAGGC TCGTCCCGCG GTCGTCGGGA CGACAACCGT GAGAGGGCCG TTGTCGTCAG CCTGACTCGC ACGGCTATCC GACCGGCGGC AGGGTTGTTT CGCGCCGGTT TGGCCAGGAC GACTGGGAGA CGATTCATAT GA
|
Protein sequence | MINQRPPAIR GEQAIAYYRV SSSGQVNTDY DPEGISLPAQ RVACKQRARE LGVVLVDEYI DPGKSGKTID QRPAFQEMIA RIKADRHIKH VFVYALSRFA RNRYDDAIMM MTLERLGVQL HSATEKNLDT TPAGQAMHGM IAVFNEYQVR VSGEDIKYKM GQKAKKGGTL GVAPLGYLNV REQFEGREVR TVALDPERAP FVVMAFELYA TGKFNFHTLR DALTEAGFRT RPTKQWASRP ISINKIGEML RDRYYLGYVR YDGEEYEGRH EPLISQELFD RVQRVLYAER RAGTRHRTHD HYLKGLVWCD RCRRRLIIMP GKSKSGVRYF YYICQGRLDH QCDLPYMAVS RVERAIEDFY VNVRLTPDFR ATVQAHLDEM MASTSDASRR LRARYERQLK ELDVREDGLL DLVGDPDWPK EKLTAKIRAV REERTRLEDR LAESDRPLDT GHEVLATVLR LLEDPQTLYR RAGVRARKVL NMAIFTKLHV DVQGEPVVTS DDLKEPFAAT VSAHRAWSLT EAVDGVLADR ERQVAPARQS GAPQGDDAAL DDLSDRDLLI TALSGGCSST GVLVREGGLE PPRPKAADPK SAASAIPPLP RHLQSSDRLT WGALRLPPGY APHPHQDRGQ PRTTVDNRGQ QWAAVDDRGQ PRVAMEAAPV RPRTFVVSHS RGLPVSAGFG GRQGGQISTV QRAHAAGRRG SSRGRRDDNR ERAVVVSLTR TAIRPAAGLF RAGLARTTGR RFI
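The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The following is a minimal sketch of that translation, not the database's own pipeline. The function name `translate_cds` and the constants are hypothetical. The codon table is built from the NCBI amino-acid string shared by tables 1 and 11, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons listed for translation table 11 (assumption: NCBI list).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons are translated as Met
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGATCAACC AGAGACCACC AGCGATCCGC"))  # MINQRPPAIR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should give the complete 743-residue protein, given that the record's sequence is intact.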