Gene Francci3_0910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0910 
Symbol 
ID3906285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1053297 
End bp1055528 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content66% 
IMG OID637878243 
Productrecombinase 
Protein accessionYP_480023 
Protein GI86739623 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCAACC AGAGACCACC AGCGATCCGC GGCGAGCAGG CCATCGCCTA CTACCGCGTC 
TCATCTTCCG GGCAGGTCAA TACCGACTAC GACCCGGAAG GCATCTCGCT GCCGGCTCAG
CGGGTGGCCT GTAAGCAGCG CGCCCGAGAG CTGGGCGTCG TCCTGGTGGA CGAGTACATC
GACCCGGGGA AGAGCGGCAA GACGATTGAC CAGCGGCCGG CCTTCCAGGA GATGATCGCC
CGCATCAAGG CCGACCGGCA CATCAAGCAC GTCTTCGTCT ACGCCCTGTC GCGGTTCGCG
CGGAACCGCT ACGACGATGC GATCATGATG ATGACGCTCG AACGGCTCGG CGTGCAGCTC
CACTCGGCCA CCGAGAAGAA CCTCGACACC ACGCCAGCCG GCCAGGCGAT GCACGGCATG
ATCGCCGTCT TCAACGAGTA CCAGGTGCGC GTCAGCGGCG AAGACATCAA GTACAAGATG
GGCCAGAAGG CCAAGAAGGG CGGCACGCTC GGCGTGGCGC CCTTGGGCTA CCTCAACGTC
CGCGAGCAGT TCGAGGGCCG CGAGGTACGC ACGGTGGCGC TTGATCCGGA GCGCGCGCCG
TTCGTGGTCA TGGCCTTCGA GCTGTACGCC ACGGGCAAGT TCAACTTCCA TACCCTCCGG
GACGCCCTGA CGGAGGCCGG CTTCAGGACC CGGCCCACGA AGCAGTGGGC GAGTCGCCCG
ATCTCGATCA ACAAGATTGG CGAGATGCTG CGTGACCGCT ACTACCTCGG CTACGTCCGC
TACGACGGCG AGGAGTACGA GGGGCGGCAC GAGCCGCTGA TCTCCCAGGA GCTCTTCGAC
CGGGTACAGC GGGTGCTGTA CGCCGAGCGG CGGGCCGGCA CGCGGCACCG GACTCATGAT
CACTACTTGA AGGGTCTGGT CTGGTGTGAC CGTTGCCGAC GGCGGCTGAT CATCATGCCT
GGCAAGAGCA AGAGCGGCGT CCGGTACTTC TACTACATCT GCCAGGGCCG GCTCGATCAC
CAGTGCGACT TGCCATACAT GGCCGTCAGC AGGGTCGAAC GGGCCATCGA GGACTTCTAC
GTCAACGTCC GGCTGACGCC CGACTTCCGC GCGACGGTCC AGGCCCACCT CGACGAGATG
ATGGCCAGCA CTTCGGACGC CAGTCGCCGC CTGCGGGCCC GGTACGAACG GCAGCTCAAG
GAGCTGGACG TCCGGGAGGA CGGTCTGCTG GACCTCGTGG GCGATCCCGA CTGGCCCAAA
GAGAAGTTGA CAGCCAAGAT CCGTGCCGTA CGCGAGGAGC GCACCCGGCT GGAAGATCGG
CTGGCCGAGA GCGACCGGCC GCTGGACACC GGGCACGAGG TGCTGGCCAC CGTGCTCCGG
CTGCTCGAAG ATCCGCAGAC ACTGTACCGG CGCGCGGGTG TCCGTGCCCG CAAGGTCCTC
AACATGGCGA TCTTCACCAA GCTGCACGTC GATGTACAAG GCGAGCCGGT GGTGACCTCC
GATGACCTCA AGGAGCCGTT CGCGGCCACG GTCTCGGCGC ACCGCGCCTG GAGTCTCACC
GAAGCCGTGG ACGGCGTGCT GGCCGATCGG GAGCGCCAGG TTGCCCCGGC ACGACAAAGC
GGCGCTCCCC AGGGGGATGA CGCCGCTCTC GATGATCTTT CTGACCGTGA CCTCTTGATC
ACTGCTCTTT CGGGCGGGTG TTCAAGTACG GGGGTTCTGG TGCGGGAGGG GGGACTTGAA
CCCCCACGCC CGAAGGCGGC AGATCCTAAG TCTGCTGCGT CTGCCATTCC GCCACTCCCG
CGTCACCTCC AGAGTAGCGA TCGCCTCACC TGGGGGGCGC TCCGCCTCCC GCCGGGTTAT
GCCCCGCATC CTCACCAGGA CCGCGGACAA CCGCGGACAA CAGTGGACAA CCGCGGACAA
CAGTGGGCAG CCGTGGACGA CCGTGGGCAG CCGCGGGTAG CCATGGAGGC TGCCCCAGTC
CGGCCACGAA CCTTCGTAGT GAGCCATTCA CGAGGTTTAC CGGTATCGGC CGGGTTTGGT
GGACGCCAGG GCGGACAGAT ATCGACAGTC CAGAGGGCTC ACGCCGCCGG TCGCCGAGGC
TCGTCCCGCG GTCGTCGGGA CGACAACCGT GAGAGGGCCG TTGTCGTCAG CCTGACTCGC
ACGGCTATCC GACCGGCGGC AGGGTTGTTT CGCGCCGGTT TGGCCAGGAC GACTGGGAGA
CGATTCATAT GA
 
Protein sequence
MINQRPPAIR GEQAIAYYRV SSSGQVNTDY DPEGISLPAQ RVACKQRARE LGVVLVDEYI 
DPGKSGKTID QRPAFQEMIA RIKADRHIKH VFVYALSRFA RNRYDDAIMM MTLERLGVQL
HSATEKNLDT TPAGQAMHGM IAVFNEYQVR VSGEDIKYKM GQKAKKGGTL GVAPLGYLNV
REQFEGREVR TVALDPERAP FVVMAFELYA TGKFNFHTLR DALTEAGFRT RPTKQWASRP
ISINKIGEML RDRYYLGYVR YDGEEYEGRH EPLISQELFD RVQRVLYAER RAGTRHRTHD
HYLKGLVWCD RCRRRLIIMP GKSKSGVRYF YYICQGRLDH QCDLPYMAVS RVERAIEDFY
VNVRLTPDFR ATVQAHLDEM MASTSDASRR LRARYERQLK ELDVREDGLL DLVGDPDWPK
EKLTAKIRAV REERTRLEDR LAESDRPLDT GHEVLATVLR LLEDPQTLYR RAGVRARKVL
NMAIFTKLHV DVQGEPVVTS DDLKEPFAAT VSAHRAWSLT EAVDGVLADR ERQVAPARQS
GAPQGDDAAL DDLSDRDLLI TALSGGCSST GVLVREGGLE PPRPKAADPK SAASAIPPLP
RHLQSSDRLT WGALRLPPGY APHPHQDRGQ PRTTVDNRGQ QWAAVDDRGQ PRVAMEAAPV
RPRTFVVSHS RGLPVSAGFG GRQGGQISTV QRAHAAGRRG SSRGRRDDNR ERAVVVSLTR
TAIRPAAGLF RAGLARTTGR RFI