Gene Francci3_3989 details

Gene Information

Locus tag: Francci3_3989
Symbol:
ID: 3906949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4772901
End bp: 4774742
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 66%
IMG OID: 637881317
Product: recombinase
Protein accession: YP_483068
Protein GI: 86742668
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
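
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are inclusive, 1-based coordinates and that the Protein Length excludes the terminal stop codon. Neither convention is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 4772901
end_bp = 4774742
gene_length_bp = 1842
protein_length_aa = 613

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon that is not counted as a residue.
codons = span // 3

print(span == gene_length_bp)           # coordinates agree with Gene Length
print(span % 3 == 0)                    # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under these assumptions all three checks print `True`, so the reported lengths and coordinates are mutually consistent.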


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGCGCT CGACTGCGCG CCGTACTACC GCGAAACGCA CCAAGCAGGC CGTCGCCGCG 
CAACCCGAGA TCGTCCGGGT CGGGATCTAC CTGCGTCGCT CCACTGACGA CGAGAACCAG
CCCTACACGA TCGAGGCCCA GGAAGAACGG CTGCGCTCCT ACGTCGACTC CCAACCCAAC
TGGATCGTCG CCCTGCGGTT CGCCGACGAC GCCTCCGGCG CCACCACCGA ACGCAAGGAC
CTCCAGCGGG CCCTGGCCGC CGCCCGCAAC GGGCTCATCG ACGTCCTGCT CGTCTACCGC
GTCGATCGGC TCTCCCGCAG CCTGCGCGAC ACTGTCGATC TCCTCGAAGA ACTCGAACAG
GCCGGGGTCG TGTTCCGCTC GGCCACGGAG CCGTTCGACA CCGCCACTCC CATCGGCCGC
ATGCTTCTCC AGATACTGGC GATGTTCGCG CAGTTCGAGC GCGATATGAT CATCGACCGG
GTCATCGCCG GGATGGAACG CAAGGCCGCC AAGGGTCTGT GGAAAGGCGG ACGGCGCCCG
TTCGGCTACC AGGTCGACAA GATCGCCAAA AAGCTCATCA TCGATGTCGC CGAAGCCACG
ATCGTCCGGC TGATCTTCGA CCTGTACGTC CGCGACCGCC TCGGAACCAG AGCCATCGCG
AGCGTCCTGA ACAACCGCGG CCTGCGTACC ACCGTCGGCG GCCCCTGGTC CGGCCACAAG
ATCCTGCGCA TGCTCGACAA CAGAGCCTAT CTGGGTGAAC TGACCTTCCG CGAGATCACC
GTCGAAGGCA CCCACGAACC GATCATCGAC GAGGAGACGT TCGACGCCGC TCAGAAGATC
CTCACCGAAC GCAGCGAGGA GACCTCACGC CGGGCCTCCA ACCCCTCCGA CTACTACCTC
ACCGGCCGGA TGCGCTGCCC CCAGTGCGGC ACAGCCCTCA TCGGCACCCG CGCGACCGGC
CGCAACCACA CCTACCGCTA CTACACCTGC CACACCCGCA ACCGCTACAA CCGCCACGAA
TGCGACGCCC CCCGCCTCGA CGCCGACGCC GTCGACTACG CCGTACTCAC CGCTCTTGCC
GGCTTCTACC GCGACCACCA GCAGCTCATC GCCGACGCCG TCCTCAAAGC CCAGCGCAGC
CACCGCGACG CCCGATCCGA ACACACGGCC GAACTCAGCA CCATTGAGAC GGAACTCACC
CTGACAGACC AGGCCATCGA CCGTTATCTT GGGGCGTTCG AGCGCGGTAC CCTCGACGAC
GAGACCCTCG CCACACGCCT CGAAGCACTA CGCACCAAGC AGAAGCAGCT CCGCCAACGG
CAGGCCGAAC TCACCGAAGA GATCGACCAC GAACCCGTTA TGCCGGCCCG TTCCAGCCTT
CGCGCAGTCA CCCGGCACAT CGAAACGATT ATCGAGACAG GCGACGACCT CGGACGCAAG
GCTCTCATCG AAGCCTTGGT CGCCGAAGTA AAGATCACCG GACCCGACCG GCTCACACCG
ATCTTCAAGG TCCTCGGACC CGACGCTCCA AGGGACGTCA CCAACGTAGA CACTGGAGAC
ATCAGCCAGC CGAAGCTGAC CCAACCAGCC ACGTCCCCCG CCCCCCACAG GGGCGCCGCA
GCTGTCCTAC CAGCCACAAC GCCCCCGAAG GGAGCGGTTC GCGCAATGCC TACATTGGTG
GAGGTGAGGG GACTCGAACC CCTGGCCTCC TCCGTGCGAG GGAGGCGCTC TACCAGGCTG
AGCTACACCC CCTGGAAGCA TCTGAAGCTT ACAGGACCGC GTAGCCCATG TGACGACCAC
CCGGTTCGGA TGGATCGCCA CCTCCGGCCG GCCGTCAGCT GA
 
Protein sequence
MPRSTARRTT AKRTKQAVAA QPEIVRVGIY LRRSTDDENQ PYTIEAQEER LRSYVDSQPN 
WIVALRFADD ASGATTERKD LQRALAAARN GLIDVLLVYR VDRLSRSLRD TVDLLEELEQ
AGVVFRSATE PFDTATPIGR MLLQILAMFA QFERDMIIDR VIAGMERKAA KGLWKGGRRP
FGYQVDKIAK KLIIDVAEAT IVRLIFDLYV RDRLGTRAIA SVLNNRGLRT TVGGPWSGHK
ILRMLDNRAY LGELTFREIT VEGTHEPIID EETFDAAQKI LTERSEETSR RASNPSDYYL
TGRMRCPQCG TALIGTRATG RNHTYRYYTC HTRNRYNRHE CDAPRLDADA VDYAVLTALA
GFYRDHQQLI ADAVLKAQRS HRDARSEHTA ELSTIETELT LTDQAIDRYL GAFERGTLDD
ETLATRLEAL RTKQKQLRQR QAELTEEIDH EPVMPARSSL RAVTRHIETI IETGDDLGRK
ALIEALVAEV KITGPDRLTP IFKVLGPDAP RDVTNVDTGD ISQPKLTQPA TSPAPHRGAA
AVLPATTPPK GAVRAMPTLV EVRGLEPLAS SVRGRRSTRL SYTPWKHLKL TGPRSPCDDH
PVRMDRHLRP AVS
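
The sequences above are printed in blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that, with a simple GC-fraction helper; the function names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database. Only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGCCGCGCT CGACTGCGCG CCGTACTACC GCGAAACGCA CCAAGCAGGC CGTCGCCGCG"
last_line = "CCGGTTCGGA TGGATCGCCA CCTCCGGCCG GCCGTCAGCT GA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(head[:3])   # first codon of the CDS
print(tail[-3:])  # last codon of the CDS
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the GC content reported in the record.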