Gene Francci3_4124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4124 
Symbol 
ID3907089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4926107 
End bp4927405 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content68% 
IMG OID637881452 
Productrecombinase 
Protein accessionYP_483201 
Protein GI86742801 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCGCC CGGACTACGC GGAACTGTTG GCGGACCTGC GGTCAGGGTT TCTGGACGGG 
GCGATCGTGT GGGACCTGGA CCGTCTCACC CGGGACCTCC GCGACCTGGA AGACGCCATC
GAAGTCGTCG AGCTGTACGG CCGCCCGATC GTCGGGCCCG GTGTCGACCT GACCACCGAG
TACGGGAAGG CGAACGCCCG CGCGCAGGCG GTCGCGGCGA ACAAGGCCAG CGCGGACACC
TCCCGCCGGG TCCGCCGGTC CCACAAGCAG CGCGCCGAAC GCGGGGTGCC GGTCGGGGGC
AGCCGTCCGT TCGGCTGGAA GGACGACAAG CGGACGTTGG ATCTCACCGA GGCCGCGATC
CTGCGGGAAG GGGCCCGCCG GATCCTCGCC GGGGTGCCGG TCGTCGATCT GGTGAACGAG
TGGAACGCCG CCGGTGTCCG GGGGACCCGG GGGAAGAAGT GGACCAAGAG CAGTGTGTTG
AAGGTCTACC GCAACCCGCG GATCTGCGGC CTCCGCAGCC GGGGTGTGGA GGAACCGAAC
ATCAACGGCC AGGTCGCGAA GTACATGCAG GTCGTCACCC GCAAGGAACG CACCCCGGAC
GGGCGGACGA TCGAGGTGCC GGTGAAGGGC CAGTGGAAAG CGATCATCGG TGTGCGCCGG
TGGGACCAGG TGATCGCGAA GATCGGGGAC CGGACCTATG CCCAGCAGGG GCATAACTCT
CGCCGCTACC TGCTCAGCGG TGTGGTGGCG TGTGGCCGGT GTGGCCGGTC AATGTTCGGG
TCACCGCCGT ACCGGGAACG TAAGCACGCG ATCTACCGGT GTCCCGCTCC GACACAGGGC
GGATGCGGGA AAGTCTCCCG CCACGGACCC CACACCGATG ATCATATTCT GGCGGCGTTG
TTCCACAAGA TCGAGCTGGA GACCGCGAGC GCTGTCGTCG ACGTCGCCCC CTGGACGGGA
GAAGCCGCGT TGGCCGAGGT CCAGGAGAGC ATCACCGAGA CACGCGCCGC GTGGACGTCG
GTGCCCCGGC GGATTTCCCC GAAGGACTAC TTCCCGACCA TGGAAGACCT CCGAGCCCAG
GAAGAAATCC TGCTGAGGGA ACGCAACGAT CATCTGGTGG CCACGGCGAA CGCGCACGCC
CGCCCCGCCG ACGTCCGCGC CGAGTGGGAC GGCTACTCGT TGGCCCGCCA GCGCGCGATC
ATCAAAGAGC ATCTGATTGC CGTGGTCGTT CACCCTGCCG GCCGGGGCCG CCGGTTCGAC
CCCGACCTGT TGGACCCGGT CTGGCGGGAG GAGACCTAA
 
Protein sequence
MIRPDYAELL ADLRSGFLDG AIVWDLDRLT RDLRDLEDAI EVVELYGRPI VGPGVDLTTE 
YGKANARAQA VAANKASADT SRRVRRSHKQ RAERGVPVGG SRPFGWKDDK RTLDLTEAAI
LREGARRILA GVPVVDLVNE WNAAGVRGTR GKKWTKSSVL KVYRNPRICG LRSRGVEEPN
INGQVAKYMQ VVTRKERTPD GRTIEVPVKG QWKAIIGVRR WDQVIAKIGD RTYAQQGHNS
RRYLLSGVVA CGRCGRSMFG SPPYRERKHA IYRCPAPTQG GCGKVSRHGP HTDDHILAAL
FHKIELETAS AVVDVAPWTG EAALAEVQES ITETRAAWTS VPRRISPKDY FPTMEDLRAQ
EEILLRERND HLVATANAHA RPADVRAEWD GYSLARQRAI IKEHLIAVVV HPAGRGRRFD
PDLLDPVWRE ET