Gene Francci3_2551 details


Gene Information

Locus tag: Francci3_2551
Symbol:
ID: 3904042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3014471
End bp: 3015709
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 68%
IMG OID: 637879878
Product: recombinase
Protein accession: YP_481644
Protein GI: 86741244
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
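
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. Both are assumptions, but they agree with the reported 1239 bp and 412 aa. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3014471
END_BP = 3015709


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1239
    print(protein_length(length))  # 412
```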


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGTACGAGG GCGCCCTGGA GGACTTGAAG CGGGGAGCCA CCAAAACCGG CAAGCCCCTT 
GACGGCCTCA TCGTCTCCGA CGTGGACCGG CTCACCCGCG ACCCGCGCCA CCTGGAAGAC
GCCATTGACG TGGTGGTCCA GTACGGGCGG CCCGTCATTG ACATCAGCGG CACCCTTGAC
CTCCTCACGG ACAACGGGCG CTCCGTGGCC CGGATCGTCG TGGCCCTGAA GAACCAGCAG
TCCGCCGACA CCTCCCGGCG GGTGCGGACG GCCCACCGCG AGCTCGCCAA GGCGGGTGTC
CCCGTGGGCG GCTACCGGCC CTTCGGCTGG GAGCCGGACA AGCGCACCAT CCGCAAGGCC
GAGGCCGACA TGATCGTCGT CGGTGCTGAC GAAATCCTTG CCGGGGTCGG CACGCACACG
CTCTGCCGCC GCTGGAATGA GCTGGGCATT CTCACGACCC GGGGCCACCA GTGGCAGCGC
CAGGTGATGA AGAACATGTA TCTGTCTCCC CGCCTCGCGG GCTACCGGGT ACACGGCCCG
ACCACGGTTC CCTTGGAGCA GCGCTACGCC TGCACCGTAG ACGGCCAGCC CGTCATGGGC
CTTCAAGGTC ACATCCTGGA CGTCGAGGTC TGGGAGAAGG TGGTGGCGAA GCTCCGCGAC
CCCGCTCGGA CCGGCAACCA GAACATCCAC ATAGGCGGCC GGAAGTATCT GCTGTCCGGA
ATCATCCGCT GCGCCTACTG CGGCGCGCGG CTCACCGGCG GCTGGGACAA GGGGTGGAAG
AAGCATCACT ACTCCTGCCG TCCCGTCACC GCTGGTGGCT GCGGCAGCGT CGCCGTGACC
GGCCACCACG TTGACGAACT GGTCACGAAC TTGGTGCTGG CCTATCTGGC AAACCGGGAC
GTGGAAGCCG AAAGCGGCCC GTGGCCCAAG GCGGCCGAGG CGGAGATCGC CGAGATCATG
GCCGCCTGGC GGGAGACGAA GCGCGGCGGC ACCCGCGCCC TCCAGATGGT GGAGGAGCTG
GAGGGGGACG TGGCGAAGCT CCGGGGGGAG CGCAACGACT GGCTGCGCGC CCACTCCGGC
CCCCAGCTGA CCAACGTGGC CTCGTCCTGG CCTCGGCTGG AGGTGGAGCA GCGCCGTGAC
ATCATCGCCA CCGTTATCGA GGCCGTGGTG CTGTCCAAGG CCGATGGGCC GAAGAACCGT
TTTGACCCGG ATCGCATCAC CGTGATCTGG CGTCCCTGA
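
A GC percentage such as the one reported for this gene can be recomputed from a sequence dump like the one above. This is a minimal sketch. It assumes the value is the share of G and C bases over the full gene sequence, since the record does not say how its figure was computed or rounded. The function name `gc_content` is illustrative.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base blocks and line breaks used in the dump)
    is ignored, and the comparison is case-insensitive.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


if __name__ == "__main__":
    # First line of the gene sequence, pasted with its spacing intact.
    first_line = "GTGTACGAGG GCGCCCTGGA GGACTTGAAG CGGGGAGCCA CCAAAACCGG CAAGCCCCTT"
    print(round(gc_content(first_line), 1))
```

Passing the complete gene sequence gives a figure to compare with the GC content field.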
 
Protein sequence
MYEGALEDLK RGATKTGKPL DGLIVSDVDR LTRDPRHLED AIDVVVQYGR PVIDISGTLD 
LLTDNGRSVA RIVVALKNQQ SADTSRRVRT AHRELAKAGV PVGGYRPFGW EPDKRTIRKA
EADMIVVGAD EILAGVGTHT LCRRWNELGI LTTRGHQWQR QVMKNMYLSP RLAGYRVHGP
TTVPLEQRYA CTVDGQPVMG LQGHILDVEV WEKVVAKLRD PARTGNQNIH IGGRKYLLSG
IIRCAYCGAR LTGGWDKGWK KHHYSCRPVT AGGCGSVAVT GHHVDELVTN LVLAYLANRD
VEAESGPWPK AAEAEIAEIM AAWRETKRGG TRALQMVEEL EGDVAKLRGE RNDWLRAHSG
PQLTNVASSW PRLEVEQRRD IIATVIEAVV LSKADGPKNR FDPDRITVIW RP
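
The protein sequence can be reproduced from the gene sequence using translation table 11, the bacterial, archaeal and plant plastid code. The sketch below is self-contained and assumes the gene sequence is given 5' to 3' on the coding strand. It uses the standard codon assignments, which table 11 shares. It also reads an alternative start codon in the first position as methionine, which is why the leading GTG gives M. The start codon set is taken from the NCBI definition of table 11, and the function name `translate_cds` is illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; whitespace in the input is ignored."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid start codon initiates with methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGTACGAGG GCGCCCTGGA GGACTTGAAG"))  # MYEGALEDLK
```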