Gene Francci3_1684 details

Gene Information

Locus tag: Francci3_1684
Symbol:
ID: 3903071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2020477
End bp: 2022474
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 69%
IMG OID: 637879022
Product: resolvase-like protein
Protein accession: YP_480789
Protein GI: 86740389
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
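
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 2020477
END_BP = 2022474


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1998
print(protein_length(length_bp))  # 665
```

Both results agree with the Gene Length and Protein Length fields.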


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.560816
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCGTG CGACCTCGCG TCGCAAGAGC GCGAACCGCA CCCCCCAGCC CGCCGTCGAC 
CCGCTCGACA CCGTCCGGGT CGGGATCTAC CTGCGGCGCT CCACCGACGA CGAGCACCAG
CCTTACTCCA TCGAAGCCCA GGAAGAACGA CTCCGTTCCT ACATCGACTC CCAACCCGGC
TGGGCCATCG CCCTCCGGTT CTCCGACGAC GCCTCCGGCG CCACCACCGA ACGCGACGAC
CTGCAACGCG CCCTGTCCGC GGCCCGCCAC GGACTGATCG ACGTCCTGCT CGTCTACCGG
GTCGACCGAC TCTCCCGCAA CCTGCGCGAC ACCGTCACCC TGCTCGAAGA ACTTGACCAG
GCTGGCGTCG TGTTCCGCTC GGCCACCGAG CCGTTCGACA CCGCGACCCC GATGGGCCGC
ATGCTGCTCC AGATGCTCGC GATGTTCGCA CAGTTCGAGC GCGACACGAT CATCGACCGG
GTCATCGCCG GCATGGAACG CAAAGCCGCC AAGGGCCTAT GGATGGGCGG CAACCGGCCC
TTCGGCTACC AGGTCGACCG CGCCAACTGG AAACTGCTCG TCGACGAGAA GGAAGCGCCC
GTCGTCCGCC TGATCTTCAA CCTCTACGTC AAGGAACGGG TCGGCACCCG CGCCATCGCC
AAGACCCTCA ACGAGCGCGG CCACCGCACC ACCACCGGCG GACCCTGGTC CGGACACCAG
GTCCTCCGCG TCCTGGACAA CCGCATCTAC CTCGGCGAGC TGACCTTCCG CGAGATCACC
GTCACCGACA CCCACAAGCC GATCATCGAG GCGGCCCAGT TCGCTGAGGC CGAGAAGATC
CTCACCATCC GCAGCGACGG ACACACCCAC CGAGCCGCCA GCGACTCCGA CTACTACCTC
ACCGGCCGCA TGCGCTGCCC GCAATGCGCC AAAGCGATGC TCGGCTCCAA CGCCGGGGGA
CGCAACCGCA CCTACCGCTA CTACACCTGC TTCACCCGCC TGCGCTACAG CCGCGACCGC
TGCGACGCCC CCCGCCTCGA CGCCGACGCC CTCGACCAAG CCGTCCTCAC CGCACTCGCC
GCCTTCTACC GCGACCACCA GCAGCTCATC TCCGACGCCG TCCACCACGC CCGCCAGCGC
CACCACGACG CCCACGCCGA CCGCACCGGC GAACTCGCCA CCGTCCAGGC CGACCTCACC
CAAACCGACC AGGCCATCGA CCGCTACCTA TCCGCGTTCG AGCGCGGCAC CCTCGACGAA
GAAACCCTCG CCACCCGACT CGCCACGCTG CGCACCAAGC AGAAGCAGCT CCGCCGCCGA
CAGACCGAAC TCACCGCCCA GATCGACGAC GAACCCGTCA TGCCCCCACG CGCCACCCTC
AGCAAGATCG CAGGCCACAT CGATACGATC ATCGAGGTCG GCACCGACCT TCAGCGCAAA
GCCCTCGTCG AAGCCCTCAT CCACGAAGTC AAGATCCTCG GCCCCGGACG GCTCCAACCG
GTCTTCAAAG TCCCCAGACC CGAGCCGAGC GAGACCGCCG CAGCCGCCCT ACCAGCCACA
ACGCCCCCGA AGGGAGCGGT TCGTACAATG CCAAACATGG TGGAGCGGGT GGGACTCGAA
CCCACGACCG ACGGAAACAC CGTCGCAACC ACGACGTCCA TACCGTCGGC ACGGTGCCGC
ACGCGTCGTC GAGGTGTCAC CCTGTTCCCG TGGACACACC GTCGATCCAT CAGCCCCGCC
AGCGCCGGTT CACCGACAGC GACGATCCCG GGCCGAACGA AGATCCCGAG CAGCTCCGGC
TCTTCCACCG CTGGCGTGCG GCCCGGTCAC CGTCGTCGGT GGTTCATCTC GACTCGGCCG
GCGCGGCCCG CCCCAGTCGG GCCACGATCG CGGCCGAGGC CGCGTACCTG GAGCGCGAGA
GCGCACTCGG CGCGTACGCC GCCGAGGTGG AGGCAGGCAG CGGGCTCCAC GCCACGCGGG
CCAGGCTCGT CAGCCTGA
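
The GC content field in this record is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown here, in space-separated blocks of 10 bases. `gc_fraction` is a hypothetical helper name, and the example input is only the first two blocks of the gene, not the full sequence.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGGCCCGTG CGACCTCGCG"
print(gc_fraction(first_blocks))  # 0.75
```

Passing the complete gene sequence to the same function gives the figure to compare with the GC content field.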
 
Protein sequence
MARATSRRKS ANRTPQPAVD PLDTVRVGIY LRRSTDDEHQ PYSIEAQEER LRSYIDSQPG 
WAIALRFSDD ASGATTERDD LQRALSAARH GLIDVLLVYR VDRLSRNLRD TVTLLEELDQ
AGVVFRSATE PFDTATPMGR MLLQMLAMFA QFERDTIIDR VIAGMERKAA KGLWMGGNRP
FGYQVDRANW KLLVDEKEAP VVRLIFNLYV KERVGTRAIA KTLNERGHRT TTGGPWSGHQ
VLRVLDNRIY LGELTFREIT VTDTHKPIIE AAQFAEAEKI LTIRSDGHTH RAASDSDYYL
TGRMRCPQCA KAMLGSNAGG RNRTYRYYTC FTRLRYSRDR CDAPRLDADA LDQAVLTALA
AFYRDHQQLI SDAVHHARQR HHDAHADRTG ELATVQADLT QTDQAIDRYL SAFERGTLDE
ETLATRLATL RTKQKQLRRR QTELTAQIDD EPVMPPRATL SKIAGHIDTI IEVGTDLQRK
ALVEALIHEV KILGPGRLQP VFKVPRPEPS ETAAAALPAT TPPKGAVRTM PNMVERVGLE
PTTDGNTVAT TTSIPSARCR TRRRGVTLFP WTHRRSISPA SAGSPTATIP GRTKIPSSSG
SSTAGVRPGH RRRWFISTRP ARPAPVGPRS RPRPRTWSAR AHSARTPPRW RQAAGSTPRG
PGSSA
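
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is one way to check that, not the method used by the source database. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; table 11 differs in its permitted start codons, which this sketch ignores. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGGCCCGTG CGACCTCGCG TCGCAAGAGC"))  # MARATSRRKS
print(translate("CCAGGCTCGT CAGCCTGA"))               # PGSSA*
```

The two outputs match the start and end of the listed protein sequence, with the trailing `*` being the TGA stop codon that is not part of the 665 aa protein.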