Gene Francci3_0409 details


Gene Information

Locus tag: Francci3_0409
Symbol:
ID: 3903232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 481648
End bp: 485805
Gene Length: 4158 bp
Protein Length: 1385 aa
Translation table: 11
GC content: 72%
IMG OID: 637877739
Product: peptidoglycan-binding LysM
Protein accession: YP_479525
Protein GI: 86739125
COG category:
COG ID:
TIGRFAM ID:
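
The coordinate and length fields above are consistent with each other, which a short check can illustrate. This is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 481648
end_bp = 485805

# Assumption: coordinates are 1-based and inclusive at both ends,
# so the span covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not part of
# the translated protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 4158 1385
```

Under these assumptions the result matches the reported Gene Length (4158 bp) and Protein Length (1385 aa).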


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.617719
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACGC CGCTCCCGCC ACCATCGACA CCACTTCGAT CCGCGCGGCC GTCCTCGCCG 
CCGGCATCGA TGACGTCCGC CTCTCCGGCG GCGGCACCGC GGTCACGACG GCGGCTTCCG
CGTGCCGGCG ACCCTCCCTA TCGGATGGTC GATGACAGGG CCGGGAGTGT GCCGGCCCGC
CAGCCGGTGC CGGTGGTCTC CGCGGAGCTG CGGTCTCGGG AACGGCTGCG CGCAGCCGGG
GCCATCGCTG CGATCGCCGG TGTCGTCCTC GGCGTTCCGG TCGTACTGAT CCTGGTACTG
GGTCCACCGC TCCAGAGCAG GGACGGAACC GGCGCGGTCG TCCGAGCCAT CCTGGCCCTG
CTCTTCTGGG CCGCCTGGCT GCACTTCACC GCCTGTCTCA TCGCGGAATG CCGAGCCGAG
CTGCGCGGTA CCGGCCTCGC CCCGCGGATA CCGCTGGGCG GCGCCCCCCA GAACCTCGCC
CGCCGTCTCG TCGCCACATC CGTGTTGTTC TCCGGGACGA CGGTTCTGGT CGCCCCGATG
ACATACGCCC GGGCCCCCGC CGTTCCGCCT GCGCCCGGCC GGATGGGGAC GGCAATGCCG
GTATCGACGA CCTCGGCGGC CCTGGTCGCT CCGCGGCCGG CTCCGGACGA CAGGGCGCCC
GCGCTCACGA GGGGTGATCG GGCTCCGGTG ACCACGACAA CGGATCCCGG CGCCGGCGCA
CCGACCCAGC AGACCTGGCC GTCACGGCCG GAACCGGGGA GCAGCCCGGC TGGCCCGGCT
GGCCGCATGA CACCGCGTGT CCGCCACGCC GAGTTGATCA GTCCGATCGC GGATCCGAAC
GGCCGGCTCA TGCCCAACGG GCGCATGATG AAGCTCTACG AGGTGCGACC TGCCGAGGGA
CGCCACCACG ACACCCTCTG GGGCATCGCG GAGCGCTTCC TCGGTGACGG TCTACGGTAT
CGGGAGATCT TCGACCTCAA CGAGGGTCGT GCCCAGCCGG ACGGCCGCAC GCTGTCCAAG
CCATCGCTCA TCCATGCCGG CTGGGTGCTG CTGCTGCCCG CCGACGCGCA GGGCGAGGGC
CTGCAGACAC TGCATGTACC GGACTCCGTC CCGGGATGGC CTGCCGCCGA AACGGGTACC
GCGCCCCCCA CCCCCGGCGG CTACGACCTC ACTGACCTCG ACAACCACGG CCCCGGCGAC
CCGGGCCTCG ATGATCCAGA ACCCAGCTAT CCCGATTTCG AAGACCCCGG GCCGGGGGAC
AGCGACGGTC GTCTCGGCGC CTCTCGTGCG CTGGCAGTCC TGGATGACAC GGCCGCGGTC
GTCTCGGCGC GGAACACCGC GGTGGTCACC GGTGGCCCCA CTACACCCGG CCTGCTCACG
TCCGACTCCT TCGGACCGTC CTCCCAGGGT CTGCTGGGGA TAGCAGCCGC CAGCCTGCTC
GCCGCCGGAG TCATGGTGGC GTTGTCCACA CGCCGGGCGG GGCCGGCGAG CCCCCCGGAG
GACGAGACAG AGCGCGCGCT GCTCCTGGCC GCCGACACGG ATGCCGCCCG ATTTGTCGAC
CGCTCACTGC GCGTTCTGTC TGCCGGCCTG ACCGATCTGG GACGTCTGCT TCCCCCGGTG
TACGCGGCGA TCCTCACCGA CGACCTGCTG ATCCTGCACA TGGCACCCGC GGAGGAGGAG
CCTCCACCGC CCCCATGGAC CGTTGGCGAG GCGCCCGGGT CCTGGCGGAT CGAGCGGATG
CCCGGTCTGC CCGGTGACCA GCCGGCCCAG ATGATCGCCC AGATGACGGC CGGCGTGCCG
GCTCCGTTCC CGGCCCTCGT CACCTTCGGC CGGGACGACG CCGGGTCACG CATTCTCGTC
GACCTGGAAG GCGCTCCGGG CATCATCAGC CTGATCGGCG ACCTGGACGT CGCGACGGAG
GTCGCGATCG CCGTGGCGAT CGAGCTGGGA ACCAACGTGT GGTCGGATGA TCTGCGGGTC
TGTCTGGTGG GTTTCCCGGC CGACACCGGA GCCGACCTGA CGACGATCGC ACCCGCTCGG
GTCTGGACAG CGGACGACCT GAGCGCCGTC CTCGACGAAC TCGCCGGACC ACAGGACGAC
GCCGGTCGCA CCGGCGTCGA TCCGGCACCG GCAGACCCGC CGACCGTCAC CGCGGGCGGC
CGCGATCGGG TCACCGGCAA TCCACGGGCG CTGGCACCCG ATCTACTCAT TCTCGCGGCC
CCTCCCGCAG CCGCCGACGT CGCCCGGTTG ACCAGGATGG CGAACGGGCG GGACGACGCC
GTGGGAGTCC TCACCGTCGG CGACACCCCG GCCGCCCGCT GGCGCTTCAC GGTCGAGTCG
GACCGCCGCG TGTCCCTCGG CGTTCTCGGC GTCCAGGTAC GGGCGCAGGC TCTGAGCATG
ACCGAGTACA CCTCGATCAC CGCGCTGTTC CATCGGGCCG ATGCGGCGGG ATCTCCCGCC
CGGGCCTCGG GCAGCCTGCC GGACGAGAGT CTGCCGGACG AGGGCCCCCC ACCACCCCCC
GTATCACAGC CGCCCGCCGG ACCGATCGTC GCCGGACCCA CCGCGACCGG GCCGGCAACT
ACCCGGCCCG ACGAGCCGAC AAGGCCACCC TCACAACGCG GCTCCATGCC CGGGGCGCTG
CCCTGGACGC CGAAACTCTC AGCCCTCACC GGGCCTGCCT TCCCCCAGCC GCTCTCGACC
GTCCCCACTC CGGCAGGCCC ACTCCCCGAC CCCGGCTCTG CCACACCGGA GCCACGCGGA
GCGGAACCGC AGGACATGCA CGGTCGCGCC GCGGTCCTGC CCGGCACCGG ATTCCCACCC
ACCATTTCCC CATCGGACGA CGATCCGACG CGACCGCCCT CGAACGGCCG CCGGCTCCCC
ACCGAACCCC CGCCGCCCAC GCCCAAGCCT CCCAGCCCTC CGCGAAGGAC CGCATCGGCC
GGTCCCGTCC TACCGACCAC CACGGACACC GGCGACGCCC CCGAAGCCGC CGGTCTGCTG
GCCCATCCGG GGCGACCGCT CATCCACCCG ACGGATATCC CACGGGTACC ACCCGTGGTC
TACTTCGAGA CGCCCCGGAT GCGAGCGAAC CGGCAGGGCC GCGTCCCGGC CGAATCGATC
CAGCCCGGGA CGGAGGCACC ACCTGGACCC CCTGCCCCCC CGCCGCCAAC CACCGGGTAC
CGGACCGCGA TCCCACCCCG GCCGCCGATC CCGCCTCCGA ACGAACGGCT GGCGGCGGAC
GCGCACCGTG CACCGCCTCA GACTGCTCGG GCGCGCTACG CCGACGGAGA TACCACCGTC
CCGGATGTCG GCGTTTCACG GCCCCCGACG GGGACGCCCG GGGCGGCGCC ACAGCGTGTG
CCGCTCGAAA CCACGCCGCC CGCCGATCCC ACCGCGCCCG CCGAGGCGCA GGTCCGGATT
CTCGGACTGC CCACCGTCGA GGGTCCCGGG CGACCACCCG CCGAGCGCAT CGACCTGTTG
ACCGAGGTCA TCGTCTACCT CGCGCTGCAC CGGGAAGGCG TGGACAGTCG GCTCCTCGCC
GAGGAGATCT GGCCATCCGA CACCACGGAG GAGATCGCCG ACGCCGCGTT GACCGAAGCG
CGCCAGTGGC TGGGGACCGA TGCCAGCGGC TTGCCTCGGA TCATGATCGG CCCGGATGGC
CGGTGGTGGC TGAGTCCGGA TGTGCGCTGC GACTGGGAAC TCTTCGTGGC CTACACCCAC
CGGGCCGGTC TGCCGGGCAG CGACGCCGAG GCGGATCTGA CGACAGCCCT GCGGATGGTC
AGCGGTCCCC TGTGGACGGA TCTGCCTGCC GGGCGGTACC GCTGGGCGAC CACCGGGCCA
ATCGCCCGAA GCACCCGGGC GGGCGTGGTC GACGTGGCGC ATCGCCTCGC CCAGCTGACG
TTGGACTTCG GTGACACGAT GACGGCGATG GCGGCCTGCC GCACCGGGCT ACGGGCCGTG
CCGACCTCCG AAGTGCTCTG GCGGGACCTG CTGCGCACGG TCGCCGCACG GGGTGATCGC
CGGACTCTGC AGGCTGTGGC AACGGAGATG TACCGAACTA TCGGTGTCGG CGGGGTGCGT
CGCGGCGGGC GGGCCGAGGC CGAGACGGAC GCACTGGTGC AGACCCTGCT GCCCGGATTC
CGTCGCGGCC GGCGGTAG
 
Protein sequence
MSTPLPPPST PLRSARPSSP PASMTSASPA AAPRSRRRLP RAGDPPYRMV DDRAGSVPAR 
QPVPVVSAEL RSRERLRAAG AIAAIAGVVL GVPVVLILVL GPPLQSRDGT GAVVRAILAL
LFWAAWLHFT ACLIAECRAE LRGTGLAPRI PLGGAPQNLA RRLVATSVLF SGTTVLVAPM
TYARAPAVPP APGRMGTAMP VSTTSAALVA PRPAPDDRAP ALTRGDRAPV TTTTDPGAGA
PTQQTWPSRP EPGSSPAGPA GRMTPRVRHA ELISPIADPN GRLMPNGRMM KLYEVRPAEG
RHHDTLWGIA ERFLGDGLRY REIFDLNEGR AQPDGRTLSK PSLIHAGWVL LLPADAQGEG
LQTLHVPDSV PGWPAAETGT APPTPGGYDL TDLDNHGPGD PGLDDPEPSY PDFEDPGPGD
SDGRLGASRA LAVLDDTAAV VSARNTAVVT GGPTTPGLLT SDSFGPSSQG LLGIAAASLL
AAGVMVALST RRAGPASPPE DETERALLLA ADTDAARFVD RSLRVLSAGL TDLGRLLPPV
YAAILTDDLL ILHMAPAEEE PPPPPWTVGE APGSWRIERM PGLPGDQPAQ MIAQMTAGVP
APFPALVTFG RDDAGSRILV DLEGAPGIIS LIGDLDVATE VAIAVAIELG TNVWSDDLRV
CLVGFPADTG ADLTTIAPAR VWTADDLSAV LDELAGPQDD AGRTGVDPAP ADPPTVTAGG
RDRVTGNPRA LAPDLLILAA PPAAADVARL TRMANGRDDA VGVLTVGDTP AARWRFTVES
DRRVSLGVLG VQVRAQALSM TEYTSITALF HRADAAGSPA RASGSLPDES LPDEGPPPPP
VSQPPAGPIV AGPTATGPAT TRPDEPTRPP SQRGSMPGAL PWTPKLSALT GPAFPQPLST
VPTPAGPLPD PGSATPEPRG AEPQDMHGRA AVLPGTGFPP TISPSDDDPT RPPSNGRRLP
TEPPPPTPKP PSPPRRTASA GPVLPTTTDT GDAPEAAGLL AHPGRPLIHP TDIPRVPPVV
YFETPRMRAN RQGRVPAESI QPGTEAPPGP PAPPPPTTGY RTAIPPRPPI PPPNERLAAD
AHRAPPQTAR ARYADGDTTV PDVGVSRPPT GTPGAAPQRV PLETTPPADP TAPAEAQVRI
LGLPTVEGPG RPPAERIDLL TEVIVYLALH REGVDSRLLA EEIWPSDTTE EIADAALTEA
RQWLGTDASG LPRIMIGPDG RWWLSPDVRC DWELFVAYTH RAGLPGSDAE ADLTTALRMV
SGPLWTDLPA GRYRWATTGP IARSTRAGVV DVAHRLAQLT LDFGDTMTAM AACRTGLRAV
PTSEVLWRDL LRTVAARGDR RTLQAVATEM YRTIGVGGVR RGGRAEAETD ALVQTLLPGF
RRGRR
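
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a rough sketch of how one might compute GC content and translate the gene sequence using only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not taken from any database tool. The codon table is the standard amino acid assignment, which translation table 11 shares for elongation codons; alternative start codons are not handled.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order.
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove all whitespace (block spaces, newlines) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon.

    Trailing bases that do not fill a codon are ignored. Characters other
    than A, C, G, T raise KeyError in this sketch.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first and last codons of the gene sequence printed above.
print(translate("ATGTCCACGC CGCTCCCG"))  # MSTPLP
print(translate("CGTCGCGGCC GGCGGTAG"))  # RRGRR
```

The two fragments translate to the start (MSTPLP) and end (RRGRR) of the protein sequence listed in this record. Passing the full gene sequence to `gc_content` would give a value to compare against the reported GC content field.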