Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0409 |
Symbol | |
ID | 3903232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 481648 |
End bp | 485805 |
Gene Length | 4158 bp |
Protein Length | 1385 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637877739 |
Product | peptidoglycan-binding LysM |
Protein accession | YP_479525 |
Protein GI | 86739125 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.617719 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGC CGCTCCCGCC ACCATCGACA CCACTTCGAT CCGCGCGGCC GTCCTCGCCG CCGGCATCGA TGACGTCCGC CTCTCCGGCG GCGGCACCGC GGTCACGACG GCGGCTTCCG CGTGCCGGCG ACCCTCCCTA TCGGATGGTC GATGACAGGG CCGGGAGTGT GCCGGCCCGC CAGCCGGTGC CGGTGGTCTC CGCGGAGCTG CGGTCTCGGG AACGGCTGCG CGCAGCCGGG GCCATCGCTG CGATCGCCGG TGTCGTCCTC GGCGTTCCGG TCGTACTGAT CCTGGTACTG GGTCCACCGC TCCAGAGCAG GGACGGAACC GGCGCGGTCG TCCGAGCCAT CCTGGCCCTG CTCTTCTGGG CCGCCTGGCT GCACTTCACC GCCTGTCTCA TCGCGGAATG CCGAGCCGAG CTGCGCGGTA CCGGCCTCGC CCCGCGGATA CCGCTGGGCG GCGCCCCCCA GAACCTCGCC CGCCGTCTCG TCGCCACATC CGTGTTGTTC TCCGGGACGA CGGTTCTGGT CGCCCCGATG ACATACGCCC GGGCCCCCGC CGTTCCGCCT GCGCCCGGCC GGATGGGGAC GGCAATGCCG GTATCGACGA CCTCGGCGGC CCTGGTCGCT CCGCGGCCGG CTCCGGACGA CAGGGCGCCC GCGCTCACGA GGGGTGATCG GGCTCCGGTG ACCACGACAA CGGATCCCGG CGCCGGCGCA CCGACCCAGC AGACCTGGCC GTCACGGCCG GAACCGGGGA GCAGCCCGGC TGGCCCGGCT GGCCGCATGA CACCGCGTGT CCGCCACGCC GAGTTGATCA GTCCGATCGC GGATCCGAAC GGCCGGCTCA TGCCCAACGG GCGCATGATG AAGCTCTACG AGGTGCGACC TGCCGAGGGA CGCCACCACG ACACCCTCTG GGGCATCGCG GAGCGCTTCC TCGGTGACGG TCTACGGTAT CGGGAGATCT TCGACCTCAA CGAGGGTCGT GCCCAGCCGG ACGGCCGCAC GCTGTCCAAG CCATCGCTCA TCCATGCCGG CTGGGTGCTG CTGCTGCCCG CCGACGCGCA GGGCGAGGGC CTGCAGACAC TGCATGTACC GGACTCCGTC CCGGGATGGC CTGCCGCCGA AACGGGTACC GCGCCCCCCA CCCCCGGCGG CTACGACCTC ACTGACCTCG ACAACCACGG CCCCGGCGAC CCGGGCCTCG ATGATCCAGA ACCCAGCTAT CCCGATTTCG AAGACCCCGG GCCGGGGGAC AGCGACGGTC GTCTCGGCGC CTCTCGTGCG CTGGCAGTCC TGGATGACAC GGCCGCGGTC GTCTCGGCGC GGAACACCGC GGTGGTCACC GGTGGCCCCA CTACACCCGG CCTGCTCACG TCCGACTCCT TCGGACCGTC CTCCCAGGGT CTGCTGGGGA TAGCAGCCGC CAGCCTGCTC GCCGCCGGAG TCATGGTGGC GTTGTCCACA CGCCGGGCGG GGCCGGCGAG CCCCCCGGAG GACGAGACAG AGCGCGCGCT GCTCCTGGCC GCCGACACGG ATGCCGCCCG ATTTGTCGAC CGCTCACTGC GCGTTCTGTC TGCCGGCCTG ACCGATCTGG GACGTCTGCT TCCCCCGGTG TACGCGGCGA TCCTCACCGA CGACCTGCTG ATCCTGCACA TGGCACCCGC GGAGGAGGAG CCTCCACCGC CCCCATGGAC CGTTGGCGAG GCGCCCGGGT CCTGGCGGAT CGAGCGGATG CCCGGTCTGC CCGGTGACCA GCCGGCCCAG ATGATCGCCC AGATGACGGC CGGCGTGCCG GCTCCGTTCC CGGCCCTCGT CACCTTCGGC CGGGACGACG CCGGGTCACG CATTCTCGTC GACCTGGAAG GCGCTCCGGG CATCATCAGC CTGATCGGCG ACCTGGACGT CGCGACGGAG GTCGCGATCG CCGTGGCGAT CGAGCTGGGA ACCAACGTGT GGTCGGATGA TCTGCGGGTC TGTCTGGTGG GTTTCCCGGC CGACACCGGA GCCGACCTGA CGACGATCGC ACCCGCTCGG GTCTGGACAG CGGACGACCT GAGCGCCGTC CTCGACGAAC TCGCCGGACC ACAGGACGAC GCCGGTCGCA CCGGCGTCGA TCCGGCACCG GCAGACCCGC CGACCGTCAC CGCGGGCGGC CGCGATCGGG TCACCGGCAA TCCACGGGCG CTGGCACCCG ATCTACTCAT TCTCGCGGCC CCTCCCGCAG CCGCCGACGT CGCCCGGTTG ACCAGGATGG CGAACGGGCG GGACGACGCC GTGGGAGTCC TCACCGTCGG CGACACCCCG GCCGCCCGCT GGCGCTTCAC GGTCGAGTCG GACCGCCGCG TGTCCCTCGG CGTTCTCGGC GTCCAGGTAC GGGCGCAGGC TCTGAGCATG ACCGAGTACA CCTCGATCAC CGCGCTGTTC CATCGGGCCG ATGCGGCGGG ATCTCCCGCC CGGGCCTCGG GCAGCCTGCC GGACGAGAGT CTGCCGGACG AGGGCCCCCC ACCACCCCCC GTATCACAGC CGCCCGCCGG ACCGATCGTC GCCGGACCCA CCGCGACCGG GCCGGCAACT ACCCGGCCCG ACGAGCCGAC AAGGCCACCC TCACAACGCG GCTCCATGCC CGGGGCGCTG CCCTGGACGC CGAAACTCTC AGCCCTCACC GGGCCTGCCT TCCCCCAGCC GCTCTCGACC GTCCCCACTC CGGCAGGCCC ACTCCCCGAC CCCGGCTCTG CCACACCGGA GCCACGCGGA GCGGAACCGC AGGACATGCA CGGTCGCGCC GCGGTCCTGC CCGGCACCGG ATTCCCACCC ACCATTTCCC CATCGGACGA CGATCCGACG CGACCGCCCT CGAACGGCCG CCGGCTCCCC ACCGAACCCC CGCCGCCCAC GCCCAAGCCT CCCAGCCCTC CGCGAAGGAC CGCATCGGCC GGTCCCGTCC TACCGACCAC CACGGACACC GGCGACGCCC CCGAAGCCGC CGGTCTGCTG GCCCATCCGG GGCGACCGCT CATCCACCCG ACGGATATCC CACGGGTACC ACCCGTGGTC TACTTCGAGA CGCCCCGGAT GCGAGCGAAC CGGCAGGGCC GCGTCCCGGC CGAATCGATC CAGCCCGGGA CGGAGGCACC ACCTGGACCC CCTGCCCCCC CGCCGCCAAC CACCGGGTAC CGGACCGCGA TCCCACCCCG GCCGCCGATC CCGCCTCCGA ACGAACGGCT GGCGGCGGAC GCGCACCGTG CACCGCCTCA GACTGCTCGG GCGCGCTACG CCGACGGAGA TACCACCGTC CCGGATGTCG GCGTTTCACG GCCCCCGACG GGGACGCCCG GGGCGGCGCC ACAGCGTGTG CCGCTCGAAA CCACGCCGCC CGCCGATCCC ACCGCGCCCG CCGAGGCGCA GGTCCGGATT CTCGGACTGC CCACCGTCGA GGGTCCCGGG CGACCACCCG CCGAGCGCAT CGACCTGTTG ACCGAGGTCA TCGTCTACCT CGCGCTGCAC CGGGAAGGCG TGGACAGTCG GCTCCTCGCC GAGGAGATCT GGCCATCCGA CACCACGGAG GAGATCGCCG ACGCCGCGTT GACCGAAGCG CGCCAGTGGC TGGGGACCGA TGCCAGCGGC TTGCCTCGGA TCATGATCGG CCCGGATGGC CGGTGGTGGC TGAGTCCGGA TGTGCGCTGC GACTGGGAAC TCTTCGTGGC CTACACCCAC CGGGCCGGTC TGCCGGGCAG CGACGCCGAG GCGGATCTGA CGACAGCCCT GCGGATGGTC AGCGGTCCCC TGTGGACGGA TCTGCCTGCC GGGCGGTACC GCTGGGCGAC CACCGGGCCA ATCGCCCGAA GCACCCGGGC GGGCGTGGTC GACGTGGCGC ATCGCCTCGC CCAGCTGACG TTGGACTTCG GTGACACGAT GACGGCGATG GCGGCCTGCC GCACCGGGCT ACGGGCCGTG CCGACCTCCG AAGTGCTCTG GCGGGACCTG CTGCGCACGG TCGCCGCACG GGGTGATCGC CGGACTCTGC AGGCTGTGGC AACGGAGATG TACCGAACTA TCGGTGTCGG CGGGGTGCGT CGCGGCGGGC GGGCCGAGGC CGAGACGGAC GCACTGGTGC AGACCCTGCT GCCCGGATTC CGTCGCGGCC GGCGGTAG
|
Protein sequence | MSTPLPPPST PLRSARPSSP PASMTSASPA AAPRSRRRLP RAGDPPYRMV DDRAGSVPAR QPVPVVSAEL RSRERLRAAG AIAAIAGVVL GVPVVLILVL GPPLQSRDGT GAVVRAILAL LFWAAWLHFT ACLIAECRAE LRGTGLAPRI PLGGAPQNLA RRLVATSVLF SGTTVLVAPM TYARAPAVPP APGRMGTAMP VSTTSAALVA PRPAPDDRAP ALTRGDRAPV TTTTDPGAGA PTQQTWPSRP EPGSSPAGPA GRMTPRVRHA ELISPIADPN GRLMPNGRMM KLYEVRPAEG RHHDTLWGIA ERFLGDGLRY REIFDLNEGR AQPDGRTLSK PSLIHAGWVL LLPADAQGEG LQTLHVPDSV PGWPAAETGT APPTPGGYDL TDLDNHGPGD PGLDDPEPSY PDFEDPGPGD SDGRLGASRA LAVLDDTAAV VSARNTAVVT GGPTTPGLLT SDSFGPSSQG LLGIAAASLL AAGVMVALST RRAGPASPPE DETERALLLA ADTDAARFVD RSLRVLSAGL TDLGRLLPPV YAAILTDDLL ILHMAPAEEE PPPPPWTVGE APGSWRIERM PGLPGDQPAQ MIAQMTAGVP APFPALVTFG RDDAGSRILV DLEGAPGIIS LIGDLDVATE VAIAVAIELG TNVWSDDLRV CLVGFPADTG ADLTTIAPAR VWTADDLSAV LDELAGPQDD AGRTGVDPAP ADPPTVTAGG RDRVTGNPRA LAPDLLILAA PPAAADVARL TRMANGRDDA VGVLTVGDTP AARWRFTVES DRRVSLGVLG VQVRAQALSM TEYTSITALF HRADAAGSPA RASGSLPDES LPDEGPPPPP VSQPPAGPIV AGPTATGPAT TRPDEPTRPP SQRGSMPGAL PWTPKLSALT GPAFPQPLST VPTPAGPLPD PGSATPEPRG AEPQDMHGRA AVLPGTGFPP TISPSDDDPT RPPSNGRRLP TEPPPPTPKP PSPPRRTASA GPVLPTTTDT GDAPEAAGLL AHPGRPLIHP TDIPRVPPVV YFETPRMRAN RQGRVPAESI QPGTEAPPGP PAPPPPTTGY RTAIPPRPPI PPPNERLAAD AHRAPPQTAR ARYADGDTTV PDVGVSRPPT GTPGAAPQRV PLETTPPADP TAPAEAQVRI LGLPTVEGPG RPPAERIDLL TEVIVYLALH REGVDSRLLA EEIWPSDTTE EIADAALTEA RQWLGTDASG LPRIMIGPDG RWWLSPDVRC DWELFVAYTH RAGLPGSDAE ADLTTALRMV SGPLWTDLPA GRYRWATTGP IARSTRAGVV DVAHRLAQLT LDFGDTMTAM AACRTGLRAV PTSEVLWRDL LRTVAARGDR RTLQAVATEM YRTIGVGGVR RGGRAEAETD ALVQTLLPGF RRGRR
|
| |