Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1350 |
Symbol | |
ID | 5669759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1622148 |
End bp | 1628288 |
Gene Length | 6141 bp |
Protein Length | 2046 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641240277 |
Product | YD repeat-containing protein |
Protein accession | YP_001505704 |
Protein GI | 158313196 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGCAT CCCCCGTACG TCGTCGTAGG TCATTCATCC GTTCTCTCTG CGTTCTGTTG GCCGCGTTGC TGGCGGCGAC CGTCACCCAG TTCCTGCCAC CCGTGAGCGG TGCCGCGTCC GCCGCACCCA CGGACGCCGG CACGGAGGTA CCACCTCCGA CCGGCCCAGC CGAACGTCCT GACCTGATCA CGGCTCAGCT CACCGCGCGG GCGGAGCAAC GCCGGATCGA GGTGACGTCC CTGACCACGG AGTCGACGCG GACCCTGGTC AACCCCAACG GCTCCATCAC GGTGGAATCC ACCTCCGGCA TCGCCCGCGT CCACCGTGAC GGACGCTGGC TACCCGTCGA CACCACCCTC GTCCTCACCG ACGGCAAGAT CACCCCCAAG GTGGCCAAGG CCGCCATCGC CCTGTCCGCA GGCGGCAAAG ACGCCGGCGA CATCGCGACG CTGCGCGACG GCGACCGGGA GATCGCCTTC GGCTGGCCCA CGGCGCTGCC CACGCCCGAG CTGAAGGACA ACGTCGCCAC CTACAAGGCG GTCGCGCAGG ACACCGACCT TCGCGTGAAG GTGACCCCGA CCGGCTACGA CGTCCAGATC GTCGCGCACA CCCCGGCCGC GGCCCAGGCG GCACTGAGCC TGCCGATGCG GTTGAAAGGC GTCACCGCCG AACGCACCGC CGGCGGCGAG CTGCGCCTGT CTGCCGCCGG GAAGATCGCC GCCCGGTCCC CGACCCCGCT GATGTGGGAC GCGCACGTCG ACGCCAAGAC TGGGCTCCCG GACCGGACCC ACACCGTCGA GGCGGCCCTC GAGCGGGCGG GCACCGCCAC CCCTGCCCTC GCCTTGCGGC CAGACAAGGC CTGGCTGACC GCCGATGACC GCCAGTACCC GGTCACGATC GACCCGGCCG CCGTCCTACC GGACAACCTC GACACCGACG TGGTCAACAC GTCAGCGACC ACGAACTACG ACACCTACGA GTACCTGCGG GTCGGGAACG TCTTCGGGTC CGTGCACCGG TCGTTCCTGC GGTTCGACAC CTCCGCCATC GAGGGCAAGC ACGTCACCGC AGCAGCACTG AAACTCACTC AGGCCGGCTC CTACACCTGC ACGCCGCAGC GGATGGTCGT GCAGGGCTCG GCCGGCCTGG CGTCGGGGGC GACGTGGAAC ACCCAGCCCA CCGCGGACGG GGTGAACTGG TCGGACACCA CCTTCAACGC CGGCGGGTTC TGCGGCTCCG GCGATGTCTC CCTCGACATC ACCGGACTGG CACAGGCATG GTCATCCAAC GGCCAGCCCT CACCGGAAAC CCTCACCCTG CGGGCGCCCG ACGAGGTGCT GTGGGAGCCC TACAAGTACT TCACCTCCGG CGACACGGCC ACCCCGCCGC GGATCGAGAC AACCTACAAC TCCTACCCGG CCACCGTCGG CGCCCGCACC ACCTCCCCCT GCTCAGCCCA GTGCGGGGGC ACGCCGCCGA CGGTGCTGAC AAACTCGACG ACGCCACGAC TGGCCGGATC GGCGACCGAC GCGGACGGTG GCACCCTTCG CCTCGACTTC GAGGTATGGA ACTCCGCCGG CACCACAAAG ATCACCCAGG GGTCCAGCGG ATTCGTCGCG CAGAACAGTG AGGCGGCTTG GACCGTCCCC TCCGGGCTGC TCGCCAACGG CACCAGCTAC CAGTGGCGGG TCCGCGCCTA CGACGGCACC GACTACTCCC AGAACTGGTC CACCTGGATC CCGTTCACCA TCGACACCAC CGCCCCCGCC ACCCCCACCG CCCTCGCCTC CACCGCATGG CCATCGGGTG GCTGGGCCAG CGCCACCTCG GGCACATTCA CCTGGACCTC ACCCGGCGGT GACACCGCCG GATTTCTCTA TGGCCTTGAC GAGCCATCCC CAACGACCTC ACCGACCCCA CCGACCGGCA CGACCAGTGC CTCGCTGACC GCCGACGAGG GACTGCACAC CTTCTACCTG CGCACCATCG ACACCGCCGG AAATCTCTCC CCCGTCATCT CCTATAACTT CGGCGTCGGC AACGCCGCGC TCGCCTCACC AGCCGAGCAG TCCCGCACCC AGCGATTCGC CACCCTGCGC GGCGAGGCAC CCTCATCGCA GGTATCGGTG ACCTACCGCT ACCGGGTCGG GACCAACCCG TCTACCGCGT GGACGGACGT CCCCACCGCC GAAACCACCA CCCAGGGCAC CACCAACCAC CCCACCTGGC CCGCCCCCCG CAACGGCTCA GGCGCCTTCG ACAACCAGAT CTGGAACATC CCCGGCACCC TCGGTGGCGG CTCCGACGGC CCCATCCAGA TCCAGGCCTG CTTCCGCACC GCAGCCGCAG TCGTCACCTG CACGGATGCG ACCACCATCC AGCTCACCCG CAATGCCTTC AGCGACACCG ACGCGGTCGG CGAGCTCGGC CCGGGATCAC TAGCCCTACT CACGGGAGAT TATTCCCTGT CGGCGACCGA CGTGTCGATG CCGACCTACA CCGGCACTCT CGCCGTCGGC CGCACGCTGA CCACACTGAC CCCGCCCGCA GCCACCACCG CCCCTAACGG GATCTTTGGC CCCGGGTGGA CATCGAGCGT TCCCGGTCCC GAAGCCGGAT CGGGAGACCG CACCCTGACC GATAGCACGG CCACCGAGGG GTACGTTACT CTCACCTCAC CCGAAGGCGC CCCCGCCGTC TACACCCGCG CTGGCACAGG TAGCTACCCT TACGAGTACA CCGGTGTTGC CGACACCGCC GCCGACGGAT CCAAGCTCAT CAAAGACTCG GCGACCAAGT TCACCCTCAC CGAGATCGAC GGCGCGAAGA CGATTTGGTA TTCCAAGACC ATCGGCAGCG CCACCATCTG GGTCGTCGAC CGCGTCGAGG AACCCGGCTC CAACACCACC TCTACCTTCA CCACCGACAG CCTCGGCCGC ATCACCCGCA TCCTCGCTCC CGTCCCCAGC GGCGTCGACT GCTCCGGCAC ACTCGGCGCC GGCTGCCGTG CTCTCACTCT TACTTACGCC ACCAGCACCA CCGCGACGGG ATCCGGTGGT AACCCTGCCG AATGGGGTAA CTACACCGGA CAGCTATCCT CGATCTCCAT GGCTCTGAAC GGTGCCACCT CCATCGAGGT CGCCCGCTAC AGCTACGACA CCACTGGCCA TCTACGCGCC GAATGGGATC CCCGCCTCGA CACCGCCGGC GGCAACCACC TCGCTACCCT CTACTGGTAC AACGCCGAAG GCCGCGTCCA GGGCTTCATC CCCACTGGCC AAGAGGCATG GGGCTTCGGC TACGACGGCT TCGGACGGCT CACCAATATC AGCCGACCTC GCCCGGCAGG TGCTGGCACC GCCACCAACA CGATCGTCTA CGGCGTACTG CTCTCCGGCG CCTCAGCGCC GGTCGATGTG TCCCCTGACC GGGTCGATGA CTGGGCCCAA CAGGACCTGC CCGTCTGGGC AACCGCCGTC TTCCCAGCCA GTCACGTCCC CGCCAGCCCA CCCACCGCCG CCGACTGGCC CTACGCCGCG ATCAACTACC TGAACGCTGA CGGCCGGCAG GTCAACACCG CCTCCTACGG GGCAGGTGCC TGGCAGGTTA CTACCACGGA ATACGACACG TTCGGCAACG TGACCCGTGA GTTGACTGCA GAGAACCGCA ATCAGGCGCT GACCCCTACC ACAGACACCG ACGCCACCGT CGCCGCCCTC ACCGACTCCG CCGCCCGCGC CCAACTCCTC GATACCAAAA TCACCTACAG CGTCGACGGG GTCGTACCCC TCGACGTATA TGGACCAAAA CATCGATACA TCAACAATGA CGGCGTCCGG TCGTCGGTAC GACAGCACGT CCACACCGAT TACGATCAAG GCGCCCCGCC GGCAACAGGA CCATACTGGC TGCCCACTAC AGTCACGACC ACGGCGTACA CCGGCAGTTC CGACATCGAC CCCCGTACAA CAATCAATGG CTACGCTGCG AAGACAGGCG CGGACGCTTC CACCTCCGGC TGGACCCTAC TTCAGCCGAC AACAATCACG ACGTGGATGG GTGGCGGGGC CACTCCCAAC ATTGTCCGCA CCACCTACTA CAATGCCGCT GGCCAACCCC TCGAAGTACG CCAACCCAAG GCCAACAGCG ACGGCACCGA CGCCTTCACC ACTATTTTCT CCTACTTCAC TCCAACTGGA TCCGGTGGAT GCGTTAATGC CTCCTGGGCA GGTCTAACCT GCTCCACTGG ACCCGCATCG CAGCCTACTT CTGGTAACAC TCTTCCAGTC ACCACCCTCA CCTACAACAA TCTCAATCAG CCGCTCACCA AGACCGAGAC CGTCACTTCC AGCGGCACGA CCACCCGCAC CACGACCTAT ACCTACGACA CCGCCGGCCG GACGCTCTCC GAAGGATTGG CCGTTAACCC CGCGGCCAAC GGCGGCACCG CCGTACCCAC CGCCACCTAC GGGTACAGCA CGATAAGCGG TCTCCCGACT ACCACGACCG CGAACAGCGT CACCCTGACC ACCGAATACG ACGCCTGGGG GCAAATCACC TCCCAGACCG ACTCCGACGG CAACACCAGT AACACAACCT ACGACATCGA CGGCCGAGTC TCGTCGGTCA ATGATGGTAA GGGTGCCTAC ACTTACATCT ATGATACCGC CACCGAACAC CGCGGCCTCG TCACCAGCCT CGGCATCGAC GCCGGCTCCG CGCCGTCCAC CTTCACCGCC ACCTATGACG GCGACGGGAA ACTGACATCC CAGAACTACC CAAACGGTCT GGTCGCCACC AGCCACTATA ACAACACTGG CAGACCCACC ACCCTTACCT ACACCAAGGG CACGTCCACC TGGCTGACCT TCACCCAAAT CGACAATATC AACGGCCAAG GGCGTGTCGT CGAATCCCCC GGCGGGATCC GCGAACACCT CTACGACCTG GTCGGCAGGC TCATCACCGT CCGGGACATC CGCAACGATA GCGGCAGCGT TATGTGCACC TCCCGCCGGT ACGTCTACGA CCCCGATTAC AACCGCACCC AGCAGATCTC CTACCCCGAC GCCGGCACCA ACCCCACCGG CCCCGCCTGC AGCACCTCCA CCACTCCCAC CTACTCGCTG AGCTCCAGCT ACGATCAGGC CGACCGCATC ACCGACACCG GCTACACCTA CGATCTGTTC GGTCGCACCA CCGCCGTCCC GGCAGCGGAA AACGATCTCA CGGTCGGCTA CCACGCCAAC GACATGGTCG CCTCCGAAAC CCAGGGCGCC GCCACCCGCA CCTACACCCT CGATCCCGCA CGTCGCATCC GAACCTGGGC TCAAGGTGCC ACTACCTGGA CCAACCACTA CACCTCGGGC TCCGACGACA GCGCCGCGTG GATCGGCGAC AGCGGCGGTG GATGGACCCG CAACATCCTC GGAATTAGCG GCACACTAGT TGCTGTTCAG GACCAGACTG GCACGGTCAC CCTACAGCTA TCAAACTTGC ACGGAGATAT CGTTGCCACC GTCGACGACA ACAGCTCTGC GACGGCACCC ACCACCTTCC AGGAAACCTG GGAATTTGGA CAGCCCTACA ACATTGCTAC CGCTTACCCT CGATATGGCT GGCTCGGTGG GCCACAACGG TCTCGCGATA CCATTTCAGG TATCACTCTT ATGGGGGCGC GACTATACAA CCCGGGAACA GGTCGCTTTC TTCAGGTAGA CCCAGTGCGC AATGGCGCCG CCAATCCGTA CGAATATGCC TACCAAGACC CATGGAATAT GACCGATCTC GATGGTCGCA TCCCGACTCC GAGTCTTGTT TACTCTTGTC CGCGAGGGTT CAACTGCCAG GTCCTCCTTC GCAGCACGAA ATACAGCGAG TGGAGGGCAT GGCAGCTGAC CGGTGTCGCC GATGGCCACC TAAGCTCGTT TTACTACGAA ACACGAAGAA GATCCGCTAA ATTTTATGTC TCGCGATACT TCTACTATAA CCGGCAAACT CAATTTATTA TGTACATCTA CACACAGGTA CAGGACAGGA TTAGAACCTT TACGTGGCCA GCAATCAGAA CGGAGAAGAT TCAAAGCGAC AGATTCGAGT ATATAACATA CGTAATGTGT CGCTCGGCCT ATATTTGCTA A
|
Protein sequence | MRASPVRRRR SFIRSLCVLL AALLAATVTQ FLPPVSGAAS AAPTDAGTEV PPPTGPAERP DLITAQLTAR AEQRRIEVTS LTTESTRTLV NPNGSITVES TSGIARVHRD GRWLPVDTTL VLTDGKITPK VAKAAIALSA GGKDAGDIAT LRDGDREIAF GWPTALPTPE LKDNVATYKA VAQDTDLRVK VTPTGYDVQI VAHTPAAAQA ALSLPMRLKG VTAERTAGGE LRLSAAGKIA ARSPTPLMWD AHVDAKTGLP DRTHTVEAAL ERAGTATPAL ALRPDKAWLT ADDRQYPVTI DPAAVLPDNL DTDVVNTSAT TNYDTYEYLR VGNVFGSVHR SFLRFDTSAI EGKHVTAAAL KLTQAGSYTC TPQRMVVQGS AGLASGATWN TQPTADGVNW SDTTFNAGGF CGSGDVSLDI TGLAQAWSSN GQPSPETLTL RAPDEVLWEP YKYFTSGDTA TPPRIETTYN SYPATVGART TSPCSAQCGG TPPTVLTNST TPRLAGSATD ADGGTLRLDF EVWNSAGTTK ITQGSSGFVA QNSEAAWTVP SGLLANGTSY QWRVRAYDGT DYSQNWSTWI PFTIDTTAPA TPTALASTAW PSGGWASATS GTFTWTSPGG DTAGFLYGLD EPSPTTSPTP PTGTTSASLT ADEGLHTFYL RTIDTAGNLS PVISYNFGVG NAALASPAEQ SRTQRFATLR GEAPSSQVSV TYRYRVGTNP STAWTDVPTA ETTTQGTTNH PTWPAPRNGS GAFDNQIWNI PGTLGGGSDG PIQIQACFRT AAAVVTCTDA TTIQLTRNAF SDTDAVGELG PGSLALLTGD YSLSATDVSM PTYTGTLAVG RTLTTLTPPA ATTAPNGIFG PGWTSSVPGP EAGSGDRTLT DSTATEGYVT LTSPEGAPAV YTRAGTGSYP YEYTGVADTA ADGSKLIKDS ATKFTLTEID GAKTIWYSKT IGSATIWVVD RVEEPGSNTT STFTTDSLGR ITRILAPVPS GVDCSGTLGA GCRALTLTYA TSTTATGSGG NPAEWGNYTG QLSSISMALN GATSIEVARY SYDTTGHLRA EWDPRLDTAG GNHLATLYWY NAEGRVQGFI PTGQEAWGFG YDGFGRLTNI SRPRPAGAGT ATNTIVYGVL LSGASAPVDV SPDRVDDWAQ QDLPVWATAV FPASHVPASP PTAADWPYAA INYLNADGRQ VNTASYGAGA WQVTTTEYDT FGNVTRELTA ENRNQALTPT TDTDATVAAL TDSAARAQLL DTKITYSVDG VVPLDVYGPK HRYINNDGVR SSVRQHVHTD YDQGAPPATG PYWLPTTVTT TAYTGSSDID PRTTINGYAA KTGADASTSG WTLLQPTTIT TWMGGGATPN IVRTTYYNAA GQPLEVRQPK ANSDGTDAFT TIFSYFTPTG SGGCVNASWA GLTCSTGPAS QPTSGNTLPV TTLTYNNLNQ PLTKTETVTS SGTTTRTTTY TYDTAGRTLS EGLAVNPAAN GGTAVPTATY GYSTISGLPT TTTANSVTLT TEYDAWGQIT SQTDSDGNTS NTTYDIDGRV SSVNDGKGAY TYIYDTATEH RGLVTSLGID AGSAPSTFTA TYDGDGKLTS QNYPNGLVAT SHYNNTGRPT TLTYTKGTST WLTFTQIDNI NGQGRVVESP GGIREHLYDL VGRLITVRDI RNDSGSVMCT SRRYVYDPDY NRTQQISYPD AGTNPTGPAC STSTTPTYSL SSSYDQADRI TDTGYTYDLF GRTTAVPAAE NDLTVGYHAN DMVASETQGA ATRTYTLDPA RRIRTWAQGA TTWTNHYTSG SDDSAAWIGD SGGGWTRNIL GISGTLVAVQ DQTGTVTLQL SNLHGDIVAT VDDNSSATAP TTFQETWEFG QPYNIATAYP RYGWLGGPQR SRDTISGITL MGARLYNPGT GRFLQVDPVR NGAANPYEYA YQDPWNMTDL DGRIPTPSLV YSCPRGFNCQ VLLRSTKYSE WRAWQLTGVA DGHLSSFYYE TRRRSAKFYV SRYFYYNRQT QFIMYIYTQV QDRIRTFTWP AIRTEKIQSD RFEYITYVMC RSAYIC
|
| |