Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3488 |
Symbol | |
ID | 5671859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4145401 |
End bp | 4148472 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641242376 |
Product | YD repeat-containing protein |
Protein accession | YP_001507796 |
Protein GI | 158315288 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0275829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCAGATT TCCAGGACGA CCACGATCCG GGCGCCGTGT GGCGGCTGCC GGTCAGCGAC ACCACGATCC TGCTGGTCGA GCTCAGCACC GGCGAGCCGG TGCTGCGCCA GCGGAACCGG CTCACCCCGC GGGCCGGCGT GGCCCGGCCG CCGTCACGCG AGTACCGCGG CCGGCGGACC GGCGAGAACG ACGGGCCGGG ACCGGGCTGG TCGGTCACCG GGCCCGACGA CTCCGCGGGG GTCGAGTACG ACCCGGCCGG CAGGCCGGTG CGGCACACCG ACGCCGCTGG CGCGGTCACC GCCTTCGGCT GGGACGACGA GCACCGGCTG ACCGGGCTCA CCGACGCGGC CGGCACCCTC GTCCGCCTCT CCTACACCGC CGCCGGGGCC GTCAGCGAGG TCGTCCTGGA GGGCATCGGG CGTACCGGCT TCGCCTACCA GGACGAGCCG TCCGGGCCGG CGGGCTCCGG CGGGCCGGTG GCCGGGGGGA CCCGCACCGT GGTGACGGAC CCGCTCGGCC GCGCCACCGC GTACACCTTC GACGGCCACG GCCGGCTGCT GACGGTCACC GACCCGCTGG GCGGCGTGCA CCGTCAGGAG TGGGATCCGG CGGGCCGCCT GGCCGCCGTC GTCGACCGGA CGGGCGGGCG CACCGCCTAC GAGTACGACG CGCACGGCCG CCTCGTCGCC CGGGTCGCGC CGACCTCGGC CCGCTCGTCG GTCGGCTACG GCGACCCGGC GCATCCGGAC CTGGTCACCA CACTGCGGGA CCCGGCGGGC AACGAGGTCA TCCTCGAGCA CGACCCCGCC GGCCGGGTGG TGCGCGTCAG CACCGCGGAC ACAGCAGGGA CAGCCGGCAT CGCCGACACC GCGGGCACCG CCGGCTCTCT CGACCTGCGG GCCTACGACC CCGTGCACGG CCGGATCAGC GCCGTCACGA ACGGTGCCGG CCATCGCACC TCCTTCGAGT ACGACGCGGC GGGCGAGCTC GTCGCGGTCA GCCCGCCGGC GCCGCGGCGC CGCGCCTCGT ACGACTACGA CGAGCTCGGC CGCGTCGTCG CCGTGACCGG CGGCAACGGG CGGCGCACGG CGTACCGCCA CGACCCGGTC GGCCGGCTCG TCGAGGTCAG CGGCCCCGGC GGGGTGCTCC TCACCCAGGC GCACGACCCG GTGGGCCGGA TCGTGAGCCG CGCGGGCGCC GGCTGGCGCT ACGACTACAG CTGGGTGTCG ACGTCGGCCG GCAGCCGGCT GGCCGCGGTG GTGCGCACCG ACGACACCGG CCGGGAGGAG GTGCGCGCGG AGCACGGTCC CGACGGGGCG CTGCGCTCGC TGACGACCGC CGGCGGCACC ACCGACTACG ACTACGATCC CGCCGGCCGG CTGGCCGGGG TGCGCACCCC CACCGGGCAC CAGGCCCGCT TCACCCGCGA CGCCGCCGGG CGCGCGCTGC GGATCGAGTT CGGCGGCGTC GTCCAGGAGA TCTCCTACGA CGCTGCGGGG CGGCGCCGGG CACTGACCCT GCTCGGCGCG GACGGCGCGA CGCTGCTGAG CGCCGAGTAC GACTACCGGG ATCCGACCGG CGCGGACGGC GACCGGCTGC GCCGGCTCGT CCTGGACGGG CAGGTCACCG AGTACGCCTA CGACCCGCTC GGCCAGCTCG TCCAGGCCGG GCCCACCAGC TACGCCTACG ACGCGGCGCT CAACCTCGTC CGCCTCGGGG AGACCGCGTT CACCATCGGC GCGGCCGGCG AGGTCACCCG CTTCGGCGCG ACCGAGTTCG ACTACGACGG CGCCGGCAAC TTCGTCGAGG AGGTCAACCC GACCGGTTCG TTCCGCTACA GCGACACCAA CCAGACCGTG CTCGGCGTCT TCGGCGGCGC CGTCGTCGCC GACATCGCCC ACGACAGCCT CGGTCAGCAG ACCCCCCGGC GGGTCACCGA GACCACGGTC GACGGGCGGA CCGTCACCCA CGTCCTGACG CACGGCCCGC TGGGCGTCGC CCGGGTGGTC GACGACGGCG TGCCCCTCGA CGTGGTGCGC CTCCCCGACG GCACTGTGCT CGCGGTCATC ACCGCCGAGG GCCGGCTGCT GTGGACGGTG ACCGACCACC AGGGCTCCGT GCTGGCACTC GTCGACGAGC AGGGACGGCT GGCCGCCCGC TACGGCTACA CCCCGCACGG CGCCGTGACC GCCACCGGCC CGGACGCCGC CACCAACCCG TTCCGCTACC GGGGCGCCTA CCAGCTGCTG CGCAGCGCGC ACTTCCTGGA CAACCGCCTC TACAACGGCT ACTGGGGCCG GTTCACCCAG CCCGACCCCA CCGGCCGGCA GTACGGCCCC TACACGTTCG CGGACAACGA CCCGCTCGGC GCCGGCCTCC CCGGCCGGCA CGACTTCTGG GCGGCGCTGA CCGCGCCGCC GGAGCTCACC GCCGAGCTGT TCTTCCCGCC CGCCGACGTC CCGCCGCCCA CCGCCGGCGC CGCGGACGGC CCGCACGCGC GGGCGGCGCT GGCCGCGCTG ACCGGCCCCG GGGTGACCCC CGACCAGCTG CCACGCATCA CCGAACACGG CGCCGGCCAC ACCGCCGACC GCCGCACCGA CCGCACCACC GCGCCCGCCG GCCCGGCGGG CCGGGCGAAC CCCACCCCGA AAGGACGTCC CACCGTGGCT GACCAGATCG TGATCCGGGT TCCGAACGAG GTCGTCGTCA AGGTCGTCGA CGACGTGGTC GACCTGGCCG ACCCGCAGAT CGGCCAGACC GGCGTCTTCG ACGACGAGCT GTACGACGAG GACGGCAAGC TGATCGGCAC CTCGCACGGC TCGTTCCGCA TCGAGTACGT GCGCCCCGGC GACGGCGGCC TGATCACCTA CTACACCGAG GACATCACCC TCGACGACGG CACCATCCAC GCCGAGGGCT GGGCGGACTT CAACGACGTC AAGACGAGCA AGTGGGTGCA CTACCCCGCG ACCGGGACGG GCGGGCGCTA CGCAGGCCTC ACCGGTTTCC GCACCTGGCG GATGACGGGC GTGCGGGCCT CCGCCGAGGC CCGCATCCTG CTGTCCGACT GA
|
Protein sequence | MSDFQDDHDP GAVWRLPVSD TTILLVELST GEPVLRQRNR LTPRAGVARP PSREYRGRRT GENDGPGPGW SVTGPDDSAG VEYDPAGRPV RHTDAAGAVT AFGWDDEHRL TGLTDAAGTL VRLSYTAAGA VSEVVLEGIG RTGFAYQDEP SGPAGSGGPV AGGTRTVVTD PLGRATAYTF DGHGRLLTVT DPLGGVHRQE WDPAGRLAAV VDRTGGRTAY EYDAHGRLVA RVAPTSARSS VGYGDPAHPD LVTTLRDPAG NEVILEHDPA GRVVRVSTAD TAGTAGIADT AGTAGSLDLR AYDPVHGRIS AVTNGAGHRT SFEYDAAGEL VAVSPPAPRR RASYDYDELG RVVAVTGGNG RRTAYRHDPV GRLVEVSGPG GVLLTQAHDP VGRIVSRAGA GWRYDYSWVS TSAGSRLAAV VRTDDTGREE VRAEHGPDGA LRSLTTAGGT TDYDYDPAGR LAGVRTPTGH QARFTRDAAG RALRIEFGGV VQEISYDAAG RRRALTLLGA DGATLLSAEY DYRDPTGADG DRLRRLVLDG QVTEYAYDPL GQLVQAGPTS YAYDAALNLV RLGETAFTIG AAGEVTRFGA TEFDYDGAGN FVEEVNPTGS FRYSDTNQTV LGVFGGAVVA DIAHDSLGQQ TPRRVTETTV DGRTVTHVLT HGPLGVARVV DDGVPLDVVR LPDGTVLAVI TAEGRLLWTV TDHQGSVLAL VDEQGRLAAR YGYTPHGAVT ATGPDAATNP FRYRGAYQLL RSAHFLDNRL YNGYWGRFTQ PDPTGRQYGP YTFADNDPLG AGLPGRHDFW AALTAPPELT AELFFPPADV PPPTAGAADG PHARAALAAL TGPGVTPDQL PRITEHGAGH TADRRTDRTT APAGPAGRAN PTPKGRPTVA DQIVIRVPNE VVVKVVDDVV DLADPQIGQT GVFDDELYDE DGKLIGTSHG SFRIEYVRPG DGGLITYYTE DITLDDGTIH AEGWADFNDV KTSKWVHYPA TGTGGRYAGL TGFRTWRMTG VRASAEARIL LSD
|
| |