Gene Franean1_3488 details


Gene Information

Locus tag: Franean1_3488
Symbol:
ID: 5671859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4145401
End bp: 4148472
Gene Length: 3072 bp
Protein Length: 1023 aa
Translation table: 11
GC content: 75%
IMG OID: 641242376
Product: YD repeat-containing protein
Protein accession: YP_001507796
Protein GI: 158315288
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
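
The coordinates and lengths listed above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not translated. The names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is an untranslated stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


length_bp = gene_length(4145401, 4148472)
print(length_bp)                  # 3072
print(protein_length(length_bp))  # 1023
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of this record.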


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0275829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCAGATT TCCAGGACGA CCACGATCCG GGCGCCGTGT GGCGGCTGCC GGTCAGCGAC 
ACCACGATCC TGCTGGTCGA GCTCAGCACC GGCGAGCCGG TGCTGCGCCA GCGGAACCGG
CTCACCCCGC GGGCCGGCGT GGCCCGGCCG CCGTCACGCG AGTACCGCGG CCGGCGGACC
GGCGAGAACG ACGGGCCGGG ACCGGGCTGG TCGGTCACCG GGCCCGACGA CTCCGCGGGG
GTCGAGTACG ACCCGGCCGG CAGGCCGGTG CGGCACACCG ACGCCGCTGG CGCGGTCACC
GCCTTCGGCT GGGACGACGA GCACCGGCTG ACCGGGCTCA CCGACGCGGC CGGCACCCTC
GTCCGCCTCT CCTACACCGC CGCCGGGGCC GTCAGCGAGG TCGTCCTGGA GGGCATCGGG
CGTACCGGCT TCGCCTACCA GGACGAGCCG TCCGGGCCGG CGGGCTCCGG CGGGCCGGTG
GCCGGGGGGA CCCGCACCGT GGTGACGGAC CCGCTCGGCC GCGCCACCGC GTACACCTTC
GACGGCCACG GCCGGCTGCT GACGGTCACC GACCCGCTGG GCGGCGTGCA CCGTCAGGAG
TGGGATCCGG CGGGCCGCCT GGCCGCCGTC GTCGACCGGA CGGGCGGGCG CACCGCCTAC
GAGTACGACG CGCACGGCCG CCTCGTCGCC CGGGTCGCGC CGACCTCGGC CCGCTCGTCG
GTCGGCTACG GCGACCCGGC GCATCCGGAC CTGGTCACCA CACTGCGGGA CCCGGCGGGC
AACGAGGTCA TCCTCGAGCA CGACCCCGCC GGCCGGGTGG TGCGCGTCAG CACCGCGGAC
ACAGCAGGGA CAGCCGGCAT CGCCGACACC GCGGGCACCG CCGGCTCTCT CGACCTGCGG
GCCTACGACC CCGTGCACGG CCGGATCAGC GCCGTCACGA ACGGTGCCGG CCATCGCACC
TCCTTCGAGT ACGACGCGGC GGGCGAGCTC GTCGCGGTCA GCCCGCCGGC GCCGCGGCGC
CGCGCCTCGT ACGACTACGA CGAGCTCGGC CGCGTCGTCG CCGTGACCGG CGGCAACGGG
CGGCGCACGG CGTACCGCCA CGACCCGGTC GGCCGGCTCG TCGAGGTCAG CGGCCCCGGC
GGGGTGCTCC TCACCCAGGC GCACGACCCG GTGGGCCGGA TCGTGAGCCG CGCGGGCGCC
GGCTGGCGCT ACGACTACAG CTGGGTGTCG ACGTCGGCCG GCAGCCGGCT GGCCGCGGTG
GTGCGCACCG ACGACACCGG CCGGGAGGAG GTGCGCGCGG AGCACGGTCC CGACGGGGCG
CTGCGCTCGC TGACGACCGC CGGCGGCACC ACCGACTACG ACTACGATCC CGCCGGCCGG
CTGGCCGGGG TGCGCACCCC CACCGGGCAC CAGGCCCGCT TCACCCGCGA CGCCGCCGGG
CGCGCGCTGC GGATCGAGTT CGGCGGCGTC GTCCAGGAGA TCTCCTACGA CGCTGCGGGG
CGGCGCCGGG CACTGACCCT GCTCGGCGCG GACGGCGCGA CGCTGCTGAG CGCCGAGTAC
GACTACCGGG ATCCGACCGG CGCGGACGGC GACCGGCTGC GCCGGCTCGT CCTGGACGGG
CAGGTCACCG AGTACGCCTA CGACCCGCTC GGCCAGCTCG TCCAGGCCGG GCCCACCAGC
TACGCCTACG ACGCGGCGCT CAACCTCGTC CGCCTCGGGG AGACCGCGTT CACCATCGGC
GCGGCCGGCG AGGTCACCCG CTTCGGCGCG ACCGAGTTCG ACTACGACGG CGCCGGCAAC
TTCGTCGAGG AGGTCAACCC GACCGGTTCG TTCCGCTACA GCGACACCAA CCAGACCGTG
CTCGGCGTCT TCGGCGGCGC CGTCGTCGCC GACATCGCCC ACGACAGCCT CGGTCAGCAG
ACCCCCCGGC GGGTCACCGA GACCACGGTC GACGGGCGGA CCGTCACCCA CGTCCTGACG
CACGGCCCGC TGGGCGTCGC CCGGGTGGTC GACGACGGCG TGCCCCTCGA CGTGGTGCGC
CTCCCCGACG GCACTGTGCT CGCGGTCATC ACCGCCGAGG GCCGGCTGCT GTGGACGGTG
ACCGACCACC AGGGCTCCGT GCTGGCACTC GTCGACGAGC AGGGACGGCT GGCCGCCCGC
TACGGCTACA CCCCGCACGG CGCCGTGACC GCCACCGGCC CGGACGCCGC CACCAACCCG
TTCCGCTACC GGGGCGCCTA CCAGCTGCTG CGCAGCGCGC ACTTCCTGGA CAACCGCCTC
TACAACGGCT ACTGGGGCCG GTTCACCCAG CCCGACCCCA CCGGCCGGCA GTACGGCCCC
TACACGTTCG CGGACAACGA CCCGCTCGGC GCCGGCCTCC CCGGCCGGCA CGACTTCTGG
GCGGCGCTGA CCGCGCCGCC GGAGCTCACC GCCGAGCTGT TCTTCCCGCC CGCCGACGTC
CCGCCGCCCA CCGCCGGCGC CGCGGACGGC CCGCACGCGC GGGCGGCGCT GGCCGCGCTG
ACCGGCCCCG GGGTGACCCC CGACCAGCTG CCACGCATCA CCGAACACGG CGCCGGCCAC
ACCGCCGACC GCCGCACCGA CCGCACCACC GCGCCCGCCG GCCCGGCGGG CCGGGCGAAC
CCCACCCCGA AAGGACGTCC CACCGTGGCT GACCAGATCG TGATCCGGGT TCCGAACGAG
GTCGTCGTCA AGGTCGTCGA CGACGTGGTC GACCTGGCCG ACCCGCAGAT CGGCCAGACC
GGCGTCTTCG ACGACGAGCT GTACGACGAG GACGGCAAGC TGATCGGCAC CTCGCACGGC
TCGTTCCGCA TCGAGTACGT GCGCCCCGGC GACGGCGGCC TGATCACCTA CTACACCGAG
GACATCACCC TCGACGACGG CACCATCCAC GCCGAGGGCT GGGCGGACTT CAACGACGTC
AAGACGAGCA AGTGGGTGCA CTACCCCGCG ACCGGGACGG GCGGGCGCTA CGCAGGCCTC
ACCGGTTTCC GCACCTGGCG GATGACGGGC GTGCGGGCCT CCGCCGAGGC CCGCATCCTG
CTGTCCGACT GA
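
The GC content field of this record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases and that the spaces and line breaks in the listing are formatting only. `gc_content` is a hypothetical helper, and the database may round or compute its value differently.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases in the listing.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of ten bases from the gene sequence, used only as a small demo.
print(gc_content("GTGTCAGATT"))  # 40.0
```

Passing the full gene sequence as a single string would give the value for the whole gene.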
 
Protein sequence
MSDFQDDHDP GAVWRLPVSD TTILLVELST GEPVLRQRNR LTPRAGVARP PSREYRGRRT 
GENDGPGPGW SVTGPDDSAG VEYDPAGRPV RHTDAAGAVT AFGWDDEHRL TGLTDAAGTL
VRLSYTAAGA VSEVVLEGIG RTGFAYQDEP SGPAGSGGPV AGGTRTVVTD PLGRATAYTF
DGHGRLLTVT DPLGGVHRQE WDPAGRLAAV VDRTGGRTAY EYDAHGRLVA RVAPTSARSS
VGYGDPAHPD LVTTLRDPAG NEVILEHDPA GRVVRVSTAD TAGTAGIADT AGTAGSLDLR
AYDPVHGRIS AVTNGAGHRT SFEYDAAGEL VAVSPPAPRR RASYDYDELG RVVAVTGGNG
RRTAYRHDPV GRLVEVSGPG GVLLTQAHDP VGRIVSRAGA GWRYDYSWVS TSAGSRLAAV
VRTDDTGREE VRAEHGPDGA LRSLTTAGGT TDYDYDPAGR LAGVRTPTGH QARFTRDAAG
RALRIEFGGV VQEISYDAAG RRRALTLLGA DGATLLSAEY DYRDPTGADG DRLRRLVLDG
QVTEYAYDPL GQLVQAGPTS YAYDAALNLV RLGETAFTIG AAGEVTRFGA TEFDYDGAGN
FVEEVNPTGS FRYSDTNQTV LGVFGGAVVA DIAHDSLGQQ TPRRVTETTV DGRTVTHVLT
HGPLGVARVV DDGVPLDVVR LPDGTVLAVI TAEGRLLWTV TDHQGSVLAL VDEQGRLAAR
YGYTPHGAVT ATGPDAATNP FRYRGAYQLL RSAHFLDNRL YNGYWGRFTQ PDPTGRQYGP
YTFADNDPLG AGLPGRHDFW AALTAPPELT AELFFPPADV PPPTAGAADG PHARAALAAL
TGPGVTPDQL PRITEHGAGH TADRRTDRTT APAGPAGRAN PTPKGRPTVA DQIVIRVPNE
VVVKVVDDVV DLADPQIGQT GVFDDELYDE DGKLIGTSHG SFRIEYVRPG DGGLITYYTE
DITLDDGTIH AEGWADFNDV KTSKWVHYPA TGTGGRYAGL TGFRTWRMTG VRASAEARIL
LSD
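
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way this could be reproduced without external libraries. It assumes the standard codon assignments (which table 11 shares with the standard code), the table 11 start codon set as listed by NCBI, and that an alternative start codon such as GTG is read as methionine at the first position. `translate` and its `is_cds_start` parameter are hypothetical names introduced for illustration.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(sequence: str, is_cds_start: bool = True) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        codon = bases[pos:pos + 3]
        # An initiating codon is read as methionine regardless of its usual meaning.
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate("GTGTCAGATT TCCAGGACGA C"))                # MSDFQDD
# Last four codons, which are not the start of the CDS.
print(translate("CTGTCCGACT GA", is_cds_start=False))      # LSD
```

The two fragments reproduce the first seven and last three residues of the protein sequence listed above.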