Gene Franean1_0258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0258 
Symbol 
ID5668683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp311560 
End bp314049 
Gene Length2490 bp 
Protein Length829 aa 
Translation table11 
GC content75% 
IMG OID641239188 
Producthypothetical protein 
Protein accessionYP_001504631 
Protein GI158312123 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.828361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACAGC TGTGGTGGCG GGGAGCGGGT CGTCCGCGGC TGTTCGGAGC ATTCGACAGC 
GACCCCGGGC ATTTCTGCCT CACCTCGGCG GCTCGCGTCG CGATCGTCGC CCCCGCCCTG
CTCGCCCTCG TCTCCACGCT TGTCGGCGAT GTGCGGATGA GCCTGTTCGC CTGGTTCGGC
GCCTACGCGC TCCTCGAGTT CGTCGACTTC GACGGGCCCC GACCGGCCCG GCTCACCGCC
TACCTCACGC TCGCCACCAC CGGCGCCGGG ATGGTCGTCC TCGGCACGCT CGGCTCGCGC
TCGCCCTGGC TGGCGGCGTC GATGACCGCG GTTGTGGCCT TCGTTGTGAT GTTCTCCGGC
GTGCTGAACG CCAAGGTCGC CGTCGCCGGC CGATCCGCGT TGCTCGCCTT CGTGCTGCCT
GTGATGACAC CCGGCCCGAT CTCGGCGATA CCGGAACGGC TGGTCGGCTG GGGGCTGGCC
TCGGTCGCCT CGATCACCGC CGCGATGCTG CTCTGGCCGC GCCGCCCGCC GGACCGCCTG
CGCGCGGAGA CGGCGGACGT CTGCCACGCG CTCGCGACGG CCGTGAGCTG GCAGCCCCAG
GTACCGGACC CGCCGGCAGC GGTGTCGCGC GGTCCGGCGG CCGACGTGTC GCGGCGTCCA
GCGCCCGATC TCTGGCCGGC GCTGCGGCGG CTGCGGCGGC AGTTCGTCGC GACGGCCCAC
CGCCCGACCG GGGTCGGTGG CCGTGCGGCG GCGCTCGGCC ACCTGGTGGT GGACGTCAAC
TGGCTGGTGC CGTTCGCCCT TCCGTGGCCG GAGCGGGACC GGACCGCCCG CGCCTGCTTT
CCCGCGGAGG CCGCCGAGCT GCACGCCGCC GTCACGGCGA CGCTGCGCGC GGCGGCCACC
CGCATCGAAC CGTCCGACCA CGGTCGGCCC AGGCCCAGGC CCAGGCCCAG GGCCGATGGC
GGGGGTGGTG GGATGCGGGT CGGGGACGAT CGGCTCGGGA TCGCGCGACT CGAACGGTCC
GAGCGGGCGA TGCGGTCGGC GTTGCTGCGG CAGCTGCGGG AGCCGGCGCC CGCCTGCCCA
CCGGACCTCG CCGCGGCCGA GGCGTTCCGC CTGCGCCGGC TGGCCCGGGG GGCCAGGGAA
CTGGCGCTGA ACGTGCTGAG GGTGACCGGC CCCCTCCCGC CCCGCCCGCC GGCGAGTCCC
GTCCGGCGCG CCGTGGACGC GTTCCATCAC GCGCGCCGGC TGGTGCGCGA ACGGGGAACC
GCCGCGACCG ATCTCGCCGC CGGCTACGCC AGCCCGCGGT CGGTGTGGTT CCGCAACAGC
GTGCGCGGCT CCCTCGGCCT CACCACGGCG GTGATCATCG CCCAGGCCGC CGGGCTCCAG
CACGGCTTCT GGGTCGTGCT GGGGACCTTG TCCGTCCTGC GCTCGAACGC CATGGCCACC
GGTTCCGCAG CGGTACGCGC GCTCGCCGGC ACCGGAGTCG GCATCGTCGT GGGTGGCCTT
TTCGTGGTCG CCGTCGGCAC CCACACCGCC GTCCTGTGGG CGGTTCTCCC GCTCGCCGTG
CTGCTCGGGA GCTACTCCCG CCGCAGGTCG GGGTTCGTAC TGGGCCAGGC GGGCTTCACG
GTCTCCGTGC TCATGCTGTT CAACATCGTC GAACCCGCCG GCTGGCGGGT GGGCATAGTA
CGGATCCAGG ACGTCATGAT CGGGTTCGGG GTGAGCATCG TCGTCGGTGC CCTGCTGTGG
CCCCGCGGCG CGGTCGCCGT GATCCGGACC CGCGCCGAAT CCGCCTACCG AAGTGCGGTG
ACGTTCCTCG ATCTCGTCGT CCCGCACGCG CCGGGTGCCC CCGAGCATCC GGCCGTGGCA
CCGGCCGCCC GCGAGGCGAT CCGGGCCGGC CGCCTCCTCG ACGACGCCGT ACGCCAGTTC
CTCGCCGAGC AGCCGCCGGG CCGCTTCGAC GTCGACGCGC TGATGACGAT CGTCGCCGGC
GCGCTGCGTA TCCGGCGGAC GGCGCAGCTC CTGTGGAACG GGGACGTCCC CTGGCCGCCC
GATCTGACGC CCGATCTCCC ACGCGCGACC GGCGCCGGCC GCGCCGAGGC CACCGGCTTC
GCCGTCGCCC AAGGCATCCT CATCGAGGAC ATGCGGGACC TCTGCCGTTG GTACACCGCC
TACGCCACGG CCCTCGGCGC CGCGCGGCGG CCGCCCGAGC CCGAAGCCGG CTCCGGCCGG
GCCGCCACAG CCGGGCTGAT CATGATCCAC AGGGCTGCCC GCGCGCACCG CTGCCCCGAG
ATCCTCGCCG GTGCCGCGCT GACCTCCCGG GCCGCCTACC TGGACATCCT GCGCGACCTG
CAGCCCCGGC TGACGGCCGC GGCCACGGCG CTCGACCAGA CGTCCGACGG CGGCCGGGAC
CGCCCGCGCA AGCCGGTCGA TCAGGCGCCG GGGCCCCAGG GGTCGCGGCC GGCCCCGGAA
CGCTCGCGAG CCTTCATCCG AGCCTCGTAG
 
Protein sequence
MRQLWWRGAG RPRLFGAFDS DPGHFCLTSA ARVAIVAPAL LALVSTLVGD VRMSLFAWFG 
AYALLEFVDF DGPRPARLTA YLTLATTGAG MVVLGTLGSR SPWLAASMTA VVAFVVMFSG
VLNAKVAVAG RSALLAFVLP VMTPGPISAI PERLVGWGLA SVASITAAML LWPRRPPDRL
RAETADVCHA LATAVSWQPQ VPDPPAAVSR GPAADVSRRP APDLWPALRR LRRQFVATAH
RPTGVGGRAA ALGHLVVDVN WLVPFALPWP ERDRTARACF PAEAAELHAA VTATLRAAAT
RIEPSDHGRP RPRPRPRADG GGGGMRVGDD RLGIARLERS ERAMRSALLR QLREPAPACP
PDLAAAEAFR LRRLARGARE LALNVLRVTG PLPPRPPASP VRRAVDAFHH ARRLVRERGT
AATDLAAGYA SPRSVWFRNS VRGSLGLTTA VIIAQAAGLQ HGFWVVLGTL SVLRSNAMAT
GSAAVRALAG TGVGIVVGGL FVVAVGTHTA VLWAVLPLAV LLGSYSRRRS GFVLGQAGFT
VSVLMLFNIV EPAGWRVGIV RIQDVMIGFG VSIVVGALLW PRGAVAVIRT RAESAYRSAV
TFLDLVVPHA PGAPEHPAVA PAAREAIRAG RLLDDAVRQF LAEQPPGRFD VDALMTIVAG
ALRIRRTAQL LWNGDVPWPP DLTPDLPRAT GAGRAEATGF AVAQGILIED MRDLCRWYTA
YATALGAARR PPEPEAGSGR AATAGLIMIH RAARAHRCPE ILAGAALTSR AAYLDILRDL
QPRLTAAATA LDQTSDGGRD RPRKPVDQAP GPQGSRPAPE RSRAFIRAS