Gene Franean1_3483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3483 
Symbol 
ID5671854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4141672 
End bp4142793 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content76% 
IMG OID641242371 
Producthypothetical protein 
Protein accessionYP_001507791 
Protein GI158315283 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0462629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.03688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGATC TCGCCGAGCT GAAGACCTTC GTCGTGGCAC ACGCGGTGTC GCAGGGGCTG 
CCCACCGGGC ACTACGACCC GCTGCTGGCC CGCATCCACC ATGACGAGGA CGGCGTCCCG
GGCTCGTGGG CGTTCGAGTG GAGCGCGCTG GCCGACGGCC TGGCCGCCGA GGGCCGGCCG
CTGGAGGCCT GCGTGCACTA CACGATGGCC CGGTTCCCGT TCGTCGACGG GCCGGCGCGG
GCGCGAGCAC TCGAGCGGGC GACCGGGGAG TTCGCCCGCT GGAGCGCCGC GCACCCGGCG
CTGCGCGGCC TGGACGTCGA GCTGCCCGCG GGGCGGGTGC GCTGCTGGAC GACCGGCCTG
GACGCCCGCG ACGCCGGCGA CGCCGAGGAC CCCGCCGGGC CGCGGCCGCT GCTGGTCATG
ACCGGGGGCA TCGTGTCGAC CAAGGAGCAG TGGGCGCCGG TGCTGCTGGG CCTGGCCGAG
CTGGGCTTCG CCGGGCTGGT CACCGAGATG CCGGGCGTCG GCGAGAACAC GCTGCCCTAC
CGGGCCGACA GCTGGACGCT GTTCCCCGCC CTGCTCGACG CGATCGGCCG GCCCGCCGGC
ACCGCCGACG TCTACCTGCT GGCGCTGAGC TTCAGCGGTC AGCTGGCGCT GCGGGCCGCG
CTGCACGACG ACCGGATCGC CGGGGTGGTG GGCGCCGGCG CCCCGGTGCG GGAGTTCTTC
ACCGACACCG CCTGGCAGCG CCGGGTGCCC CGGGTCACCA CCGACACCCT GGCGCACCTG
ACCCGGACCA GCGCCGACGA GGTCTACCCG ACCGTGCGGG ACTGGGCGCT GCGGGAGGAC
GAGCTGGCGG CGCTGCGGAT TCCGGTCGCG CACGTGACCA GCCTGCGCGA CGAGATCATC
CCGCCCGGCG ACGCGCGGCT GCTGCGCCGG TTGGTGCCGC GGATCCGGCT GCTCGCCCAT
GACGACGTGC ACGGCGCGCC GTCGCACTTC GCCCAGACCC GGCTGTGGAC GCTGCTGTCG
GTGCTGCGCA TGCACGGCGG CAACGCCCCG ACCCGGCTGG CACTGACCCG GCAGTTCGCC
CGGCTGCGCT ACGCCGACCC GGCGGTGCGC TCCGCCGCCT GA
 
Protein sequence
MNDLAELKTF VVAHAVSQGL PTGHYDPLLA RIHHDEDGVP GSWAFEWSAL ADGLAAEGRP 
LEACVHYTMA RFPFVDGPAR ARALERATGE FARWSAAHPA LRGLDVELPA GRVRCWTTGL
DARDAGDAED PAGPRPLLVM TGGIVSTKEQ WAPVLLGLAE LGFAGLVTEM PGVGENTLPY
RADSWTLFPA LLDAIGRPAG TADVYLLALS FSGQLALRAA LHDDRIAGVV GAGAPVREFF
TDTAWQRRVP RVTTDTLAHL TRTSADEVYP TVRDWALRED ELAALRIPVA HVTSLRDEII
PPGDARLLRR LVPRIRLLAH DDVHGAPSHF AQTRLWTLLS VLRMHGGNAP TRLALTRQFA
RLRYADPAVR SAA