Gene Franean1_7237 details

Gene Information

Locus tag: Franean1_7237
Symbol:
ID: 5675538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8836043
End bp: 8837932
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 79%
IMG OID: 641246074
Product: hypothetical protein
Protein accession: YP_001511462
Protein GI: 158318954
COG category:
COG ID:
TIGRFAM ID:
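
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(8836043, 8837932)
print(length, "bp")                     # 1890 bp
print(protein_length_aa(length), "aa")  # 629 aa
```

Both values agree with the Gene Length and Protein Length fields in the record.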


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0824468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.337859
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGACG GTGCGTGGAG AGGTGACCCT CGGGATCCGG CTGGGCCCGG CTACCGGGAA 
CGGCCGCCCG GCGCACCGGA CGACCCGGGG CCGGACGGCT ACTGGGACGG CCCGGACCAG
CATCAGCAGT ACCGGAACGA TCCCCGCGGC AACGGTTCCC ACCAGGGCGT GCCGCACTAC
CAGGGCGGCC CGCCCGCGGG ATATCCCGAC GGCCCCCAGC CGGGCGGCGC CTACCAGCAG
CCGGGATACC CGCCGGCCGC CGACCCCGGG CGCGCCTACG GCTCCGGCCC CGGTGGGGGT
CCCTACCCGC CGGCAGGACC GGCGGGAGGC CAGTACTCGC CGCCGCCGCC CGGTCCCGGT
GCCCGCCCGC CCGCGGGTGG TGGCGCCCCC TATCCTCCGG GCGGCCCGGG CCCCGCGACC
GGCCCGTACC CGCCGCCCGG CCCCGCCACG GGCGGCTACC CGTCGGCGGA CGGCCCGAGC
GGCCCGGCCA GCCCGGGCGC TCCGGGCGGT GGGGGTGCCC GCCGGGACTC GACCTCCGTC
CGCGGTGCCC TGGTGCCGCG CGCCGGCGCC GGCCGTGCCC CGACGCGCCC GCCGGGGTCG
ACCGGTCCGG CGCCGCGGCA GGCCCCCGAG CGGGGGAGCG CGGTAGATGC CGGCGCCGCC
CGTCCGGACC TGTATCAGCC GGAGCCGTCC CGAGCGGCCG CGCGCCCGGG GACGGACGCG
GGCGCGCGCC GGGGCGGTCC CGACCCGGCC TACCGGGACG CGGCGTACCC GCAGATGGCA
CACCGCGACG TCGCGGTGCC CGACGCCGAC CCGCGCGACG CCGGGCGGCG GGACGTGCCC
TACCGCGACC GGGAACGCGG ATACCGCGAC GGACCCGCGC GGGACACGCC GTATCGCGAC
CCGGACGGCC CGGCTGACGA CGACGGCCGC GGCCCGATGC CCGGCACCGG CCCGCGCGCG
CGGGCCCGGG CCGCCCGGCG CGCCGGCGGC CAGGACGGCA CCGGCCCGCA GGACCGGGCC
AGATCCACGG GCGCCGGCAC CGAGGTGCTG GGAGCCGTCG GCGCGGCTTC CGCCGGGACC
GGGCCGCGCC GGGCCGCGCC CCCGCGGCAG GGCGGCTTCG ACGAGGTCGA CTTCCCCGGC
GAACCTGACT TCCCGGACGA TGGCGACGGC CTCGACGGTC CGGACGGCGG CGAGAGCGCC
GGGCTCGGGC CCTTCCTGCG CCGCCTCGTG ATCGCCCTGG TCGTGCTCGG CGTGGCCCTC
GCGGTGGGCG TCGGTGCCGG CGTCATCTGG GAGAAGGTGC GCCCGAGCGG CGATACGGCG
ACGACGGCGA ACACGCCCCC GACGGCGACG CCCGGCACGG GCCCGTCGGC TTCCCCCGCG
CCGTCCACCG GTGCCCCCGC GGGCGGCGGC CAGCCGCAGG CCGCGGTGCC CGCGGACTGG
GTGGCCTTCA CGGACCCCGA CCAGAAGGCG ACGTTCTCCC ATCCGCCGAC CTGGAAGCAA
CGACGGGACA ACACCGGTGT GTTCTTCGGG GAGCCGGGGG CGGGCGCGGT GGGCACACCC
GCCGAGTACG GCCCGCAGAT GATCGGCGTC GCCCGGGTCG CGGGCGCGGA CGCCGCGACG
GCGCTCAGCC AGGTCCAGAG CAGTGAGTTC GGCAGCGTCT CGGGCCTGAC TCAGGACCGC
TCGGGCCCGG CGACGGACAC GTCCGGCGCG ACTGTGCAGG AACTGGCGGG CTCCTACGAC
CGTGACGGCC AGCGCGTCTC GTACCTCATG CGCACGAGCG AGGCGCCCGG CGCGGTCTAC
GTGCTCATCG CCAGGGTTCG GGCGGACGCC TCGGCGTCAC TGAACACGAT GATGGGCGCG
CTGCGCGCCT CGTTCCAGCC GGCCGCCTGA
 
Protein sequence
MTDGAWRGDP RDPAGPGYRE RPPGAPDDPG PDGYWDGPDQ HQQYRNDPRG NGSHQGVPHY 
QGGPPAGYPD GPQPGGAYQQ PGYPPAADPG RAYGSGPGGG PYPPAGPAGG QYSPPPPGPG
ARPPAGGGAP YPPGGPGPAT GPYPPPGPAT GGYPSADGPS GPASPGAPGG GGARRDSTSV
RGALVPRAGA GRAPTRPPGS TGPAPRQAPE RGSAVDAGAA RPDLYQPEPS RAAARPGTDA
GARRGGPDPA YRDAAYPQMA HRDVAVPDAD PRDAGRRDVP YRDRERGYRD GPARDTPYRD
PDGPADDDGR GPMPGTGPRA RARAARRAGG QDGTGPQDRA RSTGAGTEVL GAVGAASAGT
GPRRAAPPRQ GGFDEVDFPG EPDFPDDGDG LDGPDGGESA GLGPFLRRLV IALVVLGVAL
AVGVGAGVIW EKVRPSGDTA TTANTPPTAT PGTGPSASPA PSTGAPAGGG QPQAAVPADW
VAFTDPDQKA TFSHPPTWKQ RRDNTGVFFG EPGAGAVGTP AEYGPQMIGV ARVAGADAAT
ALSQVQSSEF GSVSGLTQDR SGPATDTSGA TVQELAGSYD RDGQRVSYLM RTSEAPGAVY
VLIARVRADA SASLNTMMGA LRASFQPAA
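
The sequences above are shown in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that cleanup, together with a simple GC-fraction calculation. The helper names are hypothetical, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "GTGACGGACG GTGCGTGGAG AGGTGACCCT CGGGATCCGG CTGGGCCCGG CTACCGGGAA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(seq[:3])           # GTG
print(gc_fraction(seq))  # GC fraction of this one line only
```

Applied to the complete gene sequence block, the same two helpers give values that can be compared with the Gene Length and GC content fields in the record. The gene begins with GTG, which is an allowed start codon under translation table 11 and is read as methionine at the start of the protein.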