Gene Franean1_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2112 
Symbol 
ID5670512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2538688 
End bp2539752 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content69% 
IMG OID641241033 
ProductPhoH family protein 
Protein accessionYP_001506454 
Protein GI158313946 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCGGC CCAGGCGGCC GTACTACATG CCCGAATCCG ACACCCCACC CGGCTCCCGG 
GTCACCACAC GCATCGTCGT CCCCGACGGG CACAGCATGG TGAGCCTGCT CGGGCACCAG
GACCAGCTTC TGCGTGTGAT CGAGAAGGCC TTCTCCTCTG ACATCCACGT CCGCGGCAAC
GAGATCACGA TCACGGGTGA CCCGGCGGAG AACGAGCTGG CCGCCAGGTT GTTCTCCGAG
CTCGTCGCGC TGCTTGACGC AGGCACCGAG ATCAGCCCGC AGCACGTCGA CCACTCGGTG
GCGATGCTGC GCAGCGGCGC GGGGGAGCGG CCTGCCGAGG TGCTCACCCT CAACATCCTG
TCCAACCGTG GTCGGACGAT CCGTCCCAAG ACGCTGAACC AGAAGCGGTA CGTGGACGCG
ATCGACCAGC ACACGATCGT GTTCGGGATC GGCCCGGCGG GCACCGGCAA GACCTACCTG
GCGATGGCCA AGGCCGTGCA GGCGCTGCAG GCGAAGAAGG CCAACCGGAT CATCCTCACC
CGGCCGGCGG TCGAGGCGGG TGAGCGGCTC GGCTTCCTGC CCGGGACGCT CTACGAGAAG
ATCGACCCGT ACCTGCGTCC GCTCTACGAC GCGCTGCACG ACATGATCGA CCCCGACTCG
ATCCCGCGGC TCATGCAGAG CGGCACCATC GAGGTCGCGC CGCTGGCGTA CATGCGCGGC
CGTACGCTCA ACGACGCCTT CATCATCCTG GACGAGGCGC AGAACACCTC GGCCGAGCAG
ATGAAGATGT TCCTGACCCG CCTCGGCTTC GGGTCCAAGA TCGTGGTGAC CGGTGACGTC
ACCCAGGTCG ACCTGCCCAG TGGCACGCAG AGTGGCCTGC GAGTGGTCCG CGAGATCCTG
GACGGCGTCG CCGACGTCCA CTTCGCGACC CTGACCAGCA CGGACGTCGT CCGGCACCGG
CTGGTCAGCG ACATCGTCGA CGCCTACGCG CGCTGGGACG CGGCGAGCCC GGCACCGAGC
ACGGACACGC GCCCCACGCG GGCGGCCCGC CGGGACCGCC GATGA
 
Protein sequence
MLRPRRPYYM PESDTPPGSR VTTRIVVPDG HSMVSLLGHQ DQLLRVIEKA FSSDIHVRGN 
EITITGDPAE NELAARLFSE LVALLDAGTE ISPQHVDHSV AMLRSGAGER PAEVLTLNIL
SNRGRTIRPK TLNQKRYVDA IDQHTIVFGI GPAGTGKTYL AMAKAVQALQ AKKANRIILT
RPAVEAGERL GFLPGTLYEK IDPYLRPLYD ALHDMIDPDS IPRLMQSGTI EVAPLAYMRG
RTLNDAFIIL DEAQNTSAEQ MKMFLTRLGF GSKIVVTGDV TQVDLPSGTQ SGLRVVREIL
DGVADVHFAT LTSTDVVRHR LVSDIVDAYA RWDAASPAPS TDTRPTRAAR RDRR