Gene Franean1_4414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4414 
Symbol 
ID5672766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5270758 
End bp5272434 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content70% 
IMG OID641243282 
Productsignal transduction histidine kinase regulating citrate/malate metabolism 
Protein accessionYP_001508699 
Protein GI158316191 
COG category[T] Signal transduction mechanisms 
COG ID[COG3290] Signal transduction histidine kinase regulating citrate/malate metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAA CACCCGGTGG TGCGGACGAC CCCAGCGGCG GCCGCGGCGC GCGGCTGAGC 
CGGCGCAGCA TCGCGGGCCA GGTGCTCGTC CTGCAGGTCG CGGTGGTCGT ACTTCTTGTG
GTGGCCGCGG TTGCCGCCTC GGTGCTACAG GCCCGGCGCA CGGCGGAACG CCAGGCACGC
GATCTCGCCG TCGCGGTCGC GGAGACCTTC GCCCATGCTC CAGGGATGGT CGCGGCCCTG
AGCTCGTCCG ATCCGAGCGC TTCGCTGCAG CCGCTGGCCG AGGCCACCCG AACCCGGACC
GGGGTCGACT TCCTCGTCGT GATGAATCCG GTCGGGATCC GGTACACCCA CCCTGACCCG
ACCCGAATCG GCCAGCGCTT CCTCGGACAC ACCGAACCGG CGGTACGAGG CGAGGTGCTC
ACCGAGACCT ACTCGGGGAC GTTGGGCCCG TCAGTACGCG CGGTCGTGCC GGTCACCACC
AGCGATGGGA AGATTGCCGC TCTGGTCGCC GCAGGCATCA CCGTCGAGAA CGTCAGCCAG
ACGGCCGGCC GGCAACTTCC GGTACTCCTG GTTGCCGGAC TGGCTGCCGT CGCGCTAGCG
GTGGGTGGCA CGGTGCTGGT CAACCGCCGC CTTCGGCGGC AGACTCGCGG CCTCGGCCCG
GTAGAGATCA CCAGGATGTA CGAACATCAC GACACCGTGC TGCACGCTGT TCGGGAAGGG
GTTCTCATCG TCGACGACCA GGCTCGGGTG CTCCTGGCCA ATGATGAGGC CCGCCGGCTC
CTCGGCCTCC CGTCGGACTC CCATGGGCGT CCGGTCGCGG AACTCGGTCT TGATCCGGCG
GCGCTGGACC TACTCGGCAG TGGTCGTCCA GCCGAGGACA CCGTGCTGCT GGCTGGCGAC
CGGATGCTCG CGGTCAATCA GAAACCCACC GTCCGCGACG GGAAACCGCT CGGCAGTGTC
GCGACGATCC GGGACACCAC CGAACTCCAA GCGCTGACCG GCGAACTCGA CGCGGTCCGG
GGCATGGCGG ATGCGTTGCG TGCCCAGAGC CACGAAGCCG CGAACCAGCT TCATATCGTG
GTCACCCTGA TGCAGCTCGA TCGGCTGGAC GAGGCCGTCG AGTTCGCCAC CGCCGGGCTC
GCTGGCTCAC AGCAGCTCGC TGACCGGATG CTGACCGCCG TCGACGAGCC GGTCCTGGCG
GCTCTGCTGC TCGGTAAGAC CGCCCAGGCG GCCGAGCGCG GCGTGGAACT CGTCGTCACC
GACGACACCC GGTTCGAGGT CTTCGACGTC GAGGCGGCTG AGCTGGTGAC GGTCGTCGGC
AACCTCATCG ACAACGCAAT CGAGGCCGCG GTCGCAGGAG AGGCCCCGAG GCGGGTGACG
GTCTCGGTGC GTACCGAGCC CGATGGCCTG ACGGTCCGGG TCTCTGACAC AGGTCATGGC
CTCGATCCGA CACAGGTCGC GGCGGCGTTC GAACGCGGCT GGACCACCAA GACGAGCACG
TCGACGGGCG AGGCAGCGCG CGGGCGAGGT CTCGGGCTCG CCCTGGTCCG GCAGGTCGTC
CGGCGCCACG GCGGCCAGAT CGAGGTGGGT CGCGACGGCG GTGCGGTGTT CACTGTCCGC
ATCCCTCGGC AACATGGGAG CGTCGACATG TCCACGGCGA TGACTCCGGC CCGATGA
 
Protein sequence
MTATPGGADD PSGGRGARLS RRSIAGQVLV LQVAVVVLLV VAAVAASVLQ ARRTAERQAR 
DLAVAVAETF AHAPGMVAAL SSSDPSASLQ PLAEATRTRT GVDFLVVMNP VGIRYTHPDP
TRIGQRFLGH TEPAVRGEVL TETYSGTLGP SVRAVVPVTT SDGKIAALVA AGITVENVSQ
TAGRQLPVLL VAGLAAVALA VGGTVLVNRR LRRQTRGLGP VEITRMYEHH DTVLHAVREG
VLIVDDQARV LLANDEARRL LGLPSDSHGR PVAELGLDPA ALDLLGSGRP AEDTVLLAGD
RMLAVNQKPT VRDGKPLGSV ATIRDTTELQ ALTGELDAVR GMADALRAQS HEAANQLHIV
VTLMQLDRLD EAVEFATAGL AGSQQLADRM LTAVDEPVLA ALLLGKTAQA AERGVELVVT
DDTRFEVFDV EAAELVTVVG NLIDNAIEAA VAGEAPRRVT VSVRTEPDGL TVRVSDTGHG
LDPTQVAAAF ERGWTTKTST STGEAARGRG LGLALVRQVV RRHGGQIEVG RDGGAVFTVR
IPRQHGSVDM STAMTPAR