Gene Franean1_3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3858 
Symbol 
ID5672221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4586246 
End bp4587676 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content77% 
IMG OID641242736 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_001508156 
Protein GI158315648 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR01386] heavy metal sensor kinase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.296973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCTGC GGGTACGACT GACCCTGCTG TTCGTCCTGG GCACCACGCT GGTGCTCGCG 
GCCGCCGGCG TGCTGTTCTA CGCGCTGCTG CGCACCAACC TGCGGAACTC CGTCGACGCG
AACCTGCGCA CCCGGATGGC CGTGCTCGCC GCGCAGGTCC CCGCCGCGGC GGACCCGGCC
GCGCAGCTGC GGGCCTACGG CGCCGGGCCC GCCCAGCTGC TGCGCCCCGA CGGCTCCGTC
GCCGCCTCGA ACGACAGCGC CGGTCCCGAG CCGCTGCTCG AGCCCGCCCA GGTGGCCACG
GCGCTCGCCC GCTCGGAGTC GCTCACGTCC GGCGAGCTGC GGGAGCCGGG CGGCGACGAT
GACGACGCGC GCGCGCTGGC CGTCCCGGTG CGGACCGGGG CGCGGGACGG GGTTTTGGTC
GTCGCGACGA GCACCGATCT GACCGACTCG GCCGAGGACC GCGTCCGCAA CATCATGGTC
AGCGCGACGG CGCCGATGGT CGCGCTCTCC GGGCTCGCGG CCTGGCTGCT GTCGGGCGCC
GCCCTGCGCC CGGTCGATCG CATGCGCCGG CAGACCGCCG CCATCAGCGA GTCGGACAGC
TCCGCCGAGC TGGACGTGCC CCCGACCCGC GACGAGATCG CCGCCCTGGC CGCCACGATG
AACAACCTGC TGCGCCGGCT GCACGCCGCC CGCGCCCGCG ACCGGGCGTT CGTCGCAGAC
GCCGGGCACG AGCTGCGGAC CCCGCTCACC AACCTCAAGG CCGAGCTCGA GCTCGCCGGC
CGCCCGGCCC GCACCCGCGA CGAGCTCGTC GACGCGGTCG CGGGCGCGGC GGAGGAGACC
GAGCGGCTGA TCCGGCTCTC CGAGTCACTG CTCACGCTCG CCCGGATGGA CAGCGGCATC
ACCGCCCCGC GGCGGCTGGA CGCCGGCGAC CTCCTCGAGC GGGCCGCCCG CGCCGCGACC
GGCCACGCCG AGACCAGGCA GGTCCGCCTC CACCTGGACG CCGACCCGGG CCTCGCCGTG
GACGCGGACC CCGACATGCT CCGCCAGGCG GTCGACAACC TGGTGGCCAA CGCCATCCGT
CACGCGCCGC CCGGCACCGC CGTGGACGTC CGGGCCGGGC CGGGTGAGGC CGGGAGGACC
GTCGTCGTGC GGGTGCGTGA CCGCGGCCCG GGGTTCCCCC CGGACTTCCT GCCGCGTGCC
TTCGAACGTT TCAGCCGCGC CGACGCCGCG CGCACCCGCG ACCACGGCGG CACCGGCACC
AGCGGAAACA GCAGCGGGAC GGGCCTTGGT GGCACCGGGC TCGGGCTCGC CATCGCCGCG
GCGGTGGCGC GGGCCCACCA GGGGACCGCC ACCGCCGCCA ACCATCCCGA CGGCGGCGCC
GTCGTCACGC TCACGCTGCC CGCCGCGGGC GGTCTCCCGC CGGACCGGTA G
 
Protein sequence
MPLRVRLTLL FVLGTTLVLA AAGVLFYALL RTNLRNSVDA NLRTRMAVLA AQVPAAADPA 
AQLRAYGAGP AQLLRPDGSV AASNDSAGPE PLLEPAQVAT ALARSESLTS GELREPGGDD
DDARALAVPV RTGARDGVLV VATSTDLTDS AEDRVRNIMV SATAPMVALS GLAAWLLSGA
ALRPVDRMRR QTAAISESDS SAELDVPPTR DEIAALAATM NNLLRRLHAA RARDRAFVAD
AGHELRTPLT NLKAELELAG RPARTRDELV DAVAGAAEET ERLIRLSESL LTLARMDSGI
TAPRRLDAGD LLERAARAAT GHAETRQVRL HLDADPGLAV DADPDMLRQA VDNLVANAIR
HAPPGTAVDV RAGPGEAGRT VVVRVRDRGP GFPPDFLPRA FERFSRADAA RTRDHGGTGT
SGNSSGTGLG GTGLGLAIAA AVARAHQGTA TAANHPDGGA VVTLTLPAAG GLPPDR