Gene Francci3_0470 details

Gene Information

Locus tag: Francci3_0470
Symbol:
ID: 3903201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 550454
End bp: 551731
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 70%
IMG OID: 637877801
Product: histidine kinase
Protein accession: YP_479585
Protein GI: 86739185
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID:
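
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 550454
end_bp = 551731
gene_length_bp = 1278
protein_length_aa = 425


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose last codon is a stop.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Do the coordinates agree with the stated gene length?
coords_match = span_length(start_bp, end_bp) == gene_length_bp
# Does the gene length agree with the stated protein length?
protein_match = expected_protein_length(gene_length_bp) == protein_length_aa

print(coords_match, protein_match)
```

Under these assumptions both checks agree with the record: 551731 - 550454 + 1 = 1278 bp, and 1278 / 3 - 1 = 425 aa.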


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.53157
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCGG TGCCTGGAAC AGCCGGTTCG ACGACTGACC TGACTACTGG TTCGGCGTCC 
GGCGGGGACA CCGGCGGCAT GACCTTGCCG GCAAGGCTCG TGCGTCGGGT GATCGCCGGC
CTGCCGACCG GTCTGCTCGT CCTCGACGCC GCGGACCGGG TGGTCCTGGT GAACATGGTC
GCGCGACGTA TGGGTGTGGT CGCCGCCGAC GAGATCGCGG TGGCGGAGCT GGCCGATCTC
GTCCGGGCCA CCCGGCTCGC GGGCAGTGAC CAGGAGCGGC AGCTTGAGCT GCCGCCCGTC
CCCGAGCCCC CCCTCACCCG TCCCCGACCG GACCAGGAGG GGCTGGCGGT GCGTGCCCGG
GCCCGGCTGC TGGACTCGTC CGGTCATGTC GCCGTCATCG TGGATGACAT TACCGAGTCG
CGTCGGGTCG AGGCCGTCCG TCGGGACTTC GTGGCGAACA TCAGCCACGA GCTCAAGACG
CCGGTCGGTG CGTTGCACGT CCTCGCCGAA GCGGTCGCCG CGGCCTGCGA GGACCCGGTG
GCAGTCCGCC GGTTCGCCTC CCGGATGACC CACGAATCGA CCCGGCTCGC CCGTCTTGTT
CAGGAGATCA TCGATCTCTC CCGGCTGCAG GGCGCCGATC CGCTGCCCAA CCTGCGGCCG
ATGCGGGCGT CCGCGGTGCT CACCGAGGCG GTCGACCGCA CCCGGCTGGC AGCGCAGGCC
CAGGCGATCT CGGTTGCGGT GATCGGCGAC GGTGACCTGC CGGTGTGTGG GGATGAGGGC
CAGCTCGTGA CCGCCGTCGC GAACCTGCTC GACAATGCGA TCAGCTACTC GCCGCGTGGC
ACCCGGGTTG TGCTCGGGGT TCGGCGCAGC GGTGAGACCG TGGAGATCTC CGTCGCCGAC
GAGGGCATCG GGATCGCGGA GAAGGACCTG GAACGGGTCT TCGAACGCTT CTATCGGGCG
GATCCGGCGC GATCCCGCGC GACCGGTGGG ACCGGCCTGG GGCTCGCCAT CGTCAAGCAC
ATCGCGACCA ATCACGGCGG CGTGGTCTCT GTGTGGAGCG CCGAGGGCGC GGGTTCCACC
TTCACGCTCC GGTTGCCGCT GTTCACCGGC GACGATGACG ATGCGATGAC GGACGGCTCG
GACGAAACCC GCGAGGATGA CGGGGTGGAT GGCTTCGACG TCGTCGGGGC CGATGCCGAC
GCCCGCGGTC GGGGCGATGG TGAACATCAT GGCGATGGTG AACATCATGG CGATGATGAA
CTCGGTGGCG GTTCGTGA
 
Protein sequence
MTAVPGTAGS TTDLTTGSAS GGDTGGMTLP ARLVRRVIAG LPTGLLVLDA ADRVVLVNMV 
ARRMGVVAAD EIAVAELADL VRATRLAGSD QERQLELPPV PEPPLTRPRP DQEGLAVRAR
ARLLDSSGHV AVIVDDITES RRVEAVRRDF VANISHELKT PVGALHVLAE AVAAACEDPV
AVRRFASRMT HESTRLARLV QEIIDLSRLQ GADPLPNLRP MRASAVLTEA VDRTRLAAQA
QAISVAVIGD GDLPVCGDEG QLVTAVANLL DNAISYSPRG TRVVLGVRRS GETVEISVAD
EGIGIAEKDL ERVFERFYRA DPARSRATGG TGLGLAIVKH IATNHGGVVS VWSAEGAGST
FTLRLPLFTG DDDDAMTDGS DETREDDGVD GFDVVGADAD ARGRGDGEHH GDGEHHGDDE
LGGGS