Gene Francci3_2826 details


Gene Information

Locus tag: Francci3_2826
Symbol:
ID: 3904738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3325447
End bp: 3328461
Gene Length: 3015 bp
Protein Length: 1004 aa
Translation table: 11
GC content: 72%
IMG OID: 637880147
Product: ATP-binding region, ATPase-like
Protein accession: YP_481913
Protein GI: 86741513
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
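
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon; the function names are illustrative and not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 3325447
end_bp = 3328461


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3015 1004
```

Both values agree with the Gene Length and Protein Length fields of the record.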


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.869401
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGCGTC GGCGGCACTC GGCGGGGGCC GTGACCGTGA CCGTGACCGG GGCCGTGACC 
GGGGCCGCGG CCCCCGCCGT GGCTACGACC GGGGCGGGGG AGTTCCGATC GCGTCCGTGG
CGGCGGCTGC GCCTGCACGA CCTCCCCGTG CGCGGGAAGC TCTTCGCGGC CTTGGCGGTG
CCCATGGCGG CCTTCTTGGC GGTAGCGATT CTCGCCGGCG TGACCTGGCT GTCCGACGCG
GCCAGCTACG GTCACGGGGT GACCTCGGCG AGGCTCGGCC GGGATGTGAC CGCGGCGGTG
CACGAGATGC AGCTCGAGCG GGACCTCGCC GCGGGCTTCA TCATGAGCGG GCGTGGCGCC
GCCCCGACCA GCAGGGCGCT GGCCGACCGG CTGAACACCG AGCAGCGTGT CGTGGACAAG
GTGCTAACCC GTTTCCAGGC CTCGCTGGGG TCCTCCTCCG GCCGTCTCGG CCGAGGGGCC
TCCCCGCTGT CCGTGCGGGT GCGGCAGAGC ATCGCCGCAC TGCCCGCGCT GCGTCAGGCC
GTCCGGGGCG GGAAGCTGCC CATCGGCGCG ATCCTGAGCG AGTACTCGAC CACGATCAGC
CGCCTGCTCG CCTTCGACCG CCGGATCGGG CAGGACCGGG ACGACGACGA GCTCCGCTAC
GCCACCACGG TCCTCAATGA CCTGTCGACC ATCAAGGAGA TCAACTCCCA GATCCGCGGT
CAGCTCTACG CCGCGGCCCT CACCGGCCGG TTCGAGTTCG GCGCCGCCGA ACGGATGTCC
GGTCTCCTCG CCGAGGAGCA GACGGCCCAG GACGCGTTCC GCGCCGCAGC CCGACCCAGC
GACCGGACCC GCTATGATCA GATCGTCAAC GGTCAGGCGG TGCTGACGGT CAAGCGCACC
TCCGACCGTG CCATCCAGCG TCAGCGGCTG CCCGACCTCG GCATCGATCC CGACCAGTGG
TTCGCGGCCA GCACCACCCA CATCGAGCTG CTGCGCACCG TCGAGACCGG ACTGCTCGAC
GACGTCATCG GGTCCGCGAG CGACCTGCGC TCCACCGCCT GGGAGCGAAC CGCCATCCTC
GGCGCGCTGA TCATCATTAT CATGGCCGGT GCGATCGTGT GGACGCTGGC GATCGCGCGC
ACGATGACCG GCCCCCTGCG CAGGCTGCGG ACGGGCGCCC TCGACGTCGC CCACGAGCGC
CTTCCCCGCC TGATCGAGCA GCTGCAAACC GCGTACCCGG ACCAGGTCGA CACGACGATC
CGGTCCATCG CCGTCGACTC CCGCGACGAG ATCGGCGAGG TCGCGCGCAC CTTCGACGAG
CTGCAGCACG AGGCGGTGCG GCTGGCCACC GAACAGGCCG GCCTGCGGCG CAACGTCAAC
ACCCTGTTCC TGAGCCTGTC CCGGCGCAGC CAGAGCCTCA TCGAACGGCA GATCGCGCTC
ATCGACCGGC TGGAGGCCAG CGAGGAGAAC CCCCTCCAGC TGGAGAACCT GTTCAAGCTC
GACCACCTGG CGACCCGGAT GCGCCGCAAC AGCGAGAACC TGCTCGTGCT CGCGGGGGCC
GGCCCGGGCC GGCGCCGGGC GGGTCCGGTA CCGCTGGGTG ACGTGCTGCA GGCCGCGATC
GGCGAGATCG AGCAGTACGA ACGGATCCAG ATCATCGAGG TGCCGGAGGT GCAGGTTGCG
GCCGACTCCG TCAACCACGT CGTCCACCTC GTCGCCGAGC TGTTGGAGAA CGCCGCCCAG
TTCTCCCCGC CATACCTCGC GGTCGACCTC GCGGCGTCCC GCCTTGACGA CGGCGGATTG
CTCATCATCA TCGACGACGC CGGCCTCGGG ATGTCGGAGA AGGAGCTGAC CGCAGCCAAC
CAGCGGCTGG CCGAGCCTCC GGTGTTCGAC TTCTCGATCG CCCAACGCCT CGGCCTGTTC
GTCGTGGCCC GCCTGGCCGC GCGGCACGAC ATCGAGGTGC GGCTGGCACG CTCGGATACC
GGCGGGGTTC GCGCGCTCGT CCATCTGCCC GCCGTGCTGC TCGTCGCGGG CAGCGCGTCT
CGTGGCAGCG CGTCTCGTGG CAGCGCGTCT CATGGCAGCG CGTCTCATGG CAACGGCCTG
GCGGCTCCCG GCCTGGCGGG GCCCGGTGCG ACGGGCCTGA ACGGTTACGG GCCTGCGGGG
CCTGCTGGGC CTGCGGGGTC GCTCGGCGGT CCCGATACGC CGCCGGCCGC TCTCGGCTCC
TACACCTCCT ACACACCGAC CGAGGAGTGG TTCCGGCCGC GCGGTGTCGA TCTCGACGTC
GTTGACGAGC TCGCCGACGA CGCCGACCTG CCCACCGCGG CCGAGGCGGG TGGCGGGTCG
CGTCCGGGCG CCGCCACGGG ATCGGATGGC GGGGCCGCCC TGCCTCCGAT AGAGGAGGCG
GTGGCCGCGC TCCGTCGTCG GCGCGAGCGT GGGGCGGACA GAAGGATGCG CGCCGCCGCG
GTGCGGTCGC CCGAGCCCTC CGAGCCCTCC GAGCCCTCCG AGGTCCCCGA GGTCCCCGAG
GTCCCCGAGG TCCCCGAGGA GAAGTTCAAC TGGTTCACGC GAGATCCGGC ACCGTCGCGT
TCCGCGGAGC GCATCCCACC CGCGTCCCTT GAGCCACCCG CGTCCCTTGA GCCACCCGCG
TCCCTTGAGC CCTCAGCGGC CGGTGGGGCG GCACGGTCAG GCGGGGCGAT AACCGTCGGT
GGGGTACCGG GCTCAGGCGA AGCGGCGGTG AGTGACCATC CCCCCATCGG GCCAGACGGT
CCCAGGACCC CCTCCGGGCT GCCGATCCGC GCCCCGCAGA CCCACGGGCT CCTCGGCCCG
GACCCGTTGG GCACACCCCC GTCCAGGTCC GCGACCGCTC GCCCCCCCGA TGCCGGTCCC
GACCTTTTCG GCCCGGCAGC GGCGCTCACG CCGGCCCCGG AACGGCAGAC GGTGGCGCCC
GAACGGATCC GCGGCCGCCT GAGCCGACTG TACGAAGGCG TTCACCATGC CCGTGGTGTC
AAGGGAGACC CCTGA
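
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any statistic such as the GC content field can be recomputed. The following is a minimal sketch of that step; `join_blocks` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "GTGCGGCGTC GGCGGCACTC GGCGGGGGCC GTGACCGTGA CCGTGACCGG GGCCGTGACC"
seq = join_blocks(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete gene sequence to the same two functions gives a value that can be compared with the GC content field.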
 
Protein sequence
MRRRRHSAGA VTVTVTGAVT GAAAPAVATT GAGEFRSRPW RRLRLHDLPV RGKLFAALAV 
PMAAFLAVAI LAGVTWLSDA ASYGHGVTSA RLGRDVTAAV HEMQLERDLA AGFIMSGRGA
APTSRALADR LNTEQRVVDK VLTRFQASLG SSSGRLGRGA SPLSVRVRQS IAALPALRQA
VRGGKLPIGA ILSEYSTTIS RLLAFDRRIG QDRDDDELRY ATTVLNDLST IKEINSQIRG
QLYAAALTGR FEFGAAERMS GLLAEEQTAQ DAFRAAARPS DRTRYDQIVN GQAVLTVKRT
SDRAIQRQRL PDLGIDPDQW FAASTTHIEL LRTVETGLLD DVIGSASDLR STAWERTAIL
GALIIIIMAG AIVWTLAIAR TMTGPLRRLR TGALDVAHER LPRLIEQLQT AYPDQVDTTI
RSIAVDSRDE IGEVARTFDE LQHEAVRLAT EQAGLRRNVN TLFLSLSRRS QSLIERQIAL
IDRLEASEEN PLQLENLFKL DHLATRMRRN SENLLVLAGA GPGRRRAGPV PLGDVLQAAI
GEIEQYERIQ IIEVPEVQVA ADSVNHVVHL VAELLENAAQ FSPPYLAVDL AASRLDDGGL
LIIIDDAGLG MSEKELTAAN QRLAEPPVFD FSIAQRLGLF VVARLAARHD IEVRLARSDT
GGVRALVHLP AVLLVAGSAS RGSASRGSAS HGSASHGNGL AAPGLAGPGA TGLNGYGPAG
PAGPAGSLGG PDTPPAALGS YTSYTPTEEW FRPRGVDLDV VDELADDADL PTAAEAGGGS
RPGAATGSDG GAALPPIEEA VAALRRRRER GADRRMRAAA VRSPEPSEPS EPSEVPEVPE
VPEVPEEKFN WFTRDPAPSR SAERIPPASL EPPASLEPPA SLEPSAAGGA ARSGGAITVG
GVPGSGEAAV SDHPPIGPDG PRTPSGLPIR APQTHGLLGP DPLGTPPSRS ATARPPDAGP
DLFGPAAALT PAPERQTVAP ERIRGRLSRL YEGVHHARGV KGDP
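
The protein sequence follows from the gene sequence under translation table 11, where the initial GTG codon is read as methionine. A minimal sketch of that translation is given below. It assumes the table is encoded in the NCBI convention (bases ordered T, C, A, G), and `translate_cds` is an illustrative name rather than a function from any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons that table 11 allows as translation starts.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGGCGTC GGCGGCACTC GGCGGGGGCC"))  # MRRRRHSAGA
```

The output matches the first ten residues of the protein sequence. Applied to the last 15 bases of the gene (AAGGGAGACC CCTGA), the same function returns KGDP, the final four residues.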