Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2826 |
Symbol | |
ID | 3904738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3325447 |
End bp | 3328461 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880147 |
Product | ATP-binding region, ATPase-like |
Protein accession | YP_481913 |
Protein GI | 86741513 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.869401 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCGTC GGCGGCACTC GGCGGGGGCC GTGACCGTGA CCGTGACCGG GGCCGTGACC GGGGCCGCGG CCCCCGCCGT GGCTACGACC GGGGCGGGGG AGTTCCGATC GCGTCCGTGG CGGCGGCTGC GCCTGCACGA CCTCCCCGTG CGCGGGAAGC TCTTCGCGGC CTTGGCGGTG CCCATGGCGG CCTTCTTGGC GGTAGCGATT CTCGCCGGCG TGACCTGGCT GTCCGACGCG GCCAGCTACG GTCACGGGGT GACCTCGGCG AGGCTCGGCC GGGATGTGAC CGCGGCGGTG CACGAGATGC AGCTCGAGCG GGACCTCGCC GCGGGCTTCA TCATGAGCGG GCGTGGCGCC GCCCCGACCA GCAGGGCGCT GGCCGACCGG CTGAACACCG AGCAGCGTGT CGTGGACAAG GTGCTAACCC GTTTCCAGGC CTCGCTGGGG TCCTCCTCCG GCCGTCTCGG CCGAGGGGCC TCCCCGCTGT CCGTGCGGGT GCGGCAGAGC ATCGCCGCAC TGCCCGCGCT GCGTCAGGCC GTCCGGGGCG GGAAGCTGCC CATCGGCGCG ATCCTGAGCG AGTACTCGAC CACGATCAGC CGCCTGCTCG CCTTCGACCG CCGGATCGGG CAGGACCGGG ACGACGACGA GCTCCGCTAC GCCACCACGG TCCTCAATGA CCTGTCGACC ATCAAGGAGA TCAACTCCCA GATCCGCGGT CAGCTCTACG CCGCGGCCCT CACCGGCCGG TTCGAGTTCG GCGCCGCCGA ACGGATGTCC GGTCTCCTCG CCGAGGAGCA GACGGCCCAG GACGCGTTCC GCGCCGCAGC CCGACCCAGC GACCGGACCC GCTATGATCA GATCGTCAAC GGTCAGGCGG TGCTGACGGT CAAGCGCACC TCCGACCGTG CCATCCAGCG TCAGCGGCTG CCCGACCTCG GCATCGATCC CGACCAGTGG TTCGCGGCCA GCACCACCCA CATCGAGCTG CTGCGCACCG TCGAGACCGG ACTGCTCGAC GACGTCATCG GGTCCGCGAG CGACCTGCGC TCCACCGCCT GGGAGCGAAC CGCCATCCTC GGCGCGCTGA TCATCATTAT CATGGCCGGT GCGATCGTGT GGACGCTGGC GATCGCGCGC ACGATGACCG GCCCCCTGCG CAGGCTGCGG ACGGGCGCCC TCGACGTCGC CCACGAGCGC CTTCCCCGCC TGATCGAGCA GCTGCAAACC GCGTACCCGG ACCAGGTCGA CACGACGATC CGGTCCATCG CCGTCGACTC CCGCGACGAG ATCGGCGAGG TCGCGCGCAC CTTCGACGAG CTGCAGCACG AGGCGGTGCG GCTGGCCACC GAACAGGCCG GCCTGCGGCG CAACGTCAAC ACCCTGTTCC TGAGCCTGTC CCGGCGCAGC CAGAGCCTCA TCGAACGGCA GATCGCGCTC ATCGACCGGC TGGAGGCCAG CGAGGAGAAC CCCCTCCAGC TGGAGAACCT GTTCAAGCTC GACCACCTGG CGACCCGGAT GCGCCGCAAC AGCGAGAACC TGCTCGTGCT CGCGGGGGCC GGCCCGGGCC GGCGCCGGGC GGGTCCGGTA CCGCTGGGTG ACGTGCTGCA GGCCGCGATC GGCGAGATCG AGCAGTACGA ACGGATCCAG ATCATCGAGG TGCCGGAGGT GCAGGTTGCG GCCGACTCCG TCAACCACGT CGTCCACCTC GTCGCCGAGC TGTTGGAGAA CGCCGCCCAG TTCTCCCCGC CATACCTCGC GGTCGACCTC GCGGCGTCCC GCCTTGACGA CGGCGGATTG CTCATCATCA TCGACGACGC CGGCCTCGGG ATGTCGGAGA AGGAGCTGAC CGCAGCCAAC CAGCGGCTGG CCGAGCCTCC GGTGTTCGAC TTCTCGATCG CCCAACGCCT CGGCCTGTTC GTCGTGGCCC GCCTGGCCGC GCGGCACGAC ATCGAGGTGC GGCTGGCACG CTCGGATACC GGCGGGGTTC GCGCGCTCGT CCATCTGCCC GCCGTGCTGC TCGTCGCGGG CAGCGCGTCT CGTGGCAGCG CGTCTCGTGG CAGCGCGTCT CATGGCAGCG CGTCTCATGG CAACGGCCTG GCGGCTCCCG GCCTGGCGGG GCCCGGTGCG ACGGGCCTGA ACGGTTACGG GCCTGCGGGG CCTGCTGGGC CTGCGGGGTC GCTCGGCGGT CCCGATACGC CGCCGGCCGC TCTCGGCTCC TACACCTCCT ACACACCGAC CGAGGAGTGG TTCCGGCCGC GCGGTGTCGA TCTCGACGTC GTTGACGAGC TCGCCGACGA CGCCGACCTG CCCACCGCGG CCGAGGCGGG TGGCGGGTCG CGTCCGGGCG CCGCCACGGG ATCGGATGGC GGGGCCGCCC TGCCTCCGAT AGAGGAGGCG GTGGCCGCGC TCCGTCGTCG GCGCGAGCGT GGGGCGGACA GAAGGATGCG CGCCGCCGCG GTGCGGTCGC CCGAGCCCTC CGAGCCCTCC GAGCCCTCCG AGGTCCCCGA GGTCCCCGAG GTCCCCGAGG TCCCCGAGGA GAAGTTCAAC TGGTTCACGC GAGATCCGGC ACCGTCGCGT TCCGCGGAGC GCATCCCACC CGCGTCCCTT GAGCCACCCG CGTCCCTTGA GCCACCCGCG TCCCTTGAGC CCTCAGCGGC CGGTGGGGCG GCACGGTCAG GCGGGGCGAT AACCGTCGGT GGGGTACCGG GCTCAGGCGA AGCGGCGGTG AGTGACCATC CCCCCATCGG GCCAGACGGT CCCAGGACCC CCTCCGGGCT GCCGATCCGC GCCCCGCAGA CCCACGGGCT CCTCGGCCCG GACCCGTTGG GCACACCCCC GTCCAGGTCC GCGACCGCTC GCCCCCCCGA TGCCGGTCCC GACCTTTTCG GCCCGGCAGC GGCGCTCACG CCGGCCCCGG AACGGCAGAC GGTGGCGCCC GAACGGATCC GCGGCCGCCT GAGCCGACTG TACGAAGGCG TTCACCATGC CCGTGGTGTC AAGGGAGACC CCTGA
|
Protein sequence | MRRRRHSAGA VTVTVTGAVT GAAAPAVATT GAGEFRSRPW RRLRLHDLPV RGKLFAALAV PMAAFLAVAI LAGVTWLSDA ASYGHGVTSA RLGRDVTAAV HEMQLERDLA AGFIMSGRGA APTSRALADR LNTEQRVVDK VLTRFQASLG SSSGRLGRGA SPLSVRVRQS IAALPALRQA VRGGKLPIGA ILSEYSTTIS RLLAFDRRIG QDRDDDELRY ATTVLNDLST IKEINSQIRG QLYAAALTGR FEFGAAERMS GLLAEEQTAQ DAFRAAARPS DRTRYDQIVN GQAVLTVKRT SDRAIQRQRL PDLGIDPDQW FAASTTHIEL LRTVETGLLD DVIGSASDLR STAWERTAIL GALIIIIMAG AIVWTLAIAR TMTGPLRRLR TGALDVAHER LPRLIEQLQT AYPDQVDTTI RSIAVDSRDE IGEVARTFDE LQHEAVRLAT EQAGLRRNVN TLFLSLSRRS QSLIERQIAL IDRLEASEEN PLQLENLFKL DHLATRMRRN SENLLVLAGA GPGRRRAGPV PLGDVLQAAI GEIEQYERIQ IIEVPEVQVA ADSVNHVVHL VAELLENAAQ FSPPYLAVDL AASRLDDGGL LIIIDDAGLG MSEKELTAAN QRLAEPPVFD FSIAQRLGLF VVARLAARHD IEVRLARSDT GGVRALVHLP AVLLVAGSAS RGSASRGSAS HGSASHGNGL AAPGLAGPGA TGLNGYGPAG PAGPAGSLGG PDTPPAALGS YTSYTPTEEW FRPRGVDLDV VDELADDADL PTAAEAGGGS RPGAATGSDG GAALPPIEEA VAALRRRRER GADRRMRAAA VRSPEPSEPS EPSEVPEVPE VPEVPEEKFN WFTRDPAPSR SAERIPPASL EPPASLEPPA SLEPSAAGGA ARSGGAITVG GVPGSGEAAV SDHPPIGPDG PRTPSGLPIR APQTHGLLGP DPLGTPPSRS ATARPPDAGP DLFGPAAALT PAPERQTVAP ERIRGRLSRL YEGVHHARGV KGDP
|
| |