Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1969 |
Symbol | |
ID | 5670370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2365089 |
End bp | 2366795 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240890 |
Product | GAF sensor signal transduction histidine kinase |
Protein accession | YP_001506312 |
Protein GI | 158313804 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4585] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.197678 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.851655 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCC CGGACGCGTT CGACGAGTTC GGCGCGGAGC GCCCCGGCCG GCTCGAGCTG GACGAGCTGA TGGCGCAGCT CGTCGAGCGC GCGCATGAGG TGATGACCAC CCAGGGCCGG CTGCGCGGCC TGCTGCGGGC GCACCGCGCG GTCGCCGCCG ACCTGAGCCT GGAGGTCGTC CTCCGGCGGA TCGCGGAGGC CGCCTGCGAG CTGGTCGACG CCCGCTATGG CGCGCTCGGC GTGATCGCCC GCGACGGCCG GCTCGAACAG TTCATCCACG TCGGGATGGA CGCCGATCTG GTCGGCCGGA TCGGCCACCT GCCCCGCGGC GAGGGGGTGC TCGGGCTGCT GACCGCCGAG CCCCGGGCCG TCCGCCTCGA TGACATCGCC GCGCACGAGC ACGCGGTCGG CTTCCCGCCC GGCCACCCGC CGATGCGCAC GTTCCTCGGC GTCCCGATCA AGGTCCGCAG CGAGGTCTTC GGGAACCTCT ACCTGACCGA GAAGCGCGGC GGGCGCCGTT TCACCGCGCA GGACGAGGAG CTCGTCCTGG CCCTCGCCGC GAGCGCCGGC GTGGCGATCG AGAACGCCCG GCTGTTCGGT GCGGCGCAGC GCCGCCAGCA GTGGCTGCAG GCATCCGCGG ACATCATGCG CCACCTGCTG GCGGACGGGC CGGAGCCGCT CACGCTGATC GTCGCGCGGG CCCGCGAGGT CGCCGACGCC GATCTGGCGT GCGTCCTGCT CGCCGACGGA GCGACCGAGG AGCTGCTCGT CGACGCGGCC GACGGCCCGC AGGCCGACCG CCTCCTGGGC GAGTCCGTTC CGATGGCCGG AACCCTCGCC GGCCGGGCGG TCGCGGCCGG GCGGCCCCTG CTGGTCGACG ACGCCGCGGC CGAGCCGGGC GTCACCGGCT TCGGTGGCCT CGACATCGGC CCGCTGATGG TCATCCCCCT GGTCGGGGCG CAGGTCGGGA CGGGCGCGGT CGTGCTGGCC CGGGGGCCCG CGGGCCGGCC GTTCGCCGAC GGCGATCTGG ACATGGCGGC GACGTTCGCC GGGCATGTGC AGGTCGCCCT CGGCCTGGCC GCGTCCCGGG CCACCCGTGA CCGGCTGCTC GTCCTGGAGG ACCGCGACCG GATCGCCCGC GACCTGCACG ACCACGTCAT GCAGCGGCTC TACGCCGTCG CGCTGGGTCT GCAGGGGATG GCGGCCGCCG AGGAGCGCCC GCAGTCCGCC GGCCGGCTCA CCACCTACGT CGACGACCTC GACGCGACCA TCCGGGAGAT CCGCTCGACG GTCTTCGAGC TGCGCGGGCG GCGCAGCACC GGCGGGCCGG GCGTGCGGGC CCGCCTCGGC GAGATCGTCG AGGAGGTCGC CGAGGCGCTC GGCTTCAGCC CGCGCCTGCG GGTGGACGGC CCGCTCGACA CCGCGCTGGA GGGGAACATC GCCGACCATC TCCTCGCCGT CGCGCGGGAG AGCCTGTCGA ACGTGGCGCG CCACGCCCGC GCCAGCCGGG TGGAGCTGTC GGTCACCGTC GGCCAGGGCT GGCTGTGCGC CGAGGTCACC GATGACGGGG TCGGGCTGGG CGACACCGGC CGGCGCAGCG GCCTGCGCAA CCTGCGCAGC CGCGCCGAGG AGCTCGGCGG GACCTTCGAC ATCGCCCCCG GCCCGTCCGG CGGCACCCGG CTGCGCTGGG CGGTCCCGCT GCCGTAG
|
Protein sequence | MDVPDAFDEF GAERPGRLEL DELMAQLVER AHEVMTTQGR LRGLLRAHRA VAADLSLEVV LRRIAEAACE LVDARYGALG VIARDGRLEQ FIHVGMDADL VGRIGHLPRG EGVLGLLTAE PRAVRLDDIA AHEHAVGFPP GHPPMRTFLG VPIKVRSEVF GNLYLTEKRG GRRFTAQDEE LVLALAASAG VAIENARLFG AAQRRQQWLQ ASADIMRHLL ADGPEPLTLI VARAREVADA DLACVLLADG ATEELLVDAA DGPQADRLLG ESVPMAGTLA GRAVAAGRPL LVDDAAAEPG VTGFGGLDIG PLMVIPLVGA QVGTGAVVLA RGPAGRPFAD GDLDMAATFA GHVQVALGLA ASRATRDRLL VLEDRDRIAR DLHDHVMQRL YAVALGLQGM AAAEERPQSA GRLTTYVDDL DATIREIRST VFELRGRRST GGPGVRARLG EIVEEVAEAL GFSPRLRVDG PLDTALEGNI ADHLLAVARE SLSNVARHAR ASRVELSVTV GQGWLCAEVT DDGVGLGDTG RRSGLRNLRS RAEELGGTFD IAPGPSGGTR LRWAVPLP
|
| |