Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0988 |
Symbol | |
ID | 5669402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1157412 |
End bp | 1158407 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641239916 |
Product | putative integral membrane sensor protein |
Protein accession | YP_001505350 |
Protein GI | 158312842 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3300] MHYT domain (predicted integral membrane sensor domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.012877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAAACA ACATGAGCTT CGCCGCTCCA CTACAGAACG TGACCGTTGA GTATCTCGCC GCCGACCACG CCGACCACGC CCACCTTCAT GCTTCCTACA ACTTGTACTT CGTCGCGCTC TCCTACGGCT TGGCGGCGCT CGGCTCCTTC GCCGCGCTGG CCAGCGCGTC CCGGATTCGG GAACACGTCG GGTTCAGGCG GCTCGGCTGG TCAGCGGTGA CCGCGCTGGC GCTCGGCGGC GGCGGCATCT GGTCGATGCA TTTCGTCGGA ATGGTGGCCT ACCACATCGG GACGATGGTC ACCTTCGACA TGAAAGTCAC CACGCTCTCG CTACTCATCG CCGTGGCCGT CTCCGGAGTC GGAATCTGGG TCGTCGCCAG CGATCCCTTC AACGCCGGCC GGCTTATCGG CGGCGGGACC TTCGCCGGGT TGGGAATCGC GGCGATGCAC TACACCGGGA TGGCGGCGAT GCGGACCTCG GGGACGATCA GCTACCGACC CGGGCTCGTG CTGCTGTCCA TCGCTATCGC CGTGGTCGCG GCCGTCGCCG CCTTCGGGAT CGCCTTCCGC GTCGGCGGCA CCGGGCGCGT ACTCGGCGCG TCGTTCGTCA TGGCCGCCGC GGTCTGCGGG ATGCACTACA CGGCGATGGC GGCGACCCGC GTGACACCCG ACCCAACGCT GAGCCGGCCG ACCGGCTTCG ACCCGTTCGC GCTCGGCATC GTCTCCGCGT TCGGGTCCGT CATCGTCATG ATGTTCATCA TCGTCCACGC GCTCGGCGGC GTCAGTGACC CCGAGTTCAA CGTCCGGAAG ATCATGAGCG ACAGGCTTCC TGAGGGGTCC GGGCCGGACG CGACACCCGG CGTCCAGACC GGGACCACTG GCGTCCCACA TTGGTGGTCT TCTCGGCACA CTGAGCGTGG ACAGGCGGTC GCAGACGACG CGCGATCTGG CCGGGAGGTA GTGGACCCGC AGGCTGCCTG GGTGCGTGGA AGGTAG
|
Protein sequence | MGNNMSFAAP LQNVTVEYLA ADHADHAHLH ASYNLYFVAL SYGLAALGSF AALASASRIR EHVGFRRLGW SAVTALALGG GGIWSMHFVG MVAYHIGTMV TFDMKVTTLS LLIAVAVSGV GIWVVASDPF NAGRLIGGGT FAGLGIAAMH YTGMAAMRTS GTISYRPGLV LLSIAIAVVA AVAAFGIAFR VGGTGRVLGA SFVMAAAVCG MHYTAMAATR VTPDPTLSRP TGFDPFALGI VSAFGSVIVM MFIIVHALGG VSDPEFNVRK IMSDRLPEGS GPDATPGVQT GTTGVPHWWS SRHTERGQAV ADDARSGREV VDPQAAWVRG R
|
| |