Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_1985 |
Symbol | |
ID | 8753656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 2057159 |
End bp | 2058328 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | histidine kinase dimerization and phosphoacceptor region |
Protein accession | YP_003409051 |
Protein GI | 284990497 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGCTTCG CCGGCGAGGC TGAACGGCCG GAGATGGCTG GGCGCGGCAG GTGGCGAGGC ACACTGTTCG CGGTCGCCGT TGGCGCGCTC TGGCTGATTG TCGCCCGGGA CGAGTGGGGT TCGGCCGGGG CGCTGTTGTG GATCGACGTC GCCCTCGGCG TCGCCTCTGT CGCGGCCATG CAGTTCCGCC GTCGCCGGCC GCTGGCCGTC ACGGTCCTGA CCGTGGCTGC GACCGCGGTC TCGGCCAGCG CCATCGGCGC GTGGGTCGTC TGCCAGGTGT CGCTGTCGGC CCGCCGGCGT TGGCCCGAGG TCCTGCCGAC CGCGGTGCTG TCCGTCGCGA CCGGCCAGAT CCTCTACGCC GTGCAGCCGG ACCAGGCGCT GCCCTGGTAC GTGAACCTCA TCTTCACGGC TCTCGCCACG GGCGTCGTCG TCGCCGTGGG CATGTACGTC GGTGCGCGTC GCGAGCTCGT CACCTCGCTG CGTGACCGGG CCGAGCGCGC CGAGCGCGAG CAGAGGCTGC GGGTCGCCGC TGCCCAGGCG GGTGAGCGGG CGCGCATCGC CCGCGAGATG CACGACGTGC TGGCACACCG CATGTCGCTG GTGGCGCTGC ACGCCGGGGC GCTGGTCTAC CGCACCGATC TCAGCGCGGC CGAGACCCAG GGGACGGCGG GCATCATCCA GGCCAACTCT CAGGCCGCCC TCGCCGACCT GCGGGAAATC CTGGGTCTGC TGCGGGACAC CGAACGGGGC GAAGACCCGA CCGGCCACCG GCCGCAGCCC ACCCTGGGCG ATCTGGACAC CCTGCTCGAC GAGGAGCGCG CCGCCGGCGC ACACATCACG GTGCACTCCG ACCTCGAGGA CCGCGACGCG CTTCCGGTGT CCACAGGCCG CAGCGCCTAC CGGATCCTGG AGGAGGGCCT GACAAACGCC CGCAAGCACG CCCCGCATGC CGCCGTCACC GTCGAACTGA CGGGTCGTCC GGGGGACGGG GTCGACCTCA TCGTCCAGAA CCCGGTCCGT GTGGACGACA ACCACCACGG CAACGACGCC ACCGGCTTCG GCCTCGTCGG ACTGGCCGAG CGGGCCGCGG CCAGCCACGG CCACTGCCAG CACGGCGTCA TGGCTGACGG CGACTTCGTC CTCCGGGCCT GGCTTCCGTG GGACCGATGA
|
Protein sequence | MGFAGEAERP EMAGRGRWRG TLFAVAVGAL WLIVARDEWG SAGALLWIDV ALGVASVAAM QFRRRRPLAV TVLTVAATAV SASAIGAWVV CQVSLSARRR WPEVLPTAVL SVATGQILYA VQPDQALPWY VNLIFTALAT GVVVAVGMYV GARRELVTSL RDRAERAERE QRLRVAAAQA GERARIAREM HDVLAHRMSL VALHAGALVY RTDLSAAETQ GTAGIIQANS QAALADLREI LGLLRDTERG EDPTGHRPQP TLGDLDTLLD EERAAGAHIT VHSDLEDRDA LPVSTGRSAY RILEEGLTNA RKHAPHAAVT VELTGRPGDG VDLIVQNPVR VDDNHHGNDA TGFGLVGLAE RAAASHGHCQ HGVMADGDFV LRAWLPWDR
|
| |