Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0194 |
Symbol | |
ID | 5732040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 226580 |
End bp | 228307 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277318 |
Product | exopolysaccharide tyrosine-protein kinase |
Protein accession | YP_001542974 |
Protein GI | 159896727 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG0489] ATPases involved in chromosome partitioning |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.27748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATCA TCCGTCGTTA TATAACTGGT CTCCGTCGCT GGCTCTGGCT TTTAGTGCTA GGGCCAGTTG TCGCTGCTGG CGCGGCTTAT GGCATTAGCA GCCAACAAAC ACCGCGTTAT GCCAGCAGCA CTCGTGTGAT TGTCGGCCAA ACCCTCAAAA ATAGTAATCC CGATTATGGC AGTTTGGTGG CAAGTGAGCG CTTGGTAGCA ACCTATGCCC AAATCGCCCA AAGCCGCACA ACCATGCAGG CAGTTGAGCA GCGCTTAAAT TTGAGCGATA TGGCAAGTTC AGCGATCATC ACTACCCGCC CAGTCCAAGA AACTGAGTTT TTGGATATTG CGGTTGAGGC CAATGATCCG CAACAGGCTG CCGATATTGC CAATGCAATT GCTGACCAAT TGATTTTGAC TAGTCCGGCA GGGCCACAGA GCGCTGAAGC CAAATTGCTT GATGAAGTCA ATCGCCAAAT TGCTACGCTC AACGAGGAAA TTACCCGTAC CGATGAGGAA ATTAAAACCC TCAAGGCCGA AATTGAGCAA ATTGGGGCCG ATAAACCTGC TGCTGAATCG TTGATTGCGA ACTTGCAACT CAAACAACAG AGCCAAAATC AAAATCGCCA AACCCTCAGC ACGCTCTACA GCACGGCGCT GGGCAATCGC GCCAACTCGA TCAGCGTGGT CGAGGCTGCC ACGGTCAATC CAACCCCAAT TGCACCACGG CCAATTCGCA GCGCAATTTT GGCGGGGATT TTGGGCTTTG CCTTGGTATT TGGTTTGGCC TTGTTGATCG AATATTTTGA TGATAGCGTG CAAACGCCCG ATGAAGGCGT TGATCTCGTT AATGCGCCGT TGCTGGCAGC GATTGTCAAG CAAGAAACTA AGGTGACCAA AGCCTCGCAA CGCTTGGTTT CGCGACTTGA TCCGCGTTCA CCAACCGCTG AAACCTTTCG CACCTTGCGC ACGAATTTGC AGTTTTCGAA TGTTGATACC AAAGCTCGCA CATTGATTGT CACCAGCAGC CAGCCTGAAG AAGGCAAAAG CACAGTCGCT GCTAACTTGG CATGGGTGCT GGCGCAAGCA GGCCAAAAAG TCGTCTTGAT CGATGCTGAT TTGCGCAAGC CCATGATGCA CCGCGTGTTT GAGGTGAGCA GCGAATATGG CCTGACCAAT TTGCTGACCA ATAATGAAGA CCCAACGATC CGTGAGCGCA CGGTGCTATC GGTTGCCGAA AATTTGTGGC TCATTCCTAG CGGGCCTTTG CCTCCTAACC CCTCGGAATT GCTCAGCAGC AAACGCATGG AAATGCTGAT TTGGCTGTTG CAGCAAGAAT ACGATTGGAT TTTGTTCGAT ACACCGCCAA TTTTGACCGT AACCGACCCA ATCGCACTGA TTCCACGGGT TGATGGTGTA GTGTTGGTGG CCGAGGCCAA GCGCACCCGC CGCGATATGC TGGCAAAATG TCGGGCTGCG GTGCAAACCG TCGGCGGGCG GGTGATTGGC TTGGTCTTTA ACAAGCTTGA TCCGCGCTCC GAGGGCTATG GCGTTTACTA TACCTACTAC TACGATCAAC ACCATACTTC CAATCGTGGT CGCCGCTTTT GGAATCGCAA AGATGATCAT CAGCCAGTGC CGAGTATGAG CGAGCCAGCC CCGTTGGATC TGCATGATCC TGCGCTTGAT CGTTCGGAAG CCGCCTATGA GATGGCCAGC CATGAGCGCA GCAAGTAA
|
Protein sequence | MNIIRRYITG LRRWLWLLVL GPVVAAGAAY GISSQQTPRY ASSTRVIVGQ TLKNSNPDYG SLVASERLVA TYAQIAQSRT TMQAVEQRLN LSDMASSAII TTRPVQETEF LDIAVEANDP QQAADIANAI ADQLILTSPA GPQSAEAKLL DEVNRQIATL NEEITRTDEE IKTLKAEIEQ IGADKPAAES LIANLQLKQQ SQNQNRQTLS TLYSTALGNR ANSISVVEAA TVNPTPIAPR PIRSAILAGI LGFALVFGLA LLIEYFDDSV QTPDEGVDLV NAPLLAAIVK QETKVTKASQ RLVSRLDPRS PTAETFRTLR TNLQFSNVDT KARTLIVTSS QPEEGKSTVA ANLAWVLAQA GQKVVLIDAD LRKPMMHRVF EVSSEYGLTN LLTNNEDPTI RERTVLSVAE NLWLIPSGPL PPNPSELLSS KRMEMLIWLL QQEYDWILFD TPPILTVTDP IALIPRVDGV VLVAEAKRTR RDMLAKCRAA VQTVGGRVIG LVFNKLDPRS EGYGVYYTYY YDQHHTSNRG RRFWNRKDDH QPVPSMSEPA PLDLHDPALD RSEAAYEMAS HERSK
|
| |