Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2632 |
Symbol | |
ID | 5734510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3376517 |
End bp | 3377527 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279772 |
Product | aminoglycoside phosphotransferase |
Protein accession | YP_001545398 |
Protein GI | 159899151 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATCA TGCTGCTCTC AGATCATCGC GCATTTCCTG CGCTACGTGA TCAAGCCCTG CCAAGTTTAG GGATTTTCAC GCTACATCCA ACGTTCAATG TCGAACCAAT TATGAATACT CCCAGCGTGC AACGGGTGCA TGAAGCCCAT AGCGGAGTAC ACGTAGTAAT TAAGTCGTTT GAGCATAAAA CGGGCTTAGA TGGCCGGCGA TTTTTGCCCA CCTACTCACG CACGTTGTTG AACCGCGAAG CCACGACGCT GGCGCAATTG CATGCGCTGG GTTTTGCCCA AGCCCCCTAT TTGGTGCCAC GCCCACTGGC CACCGACCCT GAGCGCCTGC TGTTGATCAC TGAGTGGCTG CCAGGCCAAA GTTGGGGTTC AATTCTCGAA ACCACCCTGC AGACTCGCAA TCTTGGCCCA TTGATGGCCT ATGTCAATAC GATTACTGGC TGGTTGGCCA CCTTGCACCG TCGAACTGCC ACCACCGATA AAATTGATGT GACCAATGCT TGGGAATATT GGGAAAAATT GTTGCGCCAA TTGCGTGAAC AAGAATTGCT CAATCATGCA GCCTTGGCAA ATTTACGCAT GCTCCAAGTA CATTGGGATG CCAGCGGTGT GCTGCATCAA GGCCTGCGTT CGATGATTCA TGGCGATGCA ACTCCGGCCA ACATGCTCTT TACTGCTCCG GAGGAACTTG CATTAATCGA TTGGGAGCGA TCCCGCCACG ATGATCCAGC GATTGATATT GGTTGTATGG TGGCTGAATT GAAGCATGCC TTCTTCAATG CTACGGGTGA TCCAACCGCT GCCGAGTGGT TAATCCGCCG GGTTTATGAT CGCTATAGCT TGTTGAGCGA TTTTGATCAA GAGGAATTTG CCGCCTTTAC CCAACGTGGC CAATTTTATA TGGGCTGCTA TTTGCTGCGG ATCGCTCGCA ACGAATGGTT TAGCTGGGAA TATCGTCAAC GGTTGGTCGC GGAGGCACAG GCATGCCTGG TCGTCCGCTA G
|
Protein sequence | MTIMLLSDHR AFPALRDQAL PSLGIFTLHP TFNVEPIMNT PSVQRVHEAH SGVHVVIKSF EHKTGLDGRR FLPTYSRTLL NREATTLAQL HALGFAQAPY LVPRPLATDP ERLLLITEWL PGQSWGSILE TTLQTRNLGP LMAYVNTITG WLATLHRRTA TTDKIDVTNA WEYWEKLLRQ LREQELLNHA ALANLRMLQV HWDASGVLHQ GLRSMIHGDA TPANMLFTAP EELALIDWER SRHDDPAIDI GCMVAELKHA FFNATGDPTA AEWLIRRVYD RYSLLSDFDQ EEFAAFTQRG QFYMGCYLLR IARNEWFSWE YRQRLVAEAQ ACLVVR
|
| |