Gene Information |
Locus tag | Haur_3525 |
Symbol | |
ID | 5735386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4437693 |
End bp | 4438667 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280672 |
Product | hypothetical protein |
Protein accession | YP_001546289 |
Protein GI | 159900042 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis |
TIGRFAM ID | |
|
|
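The coordinate and length fields above are consistent with one another. The short check below is only a sketch of that arithmetic: it assumes the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4437693
end_bp = 4438667

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the length should be a multiple of 3.
assert gene_length_bp % 3 == 0
codon_count = gene_length_bp // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result matches the listed Gene Length (975 bp) and Protein Length (324 aa).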
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCAAC GCTTGCGCGA AGTGCTCGAA TATTTGGCCA AGCAATCGTT GGTCGAGCCA ACCCGTTGGC AGCATTGGCA GTTAACTCCA ATTGCTGGTG GCGCGAACAA TCGGGTCTAT CGGGCGCAAT CCACCAATGC CGATTATGCA ATTAAGTGGA CGATTCGTGA TGAGCGTCGC CGCGCATGGC GTGAATATCA ATCACTCCAG ATAGTAGCTG AGTTTAATCA TTTGCTGGCT CCGCAGGCAG TTTGGCTTGA TGAAGCCAAT TTTGCCCAGC CAGTTGTCGT GCAAACATGG CTCGATTTTC CAGCGTTGAC GAGCATTCCC CAGGATTCGG CTCAATGGCA AGGCTTAATT GCTCACTACT GGGCGCTTCG CCAAATAACC CAAACGTCAG CCACATTTGA ACTCCCCAAA GCAACCCTCA ATATGACCAG TGTTGCTGAT GGCAAGGCTT TGATCGAACA ACATTGCGCC AAAATTCCGA CTGATCAATT GCCGATTACG CTCGAAAATC TATTAATCTG GCTAGAAACT TGGGCTGTAC CCGAGTTGCC TAGCCCGCCA ATGAGTTTGT GTCGGGTTGA TAGCAATTGG CGTAATTTTT TGGTAACACC CACAGGTTTT GTCTCAGTCG ATTGGGAAAA TGCAGGCTGG GGCGATCCAA ACTTTGATAT TGTTGATCTG ATGACCCATC CGGCCTATGC TGAGGTGCCG ACTGAGCATT GGTCGTGGTT TGTGCAGGCC TATTGTGGCT TTGGCGACGA CCCACGGGCT GTCCAGCGAA TTGAGATTTA TCGCACGTTG ATGCTGATTT GGTGGGTTGT GCGTTGGCAG CGATACCTGT ATGAAGTGCC ACGTGGCTTG GATGAACGGT TGGTACAACG CCCGCCAGCT TGGTTGATTA CGGCTCAAGC CAATTATCAA CGCTATTTAA ATTTAGCCCA ACAGGCTATC AATCAATGGA GGTGA
|
Protein sequence | MDQRLREVLE YLAKQSLVEP TRWQHWQLTP IAGGANNRVY RAQSTNADYA IKWTIRDERR RAWREYQSLQ IVAEFNHLLA PQAVWLDEAN FAQPVVVQTW LDFPALTSIP QDSAQWQGLI AHYWALRQIT QTSATFELPK ATLNMTSVAD GKALIEQHCA KIPTDQLPIT LENLLIWLET WAVPELPSPP MSLCRVDSNW RNFLVTPTGF VSVDWENAGW GDPNFDIVDL MTHPAYAEVP TEHWSWFVQA YCGFGDDPRA VQRIEIYRTL MLIWWVVRWQ RYLYEVPRGL DERLVQRPPA WLITAQANYQ RYLNLAQQAI NQWR
|
| |
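The gene sequence above is listed in the coding (5' to 3') direction even though the gene lies on the minus strand, so it can be translated directly without reverse-complementing. The following is a minimal sketch of that translation, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons); the helper name `translate` is hypothetical, and only the first and last few codons of the sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G
# (third position varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First three 10-base blocks of the gene sequence.
print(translate("ATGGATCAAC GCTTGCGCGA AGTGCTCGAA"))  # MDQRLREVLE

# Last 15 bases of the gene sequence, which are in frame and end with TGA.
print(translate("AATCAATGGA GGTGA"))  # NQWR
```

Both fragments agree with the start (MDQRLREVLE) and end (NQWR) of the listed protein sequence.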