Gene Information |
Locus tag | Hhal_0799 |
Symbol | |
ID | 4711504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 879860 |
End bp | 881257 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639855258 |
Product | O-antigen polymerase |
Protein accession | YP_001002377 |
Protein GI | 121997590 |
COG category | |
COG ID | |
TIGRFAM ID | |
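
The coordinates in this record agree with one another if Start bp and End bp are read as 1-based, inclusive positions: End minus Start plus one gives the gene length, and one third of that, less the stop codon, gives the protein length. The following is a minimal sketch of that arithmetic. The function names are illustrative only and do not belong to any database API, and the inclusive-coordinate convention is an assumption that the numbers below happen to confirm.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Hhal_0799.
length = gene_length(879860, 881257)
print(length)                  # 1398
print(protein_length(length))  # 465
```

Both results match the Gene Length and Protein Length rows, which is a quick sanity check when copying coordinates between tools that may use 0-based or half-open intervals instead.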
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00494175 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTAACA CACACAAAGC GTACTGGCGA TCTATCCTTC CCCCCTTTGA GCCTTGGGCA AATGTCCGAC TCGCGATCGC TGATCGCATT GGCTTCCTTG CCTTGCTCCT CATAGCCTTC ACAGGGCTGT GGATCGGCGA TTTATACCGT CTTGGCCTGG CATTGGTGGT GGTTGCTTTC TTGATAGCCG CCGTGGACCT GTGGCCAAGC ATGAAGCGCA GCGCTCTGTT TTGGGTCGGC GTCGCCTTCG TGCTCTTCGC TACGATACGC CACTGGGTGG CTGTGGTGGA GTTGGGCATC GATATGGTGG ACTCGCCACC CAAGACCTCA CAAATGATCC GCACCTCTCC GCTCCTGGTC GCTTTCGCGG CGATATGGCT CCGTGGGGAC GAACAACGTC TAGTGTGGTT TCTGTCGGCG ACGTTACTCG GTGCAATAGC TTGGCTTATT ACGTCGACTC ATTGGGACGC GTTTATTGAA ATGTTCCGCA ACTGGGACTG GAGCAACGCG CGTCGAGAGC TGTACGAAGG GAGCACCAAC CGCACCCCAT TTATTTATCT AGTTCTCAGC CTGGCTCTTA TTACAATCGG CACTGGCTTT GTCGTACGAT GCACCCATGT GTGGTGGCGT GCTGGCCTTT TCATCGCTTC CCTGACAATG GCCGTTGGAT TCTTGTCGCT TACCTTGACC ATCGAGACGC GTGGTGCCCA AATCGCAGCA TTTGTGGCTT ATGGGTTGCT TGCTGCGGCA ATGGTCAGTA AGGGGCTCCG CCACCTCGGC ATCATTAACC GGGGGACCCG CTGGGCATTA ACGGCCACAG GGGGCGCAAC CATCATTGCT ATCGGAATCG CGCTCTGGAT CACCATCGGC ACTAGTAGTG AGCGACTGCA CAATACGATG GAGGCAGGAA AAGCGTTAGC ACAAAATCCG GACTATCTCC ACGAGCCTTT CGAAGCGGCA CAGGAGCATG ACGCCATTCG AGCCGGGAGC GTAGTCCAGC GCCTCAACCT GGTAAGCTTG GCTGCTGATG CGATCGCGGA ACGACCATTG GTCGGATGGG GAGGAGGCAC TAGTCATAAA TGGGTACGCG AGTACGGACG TACTGATTTT CACAACTGGT ATCTTGATGT AACGGTTGCT TTTGGTCTGA TCGGCGCCGC CCTTTATTTC GGCGGCTTCG TCTATATCCT AGGAAGCAGC ATTAGGGCCC GAATGATTCA TCGACTTGAT CCTCATGTCG CCCTGTTTGC CTTCAGCGCT ACGGCAGCTT GGTTGACAAC ACAGCTATTC ACCACTTGGA TCAGCGCCGC ACAAGGACGT TTTACATTAG TATTTATTGC GACATTGTTG GCGTTCGCTC ACACTGCGCG CTGGCTGCCG GAGCTACGAA AAAAATAA
|
Protein sequence | MFNTHKAYWR SILPPFEPWA NVRLAIADRI GFLALLLIAF TGLWIGDLYR LGLALVVVAF LIAAVDLWPS MKRSALFWVG VAFVLFATIR HWVAVVELGI DMVDSPPKTS QMIRTSPLLV AFAAIWLRGD EQRLVWFLSA TLLGAIAWLI TSTHWDAFIE MFRNWDWSNA RRELYEGSTN RTPFIYLVLS LALITIGTGF VVRCTHVWWR AGLFIASLTM AVGFLSLTLT IETRGAQIAA FVAYGLLAAA MVSKGLRHLG IINRGTRWAL TATGGATIIA IGIALWITIG TSSERLHNTM EAGKALAQNP DYLHEPFEAA QEHDAIRAGS VVQRLNLVSL AADAIAERPL VGWGGGTSHK WVREYGRTDF HNWYLDVTVA FGLIGAALYF GGFVYILGSS IRARMIHRLD PHVALFAFSA TAAWLTTQLF TTWISAAQGR FTLVFIATLL AFAHTARWLP ELRKK
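
The sequences above are printed in blocks of ten characters separated by spaces, so the whitespace has to be removed before computing a length or a GC percentage. The gene also begins with TTG while the protein begins with M, which is expected under translation table 11: TTG is one of its alternative initiation codons and is read as methionine at the start of a CDS. The sketch below illustrates both points on the first two blocks of the gene sequence. The helper names are hypothetical, and the start-codon set is the one listed for NCBI translation table 11.

```python
# Initiation codons listed for NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(seq: str) -> str:
    """Drop the spaces/newlines that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = clean_sequence(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


def has_table11_start(seq: str) -> bool:
    """True if the first codon is a permitted table 11 initiation codon."""
    return clean_sequence(seq)[:3] in TABLE_11_STARTS


# First two blocks of the Hhal_0799 gene sequence, copied from the record.
prefix = "TTGTTTAACA CACACAAAGC"
print(len(clean_sequence(prefix)))  # 20
print(gc_percent(prefix))           # 35.0
print(has_table11_start(prefix))    # True
```

Passing the full Gene sequence field to `gc_percent` should give a figure comparable to the GC content row, though the record does not say how that value was rounded.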