Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2363 |
Symbol | |
ID | 4709230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2592968 |
End bp | 2594083 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639856838 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001003928 |
Protein GI | 121999141 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.629772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAGTCC TGTACCTGTT CGACACCACC GACCGCGCCG AGAGCGAGAG CGTCATCGAG ATGGCCAGCC ACGGCGTGGT CCCGACCATC GTCTGCCAGC CGGATGCGCC CATGCGCGGC CGCTTCGAGG CGGCCGGGCT GGAGGTCATC CCGGTCGCCA TGCGCAGCAA GGCGGACCGG GCAGCCATCG CCGCGCTGCG CCGGCTGCGC CGGGAACGGA CCTTCGACCT GGTGCACGCC TACTACAAGA TCGCCCTGAC CAACTACAAC CTGGCGGCGG TCGGGCTGCC CCGGGTGCCG GTGGTGGCCT ACCGGGGGAT CATCGGCAAT CTGAGCTACT GGGACCCGTT CTCCTGGCTC TCCTTCCTCG ACCCGCGCAT CGAGCGCATC GTCTGCGTCT GCGAGGCCAT CCGCCGGTAC TTCCTGGACA AACCCTTCCT GCCCGGGACC CGGCTGTTCC GTCCGGAGCG GGTGGTGACC ATCCACAAGG GCCACCGCCC GGCGTGGTAC CAGCAGCCCG ACGCCCGCCT GCCCGCCGAC CTGGCCATCC CGCAGGGCGC ACCGGTCATC GGCTGTGTGG CCCGGATGAA GAAGCGCAAG GGCATCGTCG AGCTGATCCG CGCCTTCGAG CAGATCCCCG CCGAGCACAA CGCCCACCTG GTGCTGATCG GCCCCATCGA GTACCCCGCC ATCGAGCAGG CGGCGGCCCA CAGCCCGGCG GCGGATCGCA TCCGCATCAC CGGCTACCGG GCCGATGCGC CCAAGATCGC CGGCGCCTTC GACATCGCCA CCCTGCCCTC CCTGCGGCGC GAGGGCCTGC CGCGGGCGAT CATCGAGGCC ATGGCGCAGG GCATCCCGGC GGTGGTCTCG GACTCCGGCG GCAACCCGGA GCTGGTCGAG GACGGCGTCA GCGGCCGGGT CACCCCGGCG GGCGATGTGG ACGCGCTCGC CGCAGCCCTG CGCGAACTGG TCGCCGACCC GGCGCTGCGG GGTCGTCTGG GGGCCGCCGC CCACGAGCGC ATCGCCACCC GCTTGACCGT CGAGCGCACG GCGCGGGAGA CGCTGGCCCT TTACGCCGGC GTGCTCGGCC AACGCTCGGG CGAGCCGGCG GCCTGA
|
Protein sequence | MRVLYLFDTT DRAESESVIE MASHGVVPTI VCQPDAPMRG RFEAAGLEVI PVAMRSKADR AAIAALRRLR RERTFDLVHA YYKIALTNYN LAAVGLPRVP VVAYRGIIGN LSYWDPFSWL SFLDPRIERI VCVCEAIRRY FLDKPFLPGT RLFRPERVVT IHKGHRPAWY QQPDARLPAD LAIPQGAPVI GCVARMKKRK GIVELIRAFE QIPAEHNAHL VLIGPIEYPA IEQAAAHSPA ADRIRITGYR ADAPKIAGAF DIATLPSLRR EGLPRAIIEA MAQGIPAVVS DSGGNPELVE DGVSGRVTPA GDVDALAAAL RELVADPALR GRLGAAAHER IATRLTVERT ARETLALYAG VLGQRSGEPA A
|
| |