Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4492 |
Symbol | |
ID | 5736343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5752959 |
End bp | 5754158 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281655 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001547252 |
Protein GI | 159901005 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0132524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGAA TATTGTTGCT AACAGCCGAA TATTTGCCCC AACCTGGGGG CGTTGGCGAT TACACTGCCA AATTAGCCGA AGCCTTAACC GCGCTCGACT GTCAGGTTTG TGTACTCACT GCTGGCGAAG GTGCTGAACA AAGCGAGCCA TGGCTGGTTT GGCGCAGGGT GCGCGGCTGG GGCCGTAAGC TGCATCAGGA TGTGCGCAAC GCTGCCAAGC AATTCGAGGC CGATATTGTG CATATTCAAT ATCAAACTGG CGCATACGAA ATGAAGCCTG CGGTCAATTT GCTACCTGCG GCGCTCTCGG TGCCAAGTGT GGTAACCCTG CACGATTTGC GTATGCCCTA TTTGGCTCCC AAAGTTGCGC CGTTGCGCCG CTATGTTACC CGTTTGCTAA TCGAAAATGC TCATGCCGTG GTGGTTACCA ATGCCGAGGA TGAATCGCGC TTGGCTGGCG ATGCACCCAG CAGCAATCCC GATATTTATA CCCTGACCCA GCCATTGGAG CCACCAGCCC ACTTAATTCC AATTGGCGCG AATATCGAGG TTGCGCCATT AGCCGACCGC CAGCAGCTAC GCCAGCAGTT GGGAGCCAAC TCCGAAACGC TGTTATTGGG CTATTTTGGC TTGCTCAACA GCACCAAAGG CGTGCACACC ATCGTTGAAT CGTTGCAATA TTTGCCAGCA ACTACCCGCC TCGTGATTAT CGGCGGCGGC ACAGGCACGC CCGAAGATGA AACCTATGCC GAACAATTAC GCGCCACAAT TAGCCGTCTC GGGCTTGATC AGCGTATCCA TTGGACGGGC TACCTGAGTG CCAGCGAGGT TTCCCGTACC ATGCAAGCGC TCGATTGCGC GGCCCTGCCA TTTAGCGATG GTGCATCGTA TCGTCGGGGT AGTTTGTTGG CGATGCTAGC CCATGGCGTG CCCACAATCA CCACCCCGCC GCATGTGCCA ATCGATCCTT CACTGCGCCA TGAACGTGAT GTGCTATTAG TCGAGCCAGA TGATGCCATT AATTTGGCTT TGGCAGTTGA GCAAATCGCC GCCAACCCCG AATTACGCCA ACAATTGAGC CAAGCAGGCC AGCGCATCGC CTGCTCATTT CGCTGGGAAA CCATCGCTCA ATTGCATACG ACGCTCTATC AACAACTGCT CGATGCACAA CAAAACAAGC AAGAGCAGAT CGATTTTTAG
|
Protein sequence | MMRILLLTAE YLPQPGGVGD YTAKLAEALT ALDCQVCVLT AGEGAEQSEP WLVWRRVRGW GRKLHQDVRN AAKQFEADIV HIQYQTGAYE MKPAVNLLPA ALSVPSVVTL HDLRMPYLAP KVAPLRRYVT RLLIENAHAV VVTNAEDESR LAGDAPSSNP DIYTLTQPLE PPAHLIPIGA NIEVAPLADR QQLRQQLGAN SETLLLGYFG LLNSTKGVHT IVESLQYLPA TTRLVIIGGG TGTPEDETYA EQLRATISRL GLDQRIHWTG YLSASEVSRT MQALDCAALP FSDGASYRRG SLLAMLAHGV PTITTPPHVP IDPSLRHERD VLLVEPDDAI NLALAVEQIA ANPELRQQLS QAGQRIACSF RWETIAQLHT TLYQQLLDAQ QNKQEQIDF
|
| |