Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3431 |
Symbol | |
ID | 5735292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4320066 |
End bp | 4321205 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280578 |
Product | glycosyl transferase family protein |
Protein accession | YP_001546195 |
Protein GI | 159899948 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.749704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCTT TACTACGTTC GACAATGGTT CGAGGTTTGG CGCTGGCGTT GGCTGGTTTT ACCCGCCCAG CACCAGCCCA ACCTCAACGA ATTTTGGTGA TTAAGCCCGA TCATTTGGGC GATGTTTTAC TGTTGACCCC AGCGCTACGG GCCTTGCGCT ATAGCCAACC GCACGCCCAG ATCAGCGTTT TGGTTGGTTC ATGGGCTACC CGCCTCTTGG CCGACAATCC TGATCTTGAT GCGATTGAAA CCTGTGAGTT TCCTGGTTTT GTGCGCGGTG CGCAACCCTC GACCTTGGCT CCTTATCGCT TGCTTTGGCG CGAAGCTGCC CGTTTACGAA GCATGAATTT TGATACAGCC TTGATTGCCC GTGATGATCA TTGGTGGGGC GGCTTGTTGG CGCTTGGGGC TGGCTGTGGC CGCCGAATTG GTTTTGCCCA TCCCTTGGTT GCGCCAACCT TAACCAAAGC TCTAGCGTGG AATAGCAATG AGCATGTTAC CAAGCAAGCC TTGGATTTGG TGGCAGCGCT CGGTTCAGAT CAACCACAAA CCCAAACGTT GCGGTTTATG CCAAGTGCCG CTGAGCATGC TTGGGCTGAG GCTTGGCTGG CGCAGCATCA GATTCAAAAA CCATTGGTGG CAATTCAGGC TGGCAGTGGC GGCGCAGCTA AATTATGGCC GGCTGAGCGT TGGGCACAGG TCGCTGAACA ATTGGCAAAT CAAGCCCAAA TTGTATTAAC TGGTGGGCCA GCCGATGCTG TTGATGTGGC AGCAATCAGC CAACAGTTGC AAATTCCCCA TTTGAATGCA GTTGGTCAGG CCAATTTGGG CCAATTGGCG GCCTTGTTTG GGCGTTGCGC TTTGGTGTTG GGCGTGGATA ACGGCCCTTT GCATTTGGCC GTGAGTCAAT CAACCCCAAC CATTCATCTG TTTGGGCCAG GCGATAAGCG GCGTTTTGGG CCTTGGGGCG ACCCAACCCG CCACGTTGTG ATCGATGCCG AATTAGCCTG CTCGCCATGT GGTGTGTTGA CCCATTGCCC ACGCCAAACC AAACCCAGCG AGTGCATGAC CGCAATTTCC GTCCAGCACG TGATCGGTCA CGCCAAACGT CTGCTTGATC AGGCTGGAAC CTCAATTTAG
|
Protein sequence | MKALLRSTMV RGLALALAGF TRPAPAQPQR ILVIKPDHLG DVLLLTPALR ALRYSQPHAQ ISVLVGSWAT RLLADNPDLD AIETCEFPGF VRGAQPSTLA PYRLLWREAA RLRSMNFDTA LIARDDHWWG GLLALGAGCG RRIGFAHPLV APTLTKALAW NSNEHVTKQA LDLVAALGSD QPQTQTLRFM PSAAEHAWAE AWLAQHQIQK PLVAIQAGSG GAAKLWPAER WAQVAEQLAN QAQIVLTGGP ADAVDVAAIS QQLQIPHLNA VGQANLGQLA ALFGRCALVL GVDNGPLHLA VSQSTPTIHL FGPGDKRRFG PWGDPTRHVV IDAELACSPC GVLTHCPRQT KPSECMTAIS VQHVIGHAKR LLDQAGTSI
|
| |