Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3576 |
Symbol | |
ID | 5735437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4496235 |
End bp | 4497482 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280725 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546340 |
Protein GI | 159900093 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCACAAC CAATCGCTTA CATGATGTCA CGGTTTCCGC ATCTGTCGGA AACCTTTATT CTGCGTGAAA TGCTTGAAAT GGAACGGCTT GGCTGGGATG TTAAGCTGTT TCCGTTGATG TTGCAGCAAC AAGCGGTCGT CCACGATGAG GCCAAGCGCT GGATTCCGCT GGCTCAACCG CAGCCATTTA TCTCGCCGCA TGTTGCCCTG AGTTCGTTAC AGCGTTTGGC CAAACAACCT CTCAAAACAG GTGGCATCGT TGCTAAAACG CTAGCCGAAA GTGCCACAAC TCCTAGCGAA TTAGTCCGTA AATTGGCATT AATTCCTAAA GCAATCACCT TGGCTAAGCA TATGCAGGCC CAGCAGATTG CCCATATCCA TTGCCATTAC GCCACCTATC CTGCCTTTGC TGCGTGGATT ATCAATCGCC TAACGGGTAT TCCCTATTCT TTCACCGTCC ATGGCCACGA TATTTTTGTC AATCGGGCGA TGCTGCCGAC CAAAGCCCGT GGAGCCAGCG CCATTATGGC AATCGCCGAG TTTCACCGCG AATTTTTGGT TGAAAAATTA GGCGAATGGG TACGCGATCG GATTCATATT GTGCATTGTG GCATTCGCCC TGAGCGCTAT CAACCGCAAC CCAAGCCAGC TGGCGAACGC TTTGAAATTT TGACCACCGG CAGTTTGCAA GATTACAAAG GCCACCCCTT CTTGATTCAG GCCTGTAGTT TCTTGCGTGA TCGTGGGATC GCCTTTCGTT GCCGCATTAT TGGTGGCGGC GAAGATCGCC CAATGCTTGA GCAATTGATT GCCGAAAAAC AATTAACTGG GATGGTCGAA TTGCTTGGCC CGCAGCCAGA AACCGCCGTG CGCGAGCTGC TTGCTACCGC TGATTGCTAT GTTCAAGCTA GCATTATCAC GCCCTCAGGC AAAATGGAAG GCATTCCGGT TTCGTTGATG GAAGCCTTGG CCTGTGAGTT GCCAGTCGTG GCCTCGCGCA TGTCGGGCAT TCCTGAATTG GTGCGCCACG GCGAAACAGG CTATCTTGTG CCCCCCGCCA ATGCCGCAGC CCTAGCCGAA CAACTGCTGT ATGTACGCGA TCATCCTGAG GAAGCCGCTA GTTACGCTGC TCGTGGCCGC GAACTGGTCC AACGCGACTT CAACTTAGTA ACTTGCGTCG AAAAACTTTC TGAAACGTTA CTCGCCGTGA ATCCCGCTTT GCAACGGCCA ATTCAACGCA ATGCCTAA
|
Protein sequence | MAQPIAYMMS RFPHLSETFI LREMLEMERL GWDVKLFPLM LQQQAVVHDE AKRWIPLAQP QPFISPHVAL SSLQRLAKQP LKTGGIVAKT LAESATTPSE LVRKLALIPK AITLAKHMQA QQIAHIHCHY ATYPAFAAWI INRLTGIPYS FTVHGHDIFV NRAMLPTKAR GASAIMAIAE FHREFLVEKL GEWVRDRIHI VHCGIRPERY QPQPKPAGER FEILTTGSLQ DYKGHPFLIQ ACSFLRDRGI AFRCRIIGGG EDRPMLEQLI AEKQLTGMVE LLGPQPETAV RELLATADCY VQASIITPSG KMEGIPVSLM EALACELPVV ASRMSGIPEL VRHGETGYLV PPANAAALAE QLLYVRDHPE EAASYAARGR ELVQRDFNLV TCVEKLSETL LAVNPALQRP IQRNA
|
| |