Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4116 |
Symbol | |
ID | 5735977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5262850 |
End bp | 5264148 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641281270 |
Product | hexapaptide repeat-containing transferase |
Protein accession | YP_001546876 |
Protein GI | 159900629 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGA TTGTTCTCCG CGATCCTACG CTTATCGCTC CCTTTGGAGA ACCAGCGCGT GATTTGCGGA TTCTCAATAA GCCGCTGTGG CTGCTACACC GTGATCTGCT TGCACGCCAC TGCCAAAGTG TTGCGGAAGT AGACGATTGG TCTGAAATTT CACCAAGTAG TGATGAACTC TTGGTGCATA AAGATAATCT CTATTTCAAC CGCGATTTCA TTGAGACGTT TATCGCTGAG GCACGCGCCA CTGGGCAACC TTGCCAAGTG GCTTTTGCTG CTGATGATGC GATGATTACG GCCCATGCTT TGCGATTACA AGAGGGTATT CGCAAGCATG GCAACCATTT TATCGCCGAC CTCTACTATT TCCCTCGCGG CGTTGTCCCG AATCCACAAC CGCTGGTGAT CGATACCAAT GCCATGGAGA TGGGCTACTA CCATATTCCA AGCTATATGG CGCCCAACCA AGGGGATTTG GTATTCCAGG TGCCAATTCG TGCGTTTTGT TCAATCGAAA GCTGGGTCCA TATTTTCATG ACAAACTCTC CGCTCGGGGT GTTTGCATGG GGTCGGAAGC TCGAGCAAGA AGTTGCCGCA AGTTGGCGTT TGAAGTTGAA GATTGGCTTT CGTTCATTCA TCGAACGTAA GCACTTTCTT TCATCATCTC CGGTGGTCAA GATTGGCAAG AACTGCTCAA TCGATCCTTC GGCGATTATT CAAGGGCCAA CTGAGATCGG TAACAACGTG AATATTGGCG CTGGAGTGGT GATTACGAAT AGCTTGATCG GTAATAACGT GACGATTATG CAAGGCTCTC AAGTGATGCT TAGCGTAGTC AGTGATCGTT GTTATTTACC ATTCCGGGCT GCTCTGTTCA TGACTGTCTT GATGGAAAAT TCGATGGTGG CGCAAAATAC CTGTTTGCAG TTATGCGTCG TTGGCCGTAA TACCTTTATC GGGGCTGGCA ATACCTGTAC CGATTTCGAT CTGCTGGGCA AGCCAATCAA GACGCTCCAT CGCGGGCGCT TGGAAGAAGT TGGTCTGCCA GTTATTGGCT CGGCAATTGG CCATAATTGT AAAATTGGCT CAGGCTTTGT CATTTACCCA GCCCGTAATA TTGAATCAGG CACGGTCTTG ATTTATGGCG ATGACCATTC GGTTATTCCT AAAAATGTTT CGAGTGGTAT TTATACGCGC CCACCAGTTT TCTATCCCGA CCGCGATCCG CGCGTTCAAC GTGTTCCAGT TAACGATCGC GTCGCAGATG AGTACCCAGC AGAACAATTC CGTGATTAA
|
Protein sequence | MKRIVLRDPT LIAPFGEPAR DLRILNKPLW LLHRDLLARH CQSVAEVDDW SEISPSSDEL LVHKDNLYFN RDFIETFIAE ARATGQPCQV AFAADDAMIT AHALRLQEGI RKHGNHFIAD LYYFPRGVVP NPQPLVIDTN AMEMGYYHIP SYMAPNQGDL VFQVPIRAFC SIESWVHIFM TNSPLGVFAW GRKLEQEVAA SWRLKLKIGF RSFIERKHFL SSSPVVKIGK NCSIDPSAII QGPTEIGNNV NIGAGVVITN SLIGNNVTIM QGSQVMLSVV SDRCYLPFRA ALFMTVLMEN SMVAQNTCLQ LCVVGRNTFI GAGNTCTDFD LLGKPIKTLH RGRLEEVGLP VIGSAIGHNC KIGSGFVIYP ARNIESGTVL IYGDDHSVIP KNVSSGIYTR PPVFYPDRDP RVQRVPVNDR VADEYPAEQF RD
|
| |