Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0730 |
Symbol | |
ID | 5732616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 839405 |
End bp | 840961 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277860 |
Product | TPR repeat-containing protein |
Protein accession | YP_001543506 |
Protein GI | 159897259 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.444597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACCA ATGTTGACAC TTTGAATGTA ATGCCGCTGG ACCTCGATAC CTTTGCTGGG AGCGAGCGTT TTATGGCTGG GACGCGCTTG GGTGCTGCGT TTGGGCGCGG GATCAAAGCC TATTTGGCCG CCAATTATCC TGATGCAATC GAGCATTTCA AAACTGCATT AATCGCTGCC TATGTTGAAG GCGATGAGAA ATCGCAAATC TACGATCGCG AGCGGGCGAT TATCTATCTG TATATTGGCA ATGCCTGTGC TTTTCAAGAT GATTGGCCGA CGGCTCAGCG TGAGTATTTG GAATCGGTAC AAACCGACCC GCAATTGGCC GAAGCCCACT ACAATTTGGG TGTTGCTTTC GGCGCGATGC GCCAAATTGA TCGGGCAATT GGCGCATTCA AGGAAGCGCT TGAACATAAT AATGGCTTGT ACGAGGCCCA TTTTGCACTT GGCCGTGCTT ATCAAATGCT CGATGATGCT GGCCGCGCCT ATATTCACTT TACTTCAGCC CGTGAATTGC GGCCTTACGC TGGCGAGCCT TTGTATTACA TGGGTTTGAT GCACCAAGCC CATGGTGCTC ACGAATTGGC ACAGCGCTGT TTTGCCGAGG CGCTGCGAGT TGAGCCAACC TTTATCTCGC CTAGCGTTGG CCCCGATGAA GTGCTGGTTA GCAAATCGGA GCAAGAGGTT GCCAATTGGT ACTATCGTTT GAGTGACGAC CTTAAATCCC AAGGCTATGA TGAAGATGCC AAGCGGATTT ACGAGGCTTT GCTGAAATGG CGACCAACCG AGCATCGTGC CCGCTATTTG TTGGGCAATT TGTTGGCGCG GGCGCGACGT TGGGAATTGG CTTTGCAAGA ATATGGCCGA ATTCCACCGC AAGATCGCTA CTATGTTGAT GCGCGACTGC GGATCAGTGC AATTTTGCGT TTTCAGGGCA AGCATCGCGA GGCCTATAAC CATTTGTATA GTACCGCCAA GTTGCGGCCC AACGATGGTC AAGTGTTCTT GCAAATGGGC AAATTGCTCT ATGATTTAGA GAAACATCAT GCTGCGATGC GTGCTTTTGA ACGCGCGGTC CAACTTTTGC CCAAAGACCC TAATGCCTAC TATTTGTTAG GGTTTATGTA CACGGTGTTG GGCCATGAAA GTTGGGCCTT GGCGGCTTGG CGCAAAGCAG TGGAGTTAGC GCCGCATGCC CATTCATTGC GCTACGATTT GGGCTATATG TACACCCGCC GTCGCCGCTA CGATTTGGCT TCGCGCGAAT TTAGCCATGT ACTGATGCAT TGGCCCGATG ATATTGAAAC CACCTTTATG CTGGGCACAT GTTATAAAGA AATGTTAGAA CCAGCCCAAG CCATACCGTT ATTTGAAAAA GTGCTGCGCC GCAACCCACG TCATACCCAA GCACTCTACT ATTTGGGGGC ATGCTATCTG CAAGTTGGCA ACAGTTCTTT GGGTAAGGCC TACCTGCGCC GCTACGAACA TCTGATCAAC CAACAGGAAC AATTACCACA AAAGGGCCGC GCAATGGCTC GTGGAGGTTC GCGATGA
|
Protein sequence | MTTNVDTLNV MPLDLDTFAG SERFMAGTRL GAAFGRGIKA YLAANYPDAI EHFKTALIAA YVEGDEKSQI YDRERAIIYL YIGNACAFQD DWPTAQREYL ESVQTDPQLA EAHYNLGVAF GAMRQIDRAI GAFKEALEHN NGLYEAHFAL GRAYQMLDDA GRAYIHFTSA RELRPYAGEP LYYMGLMHQA HGAHELAQRC FAEALRVEPT FISPSVGPDE VLVSKSEQEV ANWYYRLSDD LKSQGYDEDA KRIYEALLKW RPTEHRARYL LGNLLARARR WELALQEYGR IPPQDRYYVD ARLRISAILR FQGKHREAYN HLYSTAKLRP NDGQVFLQMG KLLYDLEKHH AAMRAFERAV QLLPKDPNAY YLLGFMYTVL GHESWALAAW RKAVELAPHA HSLRYDLGYM YTRRRRYDLA SREFSHVLMH WPDDIETTFM LGTCYKEMLE PAQAIPLFEK VLRRNPRHTQ ALYYLGACYL QVGNSSLGKA YLRRYEHLIN QQEQLPQKGR AMARGGSR
|
| |