Gene Information |
Locus tag | Haur_3712 |
Symbol | |
ID | 5735576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4668653 |
End bp | 4669867 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280864 |
Product | hypothetical protein |
Protein accession | YP_001546476 |
Protein GI | 159900229 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
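The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Haur_3712.
start_bp, end_bp = 4668653, 4669867

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(f"Gene length: {length_bp} bp")
print(f"Protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1215 bp) and Protein Length (404 aa).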
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0151792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAA CACTCTACAT TACCTATGAT GGCTTACTCG ATCCACTTGG TCAAACTCAA ATCTTGCCCT ATTTAGAGGG CTTGGCTGCT CAGCATGGCC ATCAATTTGT GATTCTTTCG TTTGAAAAGG CGGCGCGTTG GGCCGATCTG GCGCGGCGGC AAGCTTTGAT TGAGCGTTTG CAAAGCCACG GCTTGGTTTG GCTGCCATTG CGCTATCATC AACGCCCAAT TCTCCCAGCA ACCTTGTATG ATCTGGCATT GGGCTTGTGG ACGGCTAGCG TGGCAGTGTG GCGCTATCGA ATTCAAGCTT TGCACATTCG CTCGACCGTG CCAATGACGA TTGCCCTCTT GCTCAAACGC TGGTTCAAAC TACCCTTGCT GTTCGATCTG CGCGGTTTTT GGGCCGATGA ACGCGCCGAT TTGGGCATGC CGCGCATAGG CTTGGTCTAT CGTTTGCTCA AAAAACTTGA GCGTGCCAGC CTGCAACAAG CTGAGCAGAT TGTGACCCTA ACCCAGCAAA GTTTGCCCTA TTTGGCCGAA CATAGCCAAA CACCGCACAT CGCCAATAAA ACCACAATTA TTCCATGCTC GGCCAATCTG CAAGTTTGGC AGCGCGATTT GGCCGCTCGA ACCCAAATTC GCCAAGCCCT TGGTTGGCAA ACCAACCCAA TTTTGGTCTA TAGTGGCTCG CTTGGTGGCG GGTATCGTAG CCGCGATATG GCCGAATGTT TTGCCTCGTG GCTGCACGCT GAGCCAGATT TGCGCTGGTT GGTACTCTCC AACCAAGATC CCCAGCCGTT GATCACAGCC TTGCAGGATT TGAATGTCCC CGCAGAATCT TATACAATCC GTGGTGTGGC CAGCAATGAG GTGGCTCAAT GGCTCAGCGT GGCCGATGCA GCACTTTCCT TAATCACGCC CAGCTATGCC AAAATTGCCT CGTCGCCAAC CAAATTTGGC GAATATTTGG CCTGTGGGCT GCCAATTTTT AGTAATGCAG GCATCGGCGA TAGCGATCAA TTATTCGCTC AAGGCGTGGC AATCAAGCTT GATGATTTTA ACGTAGCGGC CTATCAAGCA GCGTGGCAAA CCTATCAAGC CTTGCGCGAA CAGCCCGATT TGGCTCAACG CTGTCGGGCC TTGGCCGAAC GTGAGTTTGA TTTGCAGCTG GCGGTTGAAC GCTACGCCGC CTGTTATCGA GAGTTAGAAG CATGA
|
Protein sequence | MLKTLYITYD GLLDPLGQTQ ILPYLEGLAA QHGHQFVILS FEKAARWADL ARRQALIERL QSHGLVWLPL RYHQRPILPA TLYDLALGLW TASVAVWRYR IQALHIRSTV PMTIALLLKR WFKLPLLFDL RGFWADERAD LGMPRIGLVY RLLKKLERAS LQQAEQIVTL TQQSLPYLAE HSQTPHIANK TTIIPCSANL QVWQRDLAAR TQIRQALGWQ TNPILVYSGS LGGGYRSRDM AECFASWLHA EPDLRWLVLS NQDPQPLITA LQDLNVPAES YTIRGVASNE VAQWLSVADA ALSLITPSYA KIASSPTKFG EYLACGLPIF SNAGIGDSDQ LFAQGVAIKL DDFNVAAYQA AWQTYQALRE QPDLAQRCRA LAEREFDLQL AVERYAACYR ELEA
|
| |