Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0193 |
Symbol | |
ID | 5732039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 225533 |
End bp | 226576 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277317 |
Product | hypothetical protein |
Protein accession | YP_001542973 |
Protein GI | 159896726 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0116004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAAAC GCCTTACGCG GGTACTGGGA ACCATTTTCT GTGTACTTGG CTTGGGTTTG GCGCTATTTG GCTTTGATGG CGCATGGCGT GCCTATGGTC AAACCGCAGG GCCAACCCCA ACCCCCGATT TTGACCTGCC GATTACCAAA GCTGTTAGCC CAAGCAATGC CTTGCCCGGC GATACGGTGA CGTTTTCAAT TAATGTTACC AACGATCAGC CGCAAACCCA AACCAATGTC GTGATTACCG ATAGCGTGGT TAATTTCTTG GAAGTAGTTG GTGCAAGCAG TAGCAAAGGC ACTGCTAGCT TTAGCGGCCA AGAAGTTCGC GCCGATGTTG GTACATTGGC CAGTGGCGAA TCGGTACGTT TGACGATTAC CACGCGGGTA CGGGTTGGCA CAGCGCCTGG CACAATCGGC CAAAATGTGG CGTTTGTCAA TACGGCCAGT GGTTCAAGCT CCTCAAGCAA TGTGGTTACA GTGACGATTG GCGGTGAAGG TACGCCTACG CCCAGCCCAA CACCAGTGCC AGCAGGCTCA AAATTGGTGG TTGAAAAGTC GGCTAGCCCA GCTAGTGGCA AAGTTGGCGA TTTAATTACC TTCAAAATTG TGGTGCGCAA TACTGGTGGC TCAACTGCCC CAAACGTCGT AGTCAACGAC CGTATTCTCG ATTTCTTGGA AGTTGTTAGC GTGCAAACCA GCAAAGGCAG CGCCGCAACC ACAGGCCAAG ACGTGAAGGT TACGGTTGGC GATTTGGCAG CTGGCGAAAG CGTCACCATT GCCATCACCA CCAAGATTCG AGCTGGCACG GTCAGCGGTC AACAAGGTAT CAACATCGCC GAAGCAGTCG CCAGCGATGG CAGCGGTGGT TCATCAAGCA TTCCTAGCAA TCCTGTGGCA ATCGCGGTTG ATCGTAATCC ACCAGCAGGT TTGCCTGATA CTAGCGCCCC CAACCAAGCT TCGTGGATTT TCTGGCTTGG CTTGGGCATG GCGATTACTG GCGGTTTGTT GTTGATGGTT AGTCGCCGCC GGCGCGTTGC TTAA
|
Protein sequence | MHKRLTRVLG TIFCVLGLGL ALFGFDGAWR AYGQTAGPTP TPDFDLPITK AVSPSNALPG DTVTFSINVT NDQPQTQTNV VITDSVVNFL EVVGASSSKG TASFSGQEVR ADVGTLASGE SVRLTITTRV RVGTAPGTIG QNVAFVNTAS GSSSSSNVVT VTIGGEGTPT PSPTPVPAGS KLVVEKSASP ASGKVGDLIT FKIVVRNTGG STAPNVVVND RILDFLEVVS VQTSKGSAAT TGQDVKVTVG DLAAGESVTI AITTKIRAGT VSGQQGINIA EAVASDGSGG SSSIPSNPVA IAVDRNPPAG LPDTSAPNQA SWIFWLGLGM AITGGLLLMV SRRRRVA
|
| |