Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0836 |
Symbol | |
ID | 5732737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 946148 |
End bp | 947308 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277968 |
Product | hypothetical protein |
Protein accession | YP_001543612 |
Protein GI | 159897365 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00104213 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCGCT TCAGCGATGT CACCAACATT TGGTCAACCA TAAAAGAAAT TGATGTGCGC GATATTCGCG ATCAGGCCGA TTTGCCCTGT CGGATTGCGC TACTTGGCCA TGCGACCTTC GGGCGCGATT TGATTATGCG CTTGTTGACG CTTGGCGCTC AACGTTTTCC CGCCCGTACT CCGCAAGTCA GCATCATCGA TTTACCCTTG GGCCGCGAAC AGGCTACCGA TCTCAACCGC GTCGATTTGA TTGTACTCAC GCTCGATGCT AGCCAAGCCT TGAGTTACGA TGAGTTTCTG GCCTACGAAA AATTGGCGAT TCTGCCAGTG CCACTGTTGA TCGCCGTTTG GGGTACGAAT CTCCCTAAAA GCTCCGAAAG CACCCACCAA GCTGATCTGC AAGCATCGCC AGCGGTGTTG CTCGACCCAC AAGCCGAGCC GGCAACCCAG CGCAAAATGC TGGCCAAAGC GGTGCTCGAA CTCGTACCAG AAGCGTTGCA TATTGCGGCT GCTCGGCGCT ACCCAGGCTT GCGCAGCGAA GTTACCAATA ATTTGATTAG CAGCGTTTCG CTGAGCAATG CGACTTTTGC TTTTACCTCG GGCATTCCCG AGATGATTCC CGTGCTGAAT TTGCCATTGA ACGCCGCCGA TATGCTGGTA TTGACCAAGA ATCAGGCGCT GTTGGCCTAT CGCGTGGCCT TGGCTATGGG TGCTGAAGGC GATTTTAGTG CCATGATTCG TGAATTGCTG CCCGTGGTTG GCGGTGGTTT CCTCTGGCGA CAACTGGCAC GCCAATTGGT CGGTCTGATT CCAGGCATTG GCTTATTGCC CAAAGTTGCG GTGGCCTATG CAGGCACGTT TGTGACTGGG ATTGCGGCAT GGCGCTGGTA TGAACGTGGC GAGTTGGTCA GCAAAGCCGA ATTACAAAGC CTCGTCAAAG CAGCCTTAGA AGAAGGCCGA CAACGAGCCA AGGCGCTGAT TGGTAATCGT AAGGCTGACG ATGATCCCAC TGCATCAGCC AAACCCAGTT TTCGCCAACG GATCGGCGCA GTGCTGAACC CAAAAAACTG GTTCAAGGCG CTGCGTGCCC GCCTGCGACG CAAACCCAAA TCGATCCAAA AAACAACTGA GCCAACCGAT CAATCCAACT CTGCTGCTTA A
|
Protein sequence | MSRFSDVTNI WSTIKEIDVR DIRDQADLPC RIALLGHATF GRDLIMRLLT LGAQRFPART PQVSIIDLPL GREQATDLNR VDLIVLTLDA SQALSYDEFL AYEKLAILPV PLLIAVWGTN LPKSSESTHQ ADLQASPAVL LDPQAEPATQ RKMLAKAVLE LVPEALHIAA ARRYPGLRSE VTNNLISSVS LSNATFAFTS GIPEMIPVLN LPLNAADMLV LTKNQALLAY RVALAMGAEG DFSAMIRELL PVVGGGFLWR QLARQLVGLI PGIGLLPKVA VAYAGTFVTG IAAWRWYERG ELVSKAELQS LVKAALEEGR QRAKALIGNR KADDDPTASA KPSFRQRIGA VLNPKNWFKA LRARLRRKPK SIQKTTEPTD QSNSAA
|
| |