Gene Information |
Locus tag | Haur_0997 |
Symbol | |
ID | 5732900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1140463 |
End bp | 1141755 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278131 |
Product | hypothetical protein |
Protein accession | YP_001543773 |
Protein GI | 159897526 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03440] conserved hypothetical protein TIGR03440 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0142804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCGT CGCTTTCAGC GATGCCTGTG CAAGCCCGCC AAGCTCGTTT GATTGAACAC TATCAGACAG TGCGCCAATT CTCGGAGTAT CTTTGTGAGC CGCTTGTAAC CGAAGATTAT GTGATTCAGT CGATGCCTGA TGTTAGCCCA ACCAAGTGGC ATCTTGCCCA TACCAGTTGG TTTTTTGAAA CGTTTGTACT GACCCAAGCC GTGCCCAACT ACCAAACCTT ACACCCTCAA TATGCTTACC TCTTTAATTC GTATTATGTA ACGCTCGGCA AACGCCATTG TCGCCCCAAA CGTGGCTTGA TTTCGCGGCC AACCGTCGAG GAAACCTACC GCTACCGTGC CTATGTTGAT CAGCAGATGT TGGCCTTGTT GATTGCGATG GATGCCGAAA CCTTTGCCCG TTGGGAGCCA ATTCTCGATT TGGGGATTCA TCACGAGCAG CAGCATCAAG AATTAATGTT GACTGACCTC AAGCATGTTT TTTCAGAAAA CCCCTTGCGC CCAGCCTACC GTGAATTTGC CCCAAGCAAC CAACAGCCTG CCGCACCATT GCGTTGGCTC AGCTATCCCG AAGGCATTGT TTGGCTTGGC TATGAAGGTC AAAGTTTTGC CTTCGATAAT GAATCGCCGC GTCATCGCCA ATTTGTACAC AGTTTCAAAT TGGCCTCACG GCTTGTCACC AACGGTGAAT ATTTGGCGTT TATCGAAGAT GCTGGCTATG CCCGCCAAGA TTTGTGGCTC TCATCAGGCT GGTACACCCG CGAAGATGCT GGCTGGACTG CGCCGCTGTA TTGGGAGCAA CTCGATGGGG TGTGGCAGCA GATGACGCTT GGTGGTTTGC GACCGCTTGA TCTAGCCGAG CCTGTTTGTC ATCTCAGCTA TTATGAAGCC GATGCTTTTG CCCGTTGGGC GGGCAAGCGC TTGCCCACCG AAGCCGAATG GGAGTTGGCA GCCCAAACCG TGCCGCTCGA TGGATCATAC GCTGATGCAG GCCGCTATCA CCCAACAGCG CTCAATCTCG ATAATAGCAC TTTGCCCCAA CAAATGTTTG GCGAAGTTTG GCAATGGACC CAAAGTGCCT ACTCGCCATA TCCTGGGTTT CAGCCAGCAG CAGGCGCACT AGGCGAATAC AATGGCAAAT TTATGTCGGG GCAATATGTT TTGCGCGGCG CTTCATGTGC CACCTCGCGC TCCCATGCCC GCCTGACCTA TCGCAATTTC TTCCCACCCG ATGCCCGTTG GCAATTTAGT GGCTTGCGCT TGGCAGCAGA TGGTGAAGCA TGA
|
Protein sequence | MDSSLSAMPV QARQARLIEH YQTVRQFSEY LCEPLVTEDY VIQSMPDVSP TKWHLAHTSW FFETFVLTQA VPNYQTLHPQ YAYLFNSYYV TLGKRHCRPK RGLISRPTVE ETYRYRAYVD QQMLALLIAM DAETFARWEP ILDLGIHHEQ QHQELMLTDL KHVFSENPLR PAYREFAPSN QQPAAPLRWL SYPEGIVWLG YEGQSFAFDN ESPRHRQFVH SFKLASRLVT NGEYLAFIED AGYARQDLWL SSGWYTREDA GWTAPLYWEQ LDGVWQQMTL GGLRPLDLAE PVCHLSYYEA DAFARWAGKR LPTEAEWELA AQTVPLDGSY ADAGRYHPTA LNLDNSTLPQ QMFGEVWQWT QSAYSPYPGF QPAAGALGEY NGKFMSGQYV LRGASCATSR SHARLTYRNF FPPDARWQFS GLRLAADGEA
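The length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, that the CDS is unspliced, and that the gene sequence ends with a stop codon (here TGA) that is not counted in the protein length.

```python
# Hypothetical helpers for checking the record's length fields.
# Assumption: coordinates are 1-based and inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring the spaces between 10-base groups."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


length = gene_length(1140463, 1141755)
print(length)                  # 1293, matching "Gene Length"
print(protein_length(length))  # 430, matching "Protein Length"
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the rounded GC content listed in the record.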