Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1981 |
Symbol | |
ID | 5733870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2435772 |
End bp | 2436806 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641279125 |
Product | extracellular HAF |
Protein accession | YP_001544752 |
Protein GI | 159898505 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02913] probable extracellular repeat, HAF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTCAT TTAAAACTTT TTGGTTGCTG CTGCTCGTCG TTATGGCACT TGGCTGGTTT GTTGTTCAGC GCTCGCCGCT TCAGGCCCAA ACGCCAAATC CCAACTATAG TTATGTCAAT CTTGGGGCAC TAGGCGGGCA ACATATGTAT CCTAGTGATA TTAATGATTT TGGACGAATC GCTGGGAGTG TAGAAACCGA GTTCTCAGCA ATGCGAGCGT TCGTTTGGCG ACGAGGTACG CTCAGCAATC TGGGCACACT CGGCGGCAAT CAAAGCTATG GCTATGGCAT CAATGATACT GGCTATGTTG TCGGTGAAAG CACTACGAGT AATAACAAAC GGCAGGCTTT TTATTGGCGC GAAGAGCAAA TGCTCAATCT TGGCACCCTC GGTGGTAATG TTAGCACAGC GCTTGATGTC AGCAATGGCG AGCGGATCGT TGGCCGAAGC ACGACCAGTA CTGGCGATAC CCATGCATTT ATGTGGTATC GCAATACGAT GACCGATCTT GGTACGCTGG GGGGCAACTA CAGCACCGCC AATGAAATCA ACGATCACAA AGTTATTGTC GGTTGGAGCA CCAATGCCAA CGGTGAAACT CGCGCCTGTA TCTGGAAAAA CGGTACGATT ATCGATCTAG GCATACCTGC GGTTAAAAGT TATGGCTATG CAATCAATAA CAATGAGCAA GTTGTGGGAA TGATGGAATT AAGTGATGGT CAACGCCATG CATTTCTTTG GGAGAATGGC GTAACCACCG ATTTAAGCGC CGGATTGAAT CAATATAGTG GTGCAAATGA TATTAACGAT GCAGGCACAA TCGTTGGGTT TACTGGTGAC GACACAACAC CACTTGCTGC AACGGTTTGG CATAATGGCA CACGTTTGCG GATGGGGCCA TTCAGTCAAG CAAGCACCGA ATATCAAACG ATTGCAACTG CGATTAATGA GGCCAACCAA ATTGCTGGTT ATGCTATCGT GAGCGCTGAT GGCGTTACGC GCACCGACGG AATAATTTGG CAATTTGAAG ATTAA
|
Protein sequence | MRSFKTFWLL LLVVMALGWF VVQRSPLQAQ TPNPNYSYVN LGALGGQHMY PSDINDFGRI AGSVETEFSA MRAFVWRRGT LSNLGTLGGN QSYGYGINDT GYVVGESTTS NNKRQAFYWR EEQMLNLGTL GGNVSTALDV SNGERIVGRS TTSTGDTHAF MWYRNTMTDL GTLGGNYSTA NEINDHKVIV GWSTNANGET RACIWKNGTI IDLGIPAVKS YGYAINNNEQ VVGMMELSDG QRHAFLWENG VTTDLSAGLN QYSGANDIND AGTIVGFTGD DTTPLAATVW HNGTRLRMGP FSQASTEYQT IATAINEANQ IAGYAIVSAD GVTRTDGIIW QFED
|
| |