Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5142 |
Symbol | |
ID | 5737100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 193484 |
End bp | 195073 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641282307 |
Product | alpha beta-propellor repeat-containing integrin |
Protein accession | YP_001547898 |
Protein GI | 159901652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0225115 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGTCC ACTTCATCAA AAAACGCAAT CCCATGGTCC TTGTTGTGGT CTTTATTGGT ATCTGGTTGT TGACGATGAT GCCCCCAACA CGTCATACTG CTGCGATACC GCATGGTCTT ACCACCCATG ATTGGCAGAT CCTCACATCG TTCTTTCCCC CAACTCAGCA AGCATTTTTT AAGGCATCCA ATACCGGCCC ACAAGATTTT TTTGGATCTC GTGTTGCTGT AGACGATGAT ACCGTCGTCA TAACGGCCCC GCAAGAAGAT AGTAGTACAA CCGGGGTCAA TAGTACACCC AATGAGGAGG CATCGGAATC GGGGGCAGCC TATGTCTTTG TTCGGACGAA TGGGCTTTGG ACACAACAAG CCTATCTCAA ACCCTCCAAT ACGAGTCCCG GTGATGCGTT TGGAGAAAGT GTCGCCATTG ATCAGGATAC CATTGTTATT GGTGCCTTTA AGGAAGATAG TAGTACGACC GGGGTCAATA GTATGCCCAA TGAGGAGGCA TTAAACGCAG GCGCAGCCTA TGTCTTTGTT CGGACGAATG GGCTTTGGAC GCAACAAGCC TACCTCAAGG CATCGAATAC GGATGCGGAT GATGCGTTCG GGACAGCCGT AAGTGTTAGC CAGGATAGTA TCGTTGTCAG TGCGATCAAT GAAGATAGCA GTACGACCGG GGTCAATAGT ACGCCCAATG AGGAGGCATT AAACGCAGGC GCAGCCTATG TCTTTGTTCG GACGAATGAG CTTTGGACGC AACAAGCCTA CTTTAAAGCA TCGAATACGG GTGCGGATGA CATGTTCGGA CGAAGCGTTG CCCTTTATGC CACAACCCTC GTTGTAGGTG CCTATCTTGA GGATAGTAAT ACCCGCGGTG TCAACAATCC GCCCAATGAA GCAGCACCTG ATGCGGGTGC AGCCTATGTC TTTGAACGGG TCAATGGTCA TTGGGTGCAG CAGGCCTATC TCAAGGCATC AAATCCAGGG GAAACTGATC GCTTCGGCAT TAGTGTTGCG CTTGAGAGCA GCACGATTGT GGTAGGGGCG TATCTGGAAG ATAGCAGTAC AGCGGGTGGG CAGAGCGACC CAAATGAGCA AGCACCCGAT GCGGGTGCAG CCTATGTCTT TGTCCGAATC ATGGATACAT GGTATCCGCA AGCCTATCTT AAAGCGTCAA ACATCGATGC AGGAGACCGT TTTGGAGCCA GTGTCGCCGT CCATGGCGAT TTACTGCTGA TTGGAGCGTA CTTGGAAGCA AGCAGTAGTA GTGGCATCGA CAGTATCCCA AATAATGATG CACCCGGAGC CGGAGCCGCC TATCTTTTTC TACGAACCAA TCATAGATGG ACACAACAAT CCTACTTAAA AGCATCGAAT ACCGGACTGA ATGATACATT TGGCATTCGT GGTGCGCTAT ATCAAGGGAC AATCGTGATC GGTGCATATC AGGAAGATAG TAGTACGGTT GGGGTGAATC CTCCTTCTAA TGAGGAGGCT TCAGATTCCG GTGCAGCCTA TGTCTACACG ACGACTATGC TTGTGCCGAA TCCCCATCGT GCATATCTGC CATGGGCTGG ACGGGAATAG
|
Protein sequence | MHVHFIKKRN PMVLVVVFIG IWLLTMMPPT RHTAAIPHGL TTHDWQILTS FFPPTQQAFF KASNTGPQDF FGSRVAVDDD TVVITAPQED SSTTGVNSTP NEEASESGAA YVFVRTNGLW TQQAYLKPSN TSPGDAFGES VAIDQDTIVI GAFKEDSSTT GVNSMPNEEA LNAGAAYVFV RTNGLWTQQA YLKASNTDAD DAFGTAVSVS QDSIVVSAIN EDSSTTGVNS TPNEEALNAG AAYVFVRTNE LWTQQAYFKA SNTGADDMFG RSVALYATTL VVGAYLEDSN TRGVNNPPNE AAPDAGAAYV FERVNGHWVQ QAYLKASNPG ETDRFGISVA LESSTIVVGA YLEDSSTAGG QSDPNEQAPD AGAAYVFVRI MDTWYPQAYL KASNIDAGDR FGASVAVHGD LLLIGAYLEA SSSSGIDSIP NNDAPGAGAA YLFLRTNHRW TQQSYLKASN TGLNDTFGIR GALYQGTIVI GAYQEDSSTV GVNPPSNEEA SDSGAAYVYT TTMLVPNPHR AYLPWAGRE
|
| |