Gene Information |
Locus tag | Haur_0935 |
Symbol | |
ID | 5732821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1067836 |
End bp | 1070808 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278067 |
Product | kelch repeat-containing protein |
Protein accession | YP_001543711 |
Protein GI | 159897464 |
COG category | [S] Function unknown |
COG ID | [COG3055] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.35348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCTTTT TCGCTCGTTC AAGGGATTTG TTGCCCCGCT TGCTGAGCGG GTTGCTGCTG CTGACCTTGA TCATTACGGG GTTGTTGGTG CAGTCGCAGG CCCAAGCTCA AACTGCTCAA CCACAGGCTG TTGCCACTAG TGCCACAACG ACGATGTTTA GTTGGCAAGA TGGCGCAACC GTGCCGCAAC CACGTTTTGA ATCGCAAGGG GTCTTTGTCA ATGGCAAGTT GTATGTAATT GGTGGCTTTA TTAGTTGTTG TACCCAAATT AATGCTACCG ATTTGGTCGA TGTTTATGAT TTGGCCAGTA ATACGTGGCA ACGGATCGCC AGTATTCCCG AGGCGATTAG CCATGCCCCA GTTGTTGCCG ATGGCACCAA TATCTATGTG CTCGGCGGGT ATTTGGGCAA TAACCCAGGT GGTAGCACCA ACCATGTGTG GAAGTTGAAT ACAGTTACTA ATACATGGAC GCGTGGTATC GACTTGCCCG TTGCTCGCGG TGGCGCTGGT GCAGCGATTG TTAATCGTAA AATTTATTTC TTCGGTGGGG CTGTGCGTAC CGCTGGTGTG TTTGATGATA CTGACTTCGG CGATCATTAT ATGCTCGATT TAAGCCTCGC TAACCCAACT TGGGTTTCAC GCGCTGCCAT GCCCAACCCA CGTAACCATA CTGCTGCTGG GGTGGTCGAT GGCAAAATTT ACGCCGTTGG TGGCCAGCAT GGCAAAGCCG AAGAATCGGC CAATCAGGCC GAGGTTGATC GCTATGATCC CGCAACCAAT ACGTGGACGC GGGTTGCCGA TATGCCCATT CCCAAAGGCC ATACTTCTTC ATCAACCTTT GGCTATCGTG GCCGTTTATT GGTGATTGGT GGCTCGATCA ATGGTGGTAC CAGCGGCCTT GCTTCGGCTG ATGTGCTGAT GTACGACCCC AAAAGCGATG TTTGGATGAA GTTGGTTTCG TTGCCAGCCT ATCGTAAAAC CCCAGTTGCC GATGTTTGGA ATAACAAACT TGTGGTGACG ACTGGCGGCG GTTATGGCCA AACGGATACC ACTTGGATTG GCAATTTGCC CGATTCATGG GAATCTAACG GCACCATGCC CGTACCCTTG GGCGAGGTTG CCAGTGGGAT TATCGGCAAT AAAATGTATG TGGTTGGTCA AGGCAATGCT GCAACCTTGC GCTACAACTT GACGACTGGC AGCTATAGCT CAACCACCAC TTTGGCTCAA CGGGCCTTGC GTGGCAATCA CCATGCCGCC GAAGTAATAA ACGGTAAATT CTACTTATTT GGTGGCCTCG ATAATAACAG TGATGGCAAA GTCCAAATCT ATGATCCTGT GTCCAACACC TGGGCCATGG GCGCGGATAT GCCGTTTGCC GCTGGGGCTA GCTCCTCAGC CGTGATCAAT GGCTATGCCT ATGTTGCTGG CGGGATTATC AATGGCAGTA CAACCACCCG TGTAGCCCGC TATGATCCAG TCGCGAATAC TTGGACTGAA GTTGCCGCCA TGCCGCGCGG GCGCAATCAC GCTGCTTCGG CGACTGATGG TAGCAAATTG TATGTCTTTG GTGGGCGTGG GCCAGGTAGT GGCGATGGCA ATAATGTTGC CAATGGCTTT GATACTGTCC AAATTTACGA TCCTGTTGCT AATTCATGGC GCTCAAGCAT GACCGATACA ACGATTGCGC CGTTGCCGCA AGCCCGTGGT GGGATGGGCA AAGCGGTTTA TTATGACGGC GAGTTCTATA TTTTTGGCGG TGAAACGCTT GATGGTGCAG GCGCGGTCAC TGGCAATGTG 
TATGATCGGG TTGATATTTA TAACCCAATT CTTAACCGCT GGCGCACTGG TTCGCCTATG CCGACCGCCC GCCATGGCAT CTTCCCTGTG ATTAACGGCG GACGAATTTA TATTGCTGGG GGTGGCACGG TTTCGGGCTT TAGCCAAACC AACATCACCG AAATTTATAA TCCAGGGATT GCTGCCGACG AAAATGCCCA CCTTGAAAGC AATGGTCAAG TTGGCTTCGA TTCCGAGCAA GTCCACGGCA ATACCGCGCG TGGTCAGTGG TCGTGGCAAA CTCGCAATGA TGTTGATCGG CCTGGCTTCA GCGGCAACAA CTACTTGGCG GTCTTGCCCA ATACTGGCAC CCTTTCGGAT ACCAATTACA CCACGATGGC TCCACAACTG GATTATCGCG TGAAGTTCAC CACGCCTGGT ACCTATTATG TGTGGCTGCG TGGCTGGGCC GATAGTGGCG GCGATAATTC GGTGCACCTT GGCCTGAATG GTCAGCAATC AAGCTCATCG GATAAGATGA CCTTGCCAAT TTTCAGCGCT TGGCGTTGGT TCCGCGATAC AACCGATGGT GTGCCTTCAA CTATCACTAT TCCAAGTGCT GGAGTATACA CGCTGAATTT GTGGATGCGT GAGGATGGTT TCCGTGTTGA TCGCTTGTAT CTGACGACTG ATGTTAATTT TGTGCCAAGC GATGCCCAAT TAACCCCAAT TCCGACGACT GCACCAACCG CGACGGTGAG CAATCCGACC GCGACCAATA CCGCGACGGC AACCAATACA CCAACCAACA CACCAACCAC CGTGCCGCCA ACGGCGACCG ATACCGCTAC GGCGACCGAG GTTCCAACTA ATACGCCAAC CGCCACGACG GTTCCGCCAA CGGCAACCGA TACTCCGACT TCAACCAATA CGCCTGAACC GTCAACCACC GAGCCAGCGA CCAACACGCC GACTCCGACG ATTACGGTGA CGGCAACCAA TACAACCACG GCTGTGCCGA CAACAGCAGT GCCGACAACA GCATTGCCAA CCACGGCTGT GCCAACGGTA ACTGAGACGC AAACTCCGAC CGCAACTGTC ACAGTAGGCA CGATTACGCC AAGTGAAACG CCAACCGCGA CAGTCCCGAC CAGCGAAACA CATCAAGTCT ATCTACCTTG GGCTTCTAAA TAA
|
Protein sequence | MVFFARSRDL LPRLLSGLLL LTLIITGLLV QSQAQAQTAQ PQAVATSATT TMFSWQDGAT VPQPRFESQG VFVNGKLYVI GGFISCCTQI NATDLVDVYD LASNTWQRIA SIPEAISHAP VVADGTNIYV LGGYLGNNPG GSTNHVWKLN TVTNTWTRGI DLPVARGGAG AAIVNRKIYF FGGAVRTAGV FDDTDFGDHY MLDLSLANPT WVSRAAMPNP RNHTAAGVVD GKIYAVGGQH GKAEESANQA EVDRYDPATN TWTRVADMPI PKGHTSSSTF GYRGRLLVIG GSINGGTSGL ASADVLMYDP KSDVWMKLVS LPAYRKTPVA DVWNNKLVVT TGGGYGQTDT TWIGNLPDSW ESNGTMPVPL GEVASGIIGN KMYVVGQGNA ATLRYNLTTG SYSSTTTLAQ RALRGNHHAA EVINGKFYLF GGLDNNSDGK VQIYDPVSNT WAMGADMPFA AGASSSAVIN GYAYVAGGII NGSTTTRVAR YDPVANTWTE VAAMPRGRNH AASATDGSKL YVFGGRGPGS GDGNNVANGF DTVQIYDPVA NSWRSSMTDT TIAPLPQARG GMGKAVYYDG EFYIFGGETL DGAGAVTGNV YDRVDIYNPI LNRWRTGSPM PTARHGIFPV INGGRIYIAG GGTVSGFSQT NITEIYNPGI AADENAHLES NGQVGFDSEQ VHGNTARGQW SWQTRNDVDR PGFSGNNYLA VLPNTGTLSD TNYTTMAPQL DYRVKFTTPG TYYVWLRGWA DSGGDNSVHL GLNGQQSSSS DKMTLPIFSA WRWFRDTTDG VPSTITIPSA GVYTLNLWMR EDGFRVDRLY LTTDVNFVPS DAQLTPIPTT APTATVSNPT ATNTATATNT PTNTPTTVPP TATDTATATE VPTNTPTATT VPPTATDTPT STNTPEPSTT EPATNTPTPT ITVTATNTTT AVPTTAVPTT ALPTTAVPTV TETQTPTATV TVGTITPSET PTATVPTSET HQVYLPWASK
|
| |
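The Start bp, End bp, Gene Length and Protein Length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence above ends in TAA) while the protein length does not. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Haur_0935.
length = gene_length_bp(1067836, 1070808)
print(length)                     # 2973
print(protein_length_aa(length))  # 990
```

Both results agree with the Gene Length (2973 bp) and Protein Length (990 aa) listed in the record.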