Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0788 |
Symbol | |
ID | 5732672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 889415 |
End bp | 891352 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641277918 |
Product | LVIVD repeat-containing protein |
Protein accession | YP_001543564 |
Protein GI | 159897317 |
COG category | [S] Function unknown |
COG ID | [COG5276] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0658316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTCGTC GTTTTTTTCG TTGTAGTTTG CTGCTGATGC TGGCCGTTGG AGCAATCGCT AGCCAGCATG CCACCCCGCT GGCCGCCCAA AGCCCTCAGT CAAGCCCAAG CCAATTGGCA ATTCGTGGTC AACTTGGCGG TGATAGCTCA ACCATGGTCG TTGCCGAACA AGCTGCTTAT GTTGGCATCG GTCCACGGGT CGTAACCTAC GATTTGCACA ATCCCAATCA GCCAAATCTA GCCAACCAAA GCAGCCTTTT ACCCGCCACC GTGCAAGATT TGGCCTTAGG CGAGGGCTAT TTGTACGCCG CCTTGGGCTA TGGCCGTGAA CATGGCTTGG CTACATTTTC CTTAGCCACT CCATTAACAC CAACGCTTGT AAGTTTTTAC CCGCTTGAAT TATCGGCCAT GAGTGTTTTC AGCTATCAAG ATTATGTCTA TGTGCAAGTT GATGATGGCT TGCAGGTGTT CGATTTGAGC AATCCTAGCC AACCACAATT GGTCAATCAA CTTAATCAAT ATGGCATTGC CGCCGCAACG GTTCAAGGCG ATCATGTGTA TCTCAATATT CAATCGTGGC TTTCGGGCTT CGATAGCTTG AATATCGTCG ATTTGAGCAC ACCCATGACT CCAACGCTCA AAGGTTCGAT CGTGCTTGGC GATATTTTAG ATATCGAAAT TGCCGCTAAT TATGCGTATG TCAATACTCA AATGGGCCTG CGGGTGGTTG ATATCAGCAA TGGCAGCAAC CCAAACATTA TCAACCAAAC CGATGATGTT GCTGGATTAC AACTAACCGC AAGTGGCAAC ACACTCACAA TGATTGGGCT AGATTATGCC TTACACACGT GGGATATTAC CAACCCAATC ACGCCGACCG AAATTGCTGC CAGCTCCGAA ACGGTGCTTG GCTGGCCAAC AGCCTTAATT GCCCAACAAA ACCTTGGCTA TGTGCTGACT CGTTCAGGCC AAATTAATGT CTTCGATTGG AGCACTCGCC AAGCTCCCAC GCGCATCAGT GCGACTGAAA CCCCTGGCTG CAAGGAAAAT GTAGCCTTGC AAGTGGTTGG CGATTATGCC TATGTGGCCG ATTGGGGCCG TGGTTTGTGC ATTGTTGATG TGCGCAATCC CCAACAACCA AGCGTGATTG GCCGCTTCGA TTTAACCACC GATCGCAACG CCGTCACCGA TATTGCCGTC CAAGGCTCGT TGGTATATAT GGTCGATTGG AGCCACGGAA TTTATGTGAT CGATGTCAGC GACCCAACCG AACCAACTCA AGTCAGTTTC TTTGCAACCG CTGGCTTTCC AGCAGCAATT GCGGTCAATG GCCCGATGGT CTATGTTGGC GAATCGATCA ACGATCAAGG CGTTGGTGGC AGTGTACGGA TTTTCGATCT CAGCGATTTG GCCAATCCAG TTGAGCATAG TCAGTTCTAC ACCAGCAAAG TCGCCCAATT AGCCGTGATC GATCAAACCG TGTATGTGGC CGACAACGAA GGCGGCCTGC GGATTCTCGA TGTTAGCAAT CCTGATAACA TTCAGATGAT CGGCCAATAT GCTGAAAATT GGTTCGCCAT GGATTTAGCG ATTCAGGATG AGTATGTGTA TTTGGCGACG ACTAGCGGCT TAGAAATTGT CGATATCAGC AACCCCAGCC AACCAATCCC AGCAGGCCGC CTTGGCGATC GCTTGATTGA TGGCATCGTT GTGGCGGGCG ATAATGCGTA TATTGGTGAA AATGGTAGCA TTCGCCAAAT CGATATCAGC CAACCGAGCA ACCCTCAAGT TGTAGCCGAA ACCCTGCTAG CTCGCGATTC ATGGATTCTA CCAGCGCTGC ATGGTGGAGA GATTGTAGGC TTGAGCTATC ATGGTGGTGG CATGTTCGTA CTTGAGCCAG AATATCAATT ATTTTTGCCT GTAATCAGCA AAAACTAA
|
Protein sequence | MVRRFFRCSL LLMLAVGAIA SQHATPLAAQ SPQSSPSQLA IRGQLGGDSS TMVVAEQAAY VGIGPRVVTY DLHNPNQPNL ANQSSLLPAT VQDLALGEGY LYAALGYGRE HGLATFSLAT PLTPTLVSFY PLELSAMSVF SYQDYVYVQV DDGLQVFDLS NPSQPQLVNQ LNQYGIAAAT VQGDHVYLNI QSWLSGFDSL NIVDLSTPMT PTLKGSIVLG DILDIEIAAN YAYVNTQMGL RVVDISNGSN PNIINQTDDV AGLQLTASGN TLTMIGLDYA LHTWDITNPI TPTEIAASSE TVLGWPTALI AQQNLGYVLT RSGQINVFDW STRQAPTRIS ATETPGCKEN VALQVVGDYA YVADWGRGLC IVDVRNPQQP SVIGRFDLTT DRNAVTDIAV QGSLVYMVDW SHGIYVIDVS DPTEPTQVSF FATAGFPAAI AVNGPMVYVG ESINDQGVGG SVRIFDLSDL ANPVEHSQFY TSKVAQLAVI DQTVYVADNE GGLRILDVSN PDNIQMIGQY AENWFAMDLA IQDEYVYLAT TSGLEIVDIS NPSQPIPAGR LGDRLIDGIV VAGDNAYIGE NGSIRQIDIS QPSNPQVVAE TLLARDSWIL PALHGGEIVG LSYHGGGMFV LEPEYQLFLP VISKN
|
| |