Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0161 |
Symbol | |
ID | 5732070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 190394 |
End bp | 195169 |
Gene Length | 4776 bp |
Protein Length | 1591 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277285 |
Product | hypothetical protein |
Protein accession | YP_001542941 |
Protein GI | 159896694 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.330881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTGC TCGAAAAACT AGCACGATGG TGGTGGCGTT GGCTTCTAAC AACAATTAGT TTGTTGGGCA CGGCGCTGCT TGGCGTAATC GTGGCGAGTA TCGCTGGCTG GGTTCCGCAA CCAAGTGTTT TGCGCTTAAC GCCGTTGGCT GGTAGCCGCG AATTGCCGCC AGCCACGGCC TTGGAATTTA ATTTCAATCT GCCGATGGAT CGTTCGAGCG TCGAGGATGC TTTGGTGATT ACGCCGCCGG TGCGTGGACG TTGGCATTGG GAAGGCCGCG CCCGTGCCCG TTGGCAGCCT GAATTTGGCT GGACACCAGG CACAACCGTG ACGGTGCAAT TACGCCCAAC CGCCCAATCG ATGTTGCGCC AAACCCTGGC AACAACCGTG ACAACCCAAT TTACCGCCGC TCCTGCCCCG CTGTTAGTGT TTCGCTCACC GCTACCCAAT GCGATTATTG CCCCCAATAC CCCGATTTTG TTGCGTTTCA ATCGGGCGAT GATCGATTAT GCCAGCCAAA CTGAGCGGGG CTTAGCTGAA TTAAGCATCG AGCCAAATGC TGCTAGCCAA GTGCGTTGGC TCGATGATCG TAGCGTTTTG GTGCAGGCGC AATGGCAGAT TGGTCAGCAC TATCAACTCA AGCTGCAAGA TCTCAACGAT CTGCTGGGAA TTCCGGTTCA GACCACCGAA TGGCAAGTGC AGGTGAGCGA GCCAAACCTG ATTGCTCCGC CAACGACCCA AACCCTTCAG GCTGATCAAG CCTTGGAATG GGCCTTTGAG GGTTTGCTTG ATCAAACCAC AACCCAACAA TTGATCAAAC GAATTCAATT TACTCCCGCT GTGAGCACAA CCTGGCAGGT GCAATATCAG CCTGATCCCC AACCACAAAC CATCATTCAA GCCTTGCCTG AGGCAGCTTG GCCAGCTCAA ACGCTGACTA CCAGCCTACA ACTCAGCCAG CCAATCACCC AAGTTTGGCA AATTCAACCC AACTTGCAAT TAATTGGCTC AGTGCCAGGC CGTGGTGGCA GTTTAGCCGT TGATGATCCG CTGCGCTTGC TGTTCAACCA AACGCCTGAG CGCCAGCAAC TTGTCGAGCA ACTATTTATC GAGCCTGAAG TTGCGAATGT GGCGATTAAC ATCAACAATC AGCAGGCCTT GATTAGTGCT GCATGGCAAC CAAGCACAGT TTACACGCTG ACCTTGGCTG GCTCTGCCCC GCTGACCTTT CGCACCCAAA CCCAAGCGCA GCCACTGCAA ATTGAAGGTG CGGCTTATTC GTTGTTGTTG CCGGAAACTA CCGCCGAATT GCGGTTGAGT GGGCGCGAAA ATAGCCGACT CACGGCGAAT CTGTATGCGG TTGATCGGGC GGTTTTGGCG GCGGCTTTGC AACAACCGCA AACCCCGCTT AATCCTCAGC GTTATAATTT GCAACCGCAA CAGCAATGGC AAATTGCCGC TGGCCCTGAA CGAACCATTA GCATTACCCC AACTCAAGCG GCGTTATTAC TGCAAGTGCA AGCTCCCAAT GCCGAACTTG TGCAACATGT GTTGATTTGG ACTCCCTATC GAGCGCAATT GTTGGCGACA CCTGAGCAAG CCCAAGCGTG GATTGTTGAT CTGGTCAATC GCCAACCAGT CGCCGCCGCA AATTTGAGCA TTTTGGCGGG TGCAAATCAA CTTGCCACAG GCCAGAGCGA TTCGCAAGGG CTTTGGCAAA GCTCAATTCA GGGCAGTAGC GGGCGTTTGG TCTTGCTTGG CGGCGAGCCG CAAGCGCCGA TTGTGGCTGA AACGATTATT CGACCACAAC GCCAAGCACC AAGTTTGAGT AGCCAATTGC TGCTTGACCG CCAGAGCTAC CTAGCCAACC AACAACTGAC GATTATCGGC AGCAGCGAGT TTGCTGCGGA TGGGATGACT ACGCCAACCC TAACCTTGGC AGTGCTTGAT CCTAGTGGCC AAGCGATTGC GCCCGAACAA ACCTTGGTGC TAAGCCAAAG TCTCTGGATA ACCAAGGTGC AACTGGCCGC CGATGTTCAA CCTGGTTTGT ATGAAGTGCG GCTGCGCTAT GCCAACCAAG TGATTGCGAG CCAAAATTTT GTGGTCAATC AGCCCGATTT GGCCTATCGC GTAGTTTTAC CCGAATTGCA AGCGACCAAT GCGATCGTGT CAGCTGAAAT CATCAGCGAT TTACCGAGCC AACATGGCCT GTGGCGCTTG CTTGATCAGC GTAATCGCCA AGTGAATGCT GGCAGTTGGC ATAGCAACGC CGCTGGAATT GCCCAGTTCG ATTATCAACA GAGCCAAACG TTGTTGGGCG ATTATGTGCT CGAAATTCAG CTTGGTGAGC AAGTTAGCTA TCATCCCAGC CAAATTCACT ATCCTGCGCA ATTAGCTGCC AAGGCTGAGC AACAGTTGCT CAATCCCAAT CAACCAGCCA CGATCAACTT CCAGCTGCGC GATCAACAGG GTATGGCTTT GAGCAATCGT GCTTTGCAAA TCGAAGTTCA AGGCGCAATT AGCAGCACCC AAAGTTTGCG TACCAATGCT GCTGGCGAAG TTGTTTGGCG CACCAATGGT TTGCGGGCGG GCCTGTGGCT GATTCGTGCC AGTAGCCCAG ATGCGGCCAG CGTGGAAGTG CCTTTGTGGG TGTTGGGGGC CAAGAGCGAT CAATTGATTG GTCGGAGCCA AAGTAGCTTG GTCGAAACAA CTGAACTCGC GCTTGCCCCG CTGACCAATC TGGCGGCTAC TAATGTGTTG GTTAGTTGGT TTGAAGGTAG CAGCATTCGC CAGCAGATTC AGCCTTGGGT TGCGGGCCAG AGCATCGCGA TTACGCCCAG TCTGAGCAAT ACTCAGCAAT TAGAGGTAGC TTTGGCTTGG CTTGATGCTG AGCAGTTAAT CAGTAGCAGC AGCCAAATTC CCTTCCAAGC AGTTGCGCCA CTCGATTTGA CGATGGTTGT GAGCGGTTCG CAATTGGTGC TGGCAACCCA ACGCGCTGGT CGTCCTGTTG CCAGCATGGT TGGCTTGAGT TTTCAGTCAA TCCAAACGTC AGCTAGCCAA CAACGGCTTG TGCGCACAGC CAGCAATGGC CAAGCAGTGC TCGATTTACA AGGATTTGGC GAGGGTTGGC GGATTCAGGC CTTAGCCAGT GATGGCTTGA ATAGCGCCGA AGCCACGACG TTGTATGCAG CAAATCCCTT GATCGATGCC CAAATGCTGT TGCCCGAACG CTTGATTGTG GGCGATCAAC TGAGCGTGAC CGCTCGCTTG AATTTGCTAG CGACTCGCGG TGGGATTCCT AGCGTTAGCC TGAGCGTCGA TTCGGCTTAT CTTGCGGCCT TGGCTGAGCC AACCACGGGC AATAGTGGCC GAATTTATAC CACAACTCAA GCCTTGGTTG CCGCCAACCT TGGTACGACC ACGCTCACCG CAACCATTGA AGTTAATCAG CAAACCCAAA TTATCACCAA AGCCCTGACG ATTCAGCCAC CAAGCCAAGT GCGTTTGCAA CAAAGCCGTT TGTTGACCGA AGCGACCGAT ATTACTTGGC AAGTGCCGAC GGCCAACCAA GCAGCCTCAA TTAATATTGT GATTGCTAGT TTAGCCCAAG CCAGCCAAGC CTTGCCTAGC CAATTGCCGG GCGATGATCC CCTGACGTTG GCTGGGCGAA TTGCGCTATC CCGCTATGCA GCGCAGCCTG CGCAGGCCGA GATTGAATTA TTGGCCAAGC AGCAACTTAC CAATGGCAGC TGGAGTTGGG ATGCGCAAGC GGGCGATTTG CAATTGACTG CCTTGATTGT GCAACTGATT GCGACCGAGC AACGTTCGCC CCAACTGCGC CAACTCGAAG ATCAAGCCGC TCGTTGGTTA GCCTTGGTTA GTCCACTTGA TCCCGATTTG CGCAGCGAAA TGCTGTTGGC CTTGGCGCAC AGCGGCAAAA CAGTTGAAGC TCAAATCACC AGCTTGCTCA AAAATCGCCA ATTAACCACT GCCCAACGCT TGCGCTTGAT CTATGCCTTG GTGCTGCTCG ATTCGCCTCA AGCCGATCAA TATTTGCTTG AAGTGAGCCA GATGTTGGCT CAACCCCTAA CGCCAAGTGC TTGGATGAGT CGCACCCGCC AAGCGGCGCT GATCGGCTTG ATTTTTGAGC GCAGCCAACC AAGTTCAGCC TTACGTGCCC AAGTGCTCAA ACTGATCGCC GAGGGCTGGA ATGGCACGAC ATGGGAAGAT TTGCCAAGTA GCGTAGCGGT ATTTCAGCTA ACCCAACAAG ATTTTCCAAC CCGTGGGGAT TATCGTTTGG GCTTGAATGG CGCAGCCTTG AGCGATTATG ATCAAGCTAG CAGCCTAATG TTGCCAATTA CCAACCAACT TAACGTGCAG ATCGAGCCGA ATGGCCCAAT CTTGTTGGCG AGTCAAGCTA GCTGGCAAAA CCCTGCTACC CAAGCCTACT TGCTCCATTC CAGCAGCGAA GATCAATTGG CTTATGGTGA GCAATGGTTG TGGCAAGGCT ATTTGGTGCT TGAACAAGAT CTGTCATTGT TACAACTACG TTTGGCTGCG CCGAGTGGTG TGGAGTGGCA GCTTGGTTCA GCTACAGGCT TGAATTGGCA TGATGGTCTG TTCAGTGCCG CCAACGTTCG GGCTGGCATT TACCCCATTC AACTATTGGG CGTAGCTCGC CACAAGGGCC AATTTGCTTG GCCAGCGCCC CAATTAACCA GTGGCGGCCA GATCGTTGAG CTTAAACAAA CCGCGCCGCC AGTGCTAATT CGCTGA
|
Protein sequence | MRLLEKLARW WWRWLLTTIS LLGTALLGVI VASIAGWVPQ PSVLRLTPLA GSRELPPATA LEFNFNLPMD RSSVEDALVI TPPVRGRWHW EGRARARWQP EFGWTPGTTV TVQLRPTAQS MLRQTLATTV TTQFTAAPAP LLVFRSPLPN AIIAPNTPIL LRFNRAMIDY ASQTERGLAE LSIEPNAASQ VRWLDDRSVL VQAQWQIGQH YQLKLQDLND LLGIPVQTTE WQVQVSEPNL IAPPTTQTLQ ADQALEWAFE GLLDQTTTQQ LIKRIQFTPA VSTTWQVQYQ PDPQPQTIIQ ALPEAAWPAQ TLTTSLQLSQ PITQVWQIQP NLQLIGSVPG RGGSLAVDDP LRLLFNQTPE RQQLVEQLFI EPEVANVAIN INNQQALISA AWQPSTVYTL TLAGSAPLTF RTQTQAQPLQ IEGAAYSLLL PETTAELRLS GRENSRLTAN LYAVDRAVLA AALQQPQTPL NPQRYNLQPQ QQWQIAAGPE RTISITPTQA ALLLQVQAPN AELVQHVLIW TPYRAQLLAT PEQAQAWIVD LVNRQPVAAA NLSILAGANQ LATGQSDSQG LWQSSIQGSS GRLVLLGGEP QAPIVAETII RPQRQAPSLS SQLLLDRQSY LANQQLTIIG SSEFAADGMT TPTLTLAVLD PSGQAIAPEQ TLVLSQSLWI TKVQLAADVQ PGLYEVRLRY ANQVIASQNF VVNQPDLAYR VVLPELQATN AIVSAEIISD LPSQHGLWRL LDQRNRQVNA GSWHSNAAGI AQFDYQQSQT LLGDYVLEIQ LGEQVSYHPS QIHYPAQLAA KAEQQLLNPN QPATINFQLR DQQGMALSNR ALQIEVQGAI SSTQSLRTNA AGEVVWRTNG LRAGLWLIRA SSPDAASVEV PLWVLGAKSD QLIGRSQSSL VETTELALAP LTNLAATNVL VSWFEGSSIR QQIQPWVAGQ SIAITPSLSN TQQLEVALAW LDAEQLISSS SQIPFQAVAP LDLTMVVSGS QLVLATQRAG RPVASMVGLS FQSIQTSASQ QRLVRTASNG QAVLDLQGFG EGWRIQALAS DGLNSAEATT LYAANPLIDA QMLLPERLIV GDQLSVTARL NLLATRGGIP SVSLSVDSAY LAALAEPTTG NSGRIYTTTQ ALVAANLGTT TLTATIEVNQ QTQIITKALT IQPPSQVRLQ QSRLLTEATD ITWQVPTANQ AASINIVIAS LAQASQALPS QLPGDDPLTL AGRIALSRYA AQPAQAEIEL LAKQQLTNGS WSWDAQAGDL QLTALIVQLI ATEQRSPQLR QLEDQAARWL ALVSPLDPDL RSEMLLALAH SGKTVEAQIT SLLKNRQLTT AQRLRLIYAL VLLDSPQADQ YLLEVSQMLA QPLTPSAWMS RTRQAALIGL IFERSQPSSA LRAQVLKLIA EGWNGTTWED LPSSVAVFQL TQQDFPTRGD YRLGLNGAAL SDYDQASSLM LPITNQLNVQ IEPNGPILLA SQASWQNPAT QAYLLHSSSE DQLAYGEQWL WQGYLVLEQD LSLLQLRLAA PSGVEWQLGS ATGLNWHDGL FSAANVRAGI YPIQLLGVAR HKGQFAWPAP QLTSGGQIVE LKQTAPPVLI R
|
| |