Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2537 |
Symbol | |
ID | 5734415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3242462 |
End bp | 3244876 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279677 |
Product | hypothetical protein |
Protein accession | YP_001545303 |
Protein GI | 159899056 |
COG category | [S] Function unknown |
COG ID | [COG5427] Uncharacterized membrane protein |
TIGRFAM ID | [TIGR03662] Chlor_Arch_YYY domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCTT GGTGGCTAAG CATCGTTTTA ATTGGAACTC TCGCGCTGCC TTTGAGTATG CGCCTATTTG GGCATTTGCC AGGGCGGGGC TTGGCATGGA GCAAAGCGCT AGGCTTGTTG GTTGTTGCAT GGATCGCTTG GATGGGCGCT ATGCTCAATC TTTCGGGCTT CGATGGCGTG ACGGTTGGCT TGGGCTTAAT TGGCTTGGGC ACACTCGGTT GGTTTGTGCA ACAACCTTTT GATAAAGCCC GTTTGTTGGC AGCAATTCGC CAATATTGGC CGCAATGGCT GGCCTATGAG TTGTTGTTTG CTTTGGTTTT TTTGCTGGGC ATTCAATTGC GCTTCCATGG GATGTTTGGC TCAGGGATTC ATGGCACCGA AAAACCAATG GAATCAATGC TGTTTAGCGC TGTGCTGAAT AGTCCAAGCT ATCCACCAAG CGATTTGTGG TTGGCCGGAT TTAGCGTTAA TTATTATTAT TTTGGCTATG TGCTGATCAG CGTATTGAGC GTGCTCAGTG GCGCGACGCT TGGCGAAACG TTTAATCTTG GTTTGGCCAC GATTATGGGC TTGGCCAGCC TTGGGATTGT TGGTTTAGTC ACAACTATGG CGGGTTTGTG GTGGCGTGAG TATTTGCTAT CCCGTCTCCG ATTAATCGCA ATTGCTGCGC TTGGTTTGTT TGGTGGCGTA TTGGTGCTGT TTGCAGGCAA CCAAATTGGC GCTTTACAGA AGATCGTCAA TTCTGCGGAA GTTAATCGAC TAACCGATAG CCAACGGGTT TCAGTCTTGT GGCAAGCAAT TCAGGGGGTT GAGCCAGCCA CGCTTGATCC CGCCACGCTA AAATCAGAAA ATACTGGCAT TTCTAAAAGC TCGACCTTGC CACCAATGGG CGAGAAATTT GAAGCTTGGC CATCGTCGCG TGCAATCTAC GATGATCACG AAGAAACCAT GATCATTGTC GATAATCAGC AACGTATGGG GATGTCTCAG CGTGAGATTA TCACCGAGTT TCCCTTCTTC AGCTTTTATC TTGCCGATAT GCACCCGCAT GTTTTGTCGA TTCCGCTGAC GTTGTTGGCG ATTGCTTTGG CCTTGGCGAT TTTTGTCCGG CCAGCGATGG TCCGTTTCCC AAAGCACGAT TGGCTTGAAT TGGCGATAGC TGGCTTGGTG ATTGGCGGCT TATATGCAGC CAATTCGTGG GATGCCCCGA CCTTTGGTGT GTTGTATGCC TTGGGTTTGG TTGGCTTGTG GCGTGGGCAT ACACCGCAAC CAACCCGCCG CGATTGGCTA CAGCTTGCAG GTCAAGTTGG TTTAGTGGTC TTGGCGGCGG CATTGTTGTA TATGCCGTTC CTGCTCACAT TTAGCTCATT TGCAGGCCGC GATACCGTGC CCGACCCATT TGCCAGTATT CCAATTATTG GCAGCTTGGG CAAAATTATG GCTCCCGCCC GTGATCACTC TGGCTGGACT GATTTAGTGG CGATTTTTGG CTTGTTCTTA GTGCCGATTA TTGCTTGGCT CAGCCGCACG ATCAAGGTTT GGCAATTATG GGCTATGACT GGCGCGGTGC TGCTGATTGG CTTAATTGCC GGCATTCCGG CGATTGTCTT TTTGCCGATT GCCGTGATCT GTTGGCAAAC AGCTTGGCAA CGCAATCAGC GCGATGTGCA AAACTTTAGC TTGATCGTGG TTGGTTTGGC AGCCTTATTG ATTGTAGTTG TCGATTTTCT GTATCTGCGT GACATTTTTG ATAATCGCAT GAACACGGTT TTCAAGGTCT ATTATCAGGC TTGGATGCTG TTGGGAATTG GTGCTGCTGC TAGTATTTGG GGCTTGTTGA GCAATGCCCA ATGGCGACGC TGGACGAATG GCATTTGGTT GCCATTATTT GGGCTTTTGT TGGCTGGCGG CTTAGTCTAC CCAATTTCAG TGCTTAACCC TACAACTTCG CCCTCGTGGG ATGCAAGTGG CTCGAAGCTT GATGCAGTAG AAAGTTCCCA ACACTTTTCC GAGCCAATGC GCAAAGCCGC TGCCTGGCTT GAAGCCAACA CACCAAGCAA CAGCGTTTTG GCGACTGCAC CTGGCAGCAG CTATCAAGAT GGCGGTGAGT TAGCAACCTT GAGTGGTCGG CCAACCTTAT TAGCTTGGCC CGGCTCGCAT GAAGGTTTAT GGCGCAGCAA ACAGCCTGAT GCAAATCAGC AAGTGGCGCA ACGCCAAGGC GATATCAGCG CAATTTACAA TGCCACCGAT ATCAATCAAC TGCGCGAAGT TTTGGCTCGC CAGCGGGTCG ATTATGTGGT GTGGGGGCCA AACGAGCAAA AAGCCTATCC ACAGGCCAAT ATTGGCTTGC TCGAACAGGT TGCCAGCAAA GTTTACGAAG CCGATAGCTG GATCATCTAT CAAGTACAAC CATAG
|
Protein sequence | MIAWWLSIVL IGTLALPLSM RLFGHLPGRG LAWSKALGLL VVAWIAWMGA MLNLSGFDGV TVGLGLIGLG TLGWFVQQPF DKARLLAAIR QYWPQWLAYE LLFALVFLLG IQLRFHGMFG SGIHGTEKPM ESMLFSAVLN SPSYPPSDLW LAGFSVNYYY FGYVLISVLS VLSGATLGET FNLGLATIMG LASLGIVGLV TTMAGLWWRE YLLSRLRLIA IAALGLFGGV LVLFAGNQIG ALQKIVNSAE VNRLTDSQRV SVLWQAIQGV EPATLDPATL KSENTGISKS STLPPMGEKF EAWPSSRAIY DDHEETMIIV DNQQRMGMSQ REIITEFPFF SFYLADMHPH VLSIPLTLLA IALALAIFVR PAMVRFPKHD WLELAIAGLV IGGLYAANSW DAPTFGVLYA LGLVGLWRGH TPQPTRRDWL QLAGQVGLVV LAAALLYMPF LLTFSSFAGR DTVPDPFASI PIIGSLGKIM APARDHSGWT DLVAIFGLFL VPIIAWLSRT IKVWQLWAMT GAVLLIGLIA GIPAIVFLPI AVICWQTAWQ RNQRDVQNFS LIVVGLAALL IVVVDFLYLR DIFDNRMNTV FKVYYQAWML LGIGAAASIW GLLSNAQWRR WTNGIWLPLF GLLLAGGLVY PISVLNPTTS PSWDASGSKL DAVESSQHFS EPMRKAAAWL EANTPSNSVL ATAPGSSYQD GGELATLSGR PTLLAWPGSH EGLWRSKQPD ANQQVAQRQG DISAIYNATD INQLREVLAR QRVDYVVWGP NEQKAYPQAN IGLLEQVASK VYEADSWIIY QVQP
|
| |