Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2599 |
Symbol | |
ID | 5734477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3334104 |
End bp | 3337289 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279739 |
Product | hypothetical protein |
Protein accession | YP_001545365 |
Protein GI | 159899118 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0018117 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC CATCGAAAAT AATTGTCCTG ATCACTCTAA TGGCATTGTT GGTTGGAGCG TTACAAAGGC CTGTTGCTCC AGTTATTGCC AGCGAGCCGC AAATTCCAAC TAGCCCAGGC ACATGGAGCC AAGCCTTTGC CGACCCACTG AAACTTGACC CCGGCTATAT GCCGCTGCGC CCAATTGAAT GGAATGGCAC GTTATATGCA GGAATCGTTG GCGTAGACGG CTTTGAGCCG GGGGTGGGCT ATTGGAGCGG TCAGCAATGG CTGAAACTTG ATGGGTTATC AGGTGAGGTC GATTCGGTGG TCGTGCATCA AAACCGTTTG TTTGCAGCCG GACGCTTGAC GCTTGGGGGC AACCATATCA GTATTGCTTT TTGGGATGGC AATCTGTGGA CGGCGATGCC CACCCAATTT AGCCCCAATA TTTTTATCTT GGCCAGCCAT AACGATCAAC TCTACGTTGG TGGATATTCA GAGCAGATTG CTGACCAAGC CTCAGGTTTA CTCTTGCGTT GGGATGACAC GCAATGGCAT CCCGTCGCTG AAGGTATTTT CGGCGCAGTT ATGAGTATCC TCTCGCGGCC CGATGGTCTG TATCTCGGCG GGGTCTTCCA ACTCAATGGC CAAAATACCG GCTTGATCCA TTGGAATGGA GCACAGTGGC AGAGCGTTGG CGGGGGCGTT CAGGGGATGG TGATGGATGT TGAATGGGCC AATGATCAAC TCTACATTAG TGGCAAATTC ACCTCGACGC TTGAACCAAC TATGCAGAAT ATCGCTGCCT GGAATGGCAC CAGTTGGAAT ACCTTTGGCA CCGGGATTGT TAGCCCAACC CATAATTTGG CGCTGCTTGA TGGCGACCTG TATGCCCTCA GCCAAACCGA TAGACCTTAT CCTTATCAAA CGATTTATCA GCTCCAACGC TGGGATGCGA CTCACTGGAC GACACTCTCC AATTTAAGTG AAACAAGTAG CGTTCTTAAT TGGTCGCGCT ATCCTGATGT TGTGCTCGTC AATTATCAGC AAGAATTATT GGCATTTGGG CCAATAGGCT TTGTGGATCG CAATGTGCAA ACACTGAGAT GGGGTGATTC GGCCTTGCGC TGGAAGGGTA ATTCTTGGGA AGCCATGACT CCGAATGGAA TTTCTGCAAT GAAACTCGCA TTAGCCGTTG ATGGTGAGGA TGTCTACGCA GCCTCTGGGC GGATGACTTG GGGCAATGGA CAAGCGAGTT TAGCTCATTT GTCGCCAAAT AACCAATGGC AATTGCTGAT AGCCTATGAC TCCCAGCAGC CACAATATGC ACAAGCGCTC CAAAAGTATC AGCAGAACTT TTTTAGTATT TACAATAGCA CTCTATATCA AGCGGTTAAC AATGCTTGGA ATCAAGCCAG TCCTGCGACA GTGGAAAGTT TGGCTCAAGC CAATGATTTG TTGTATGTTG CTGGCGATTT TGAGCAATTC AATGGGGTTA CAGCGCGTAA TCTGGTGACC TGGAATGGCA CACAGTGGCA AGCCTTGAAT ACGCCTGCCT CATTTGATCG GGTCGTTATT GTTGAAGCCC ATGGCGATGA TGTCTATATT AGCGATGGCT TTCAATTGGC CCACTGGAAT GGCAGCCAAT GGACAACCCT CGCCACTAAT GTGGTCAATA TTGGTTCCAT TGAGCCAACC GCCAATGGGG TCTATATCGC TGGCACATTT AGCAGTGTTG GTGGCATAGC CACACCAAAA ATTGCCTATT GGAATGGCAC GGCTTGGTCG GGTTTAACAG GCGAGATCGA TGGTTCAATC TACGATCTCG AAATGGGAGC CGATGGCTTG TACGTGGCTG GATGGTTCCG AGGCATTATC AATGGTATTT ATAGCCCAGG CATTCTACGC TGGGATGGCA CTACATGGCA TGGGCTTGGC GGTGGGGTGA AGTCCAGTGC AACACCAAAT CAACCAGGTG CTGTGACGCT GCTTGCAGCA ACCCCAACCC GCATGCTGCT GTATGGGTCT TTTGATCGGG TGGGAAATAC CTACGAATCC AAACAAATTG CAGCGTGGGA GTATGGCAAC GAACCGTTGA TTAAGGCCAA ATCGGATTAT GGCCTTACCT ATCGTCCGCA GTCAGTTACG GTGAATGTGC TGGCGAATGA TTGGAGCGAT CAGCCAAATC AATTGCAATT GGTGAGTGTG AGCAGCCCAA GCCATGGCAC GGCTGTGATT AACGGTAACT CGGTTGTGTA TAGGCCGGAA GCACAATTTG AAGGCGTTGA AACCTTGACC TATGTTGTGC GCGACCCAAT CAATGCTGTC ACCAGCACAG CGCAACTCCA GGTGCATGTC TGGAATCACT TCCCAAGCAT TGCTGATCAG GAACAAGCGG TCTATCCATT TACTGAAACG CTGCTTGACC CATTGGATGG CCTGATTGAT TTGAATGGCG ATAGCTTGAC GATCACCCAA GCCAGTGCGG TCAGCGGCAC GGTGACGATT GTCAATAATC AATTGCGCTA CATGCCGCCG AATCAACACC ATTTTACCGA TGTGGTGACG TATAGGGTAA GTGATGGTCA TGGGGGGCAA CAAAGCGCCC GGATCAACAT CCATAGCATT GATACAATCG TGACTGCAAC CGCTGATTAT GCAACAACCT ATCGTCCGTA TTCAGTCAGG GTTGATGTGA TTGCCAATGA CTGGACGATT AATGGAGAAC CCTTAGCGGT GGTGGCAGTT GACGCAGCCA TTCATGGCAC AGCAACGATT AGTGGCAACC AAGTACATTA TATTCCTGCG GAAACCTTTC AAGGTACTGA AACCTTAACC TATACCGTGC GCAATCAAAC CCGTGGCATA ACGGCAACCG CAACCTTGAC GATTGAGGTA CAAAATCATG TGCCCACTGT TGCTCCCATA ACGATTACCG TTCAGCCTAA TAGCATCACA ACGCTGAATG TAATGGCAAA TGCGGTCGAT CTAAACGGCG ACCAATTAAC CATTACGCAA GCAAGCACCA CAGCTGGCAC GGTAGCGGTG GTTAATAATC GATTGCGCTA TACCGCGCCA AATTCTTATC CATTTGTTGC GACAATCAGC TATACCATCA ACGATGGTCA TGGTGGTTCG CAGGTTGGGA CAATTGTAGT CAATAGCGTA AAGTATCACT TATTCTTGCC CTATACCATC AAATAA
|
Protein sequence | MNKPSKIIVL ITLMALLVGA LQRPVAPVIA SEPQIPTSPG TWSQAFADPL KLDPGYMPLR PIEWNGTLYA GIVGVDGFEP GVGYWSGQQW LKLDGLSGEV DSVVVHQNRL FAAGRLTLGG NHISIAFWDG NLWTAMPTQF SPNIFILASH NDQLYVGGYS EQIADQASGL LLRWDDTQWH PVAEGIFGAV MSILSRPDGL YLGGVFQLNG QNTGLIHWNG AQWQSVGGGV QGMVMDVEWA NDQLYISGKF TSTLEPTMQN IAAWNGTSWN TFGTGIVSPT HNLALLDGDL YALSQTDRPY PYQTIYQLQR WDATHWTTLS NLSETSSVLN WSRYPDVVLV NYQQELLAFG PIGFVDRNVQ TLRWGDSALR WKGNSWEAMT PNGISAMKLA LAVDGEDVYA ASGRMTWGNG QASLAHLSPN NQWQLLIAYD SQQPQYAQAL QKYQQNFFSI YNSTLYQAVN NAWNQASPAT VESLAQANDL LYVAGDFEQF NGVTARNLVT WNGTQWQALN TPASFDRVVI VEAHGDDVYI SDGFQLAHWN GSQWTTLATN VVNIGSIEPT ANGVYIAGTF SSVGGIATPK IAYWNGTAWS GLTGEIDGSI YDLEMGADGL YVAGWFRGII NGIYSPGILR WDGTTWHGLG GGVKSSATPN QPGAVTLLAA TPTRMLLYGS FDRVGNTYES KQIAAWEYGN EPLIKAKSDY GLTYRPQSVT VNVLANDWSD QPNQLQLVSV SSPSHGTAVI NGNSVVYRPE AQFEGVETLT YVVRDPINAV TSTAQLQVHV WNHFPSIADQ EQAVYPFTET LLDPLDGLID LNGDSLTITQ ASAVSGTVTI VNNQLRYMPP NQHHFTDVVT YRVSDGHGGQ QSARINIHSI DTIVTATADY ATTYRPYSVR VDVIANDWTI NGEPLAVVAV DAAIHGTATI SGNQVHYIPA ETFQGTETLT YTVRNQTRGI TATATLTIEV QNHVPTVAPI TITVQPNSIT TLNVMANAVD LNGDQLTITQ ASTTAGTVAV VNNRLRYTAP NSYPFVATIS YTINDGHGGS QVGTIVVNSV KYHLFLPYTI K
|
| |