Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2909 |
Symbol | |
ID | 5734780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3679769 |
End bp | 3681739 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280052 |
Product | hypothetical protein |
Protein accession | YP_001545675 |
Protein GI | 159899428 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0214301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGATTG GATTGATTGT GCAACGTAGA ATTATTCAAA TAAGCAGTTT TATAGCTATT TTAGCTCTTT GTTTAGCCAT AACCTTAATT TGGGTACTGC AACGTAGCCC TGCGCAGGTC ACAATTGGTG GCAAATACGA TTCACCATGG CTGGTGGAGG GCTTTCAAAC CAAAGAACGC TCAGAATTGG GAGCCTATCG TTGGACGAAT GGCCATGGAA TTATTGGCTC ACCAGCAACT CATCGTAGTT ATATGCTAGG CTTGAGTTTA GTTTCTCCCG TAACAACCAC GGTTGTTCAG CTCAATAATG CTGGACATCG GGTGCTTGAG CTACCGATTA GTAATGCGCC GCGCTATTAT CAAATTTTTT GGCGGCCCAA TATCTCGCTG AATTGGCTGC GTTGGGCTAG CGACCAGCAA CTTACGCTCA ATAGCGAGCT GCAAGTGCTT GAAGGCAGCG ATCAACGCCA ACTAGGCGTA GTCTTGCAAA ACCTCAGTTG GTCGCCTACT GGCAGCATTT CGTTATTACC ATTTGCCTGG ATTACCGCGC TGGTGGTGAG TTTAGCGGCA TTAATCAGGC CACAGCAACG GCGTGATTGG CTTTGGTTTG CACTATCGGC TATTGGGTTA AGCCTAATGC TCGGCGGCCT GAGCTGGCTT GCCAACGATC AAAGCGTTTG GTCGCCATTA CGCTTTGCCC CTAGTTTGTT GATTTTGCCC TTGGCGGGGT TTGCACTATT GCGCTGGCCA TGGCAAGGCT GGTGGCAAGC CTTGCCAGTA CTCAGTTTAA TTGGCATTGC GGCAATTGTG ATGCTGCTCT CGCGCCAGTG GTGGGCGGTT GAGGGGCCAG ATTTTGGCTG GCACGCCAAC CATGGTAGTT CTGCCGAATC GGTGTTTCGG GCGCATCCCT TCTATCCATT GGGATTTCCG CTAATCTTAT GGCTTGGTTT GTTGTGGAAC GGCGATCAAC TAGCGATTGG GCAAACCGCT GGATTTATCA GCATGCTCTT GAGCTTGCTG CTCACGGGCT TATTGGCCTA TCGGATCTTG GCCTTGCGTG GGGCAATTGT CGCCTTGATT TTGGCCTTAG CAACCCCGCT ATTACTGGCT TTTGGCGTGG TGGCCAGCAG CGATAGTGTC CAATTGCCAG CCTATTTAGC TGCGTTATTG ATCCTAGTTT GGCAACCAGA ACTGACCCGC CGCCGCGTCG CACTGGCTGG GTTGTGCTTG GGGTTGGCTT ATTTATTCCG TTTTCAATCG ATAGTGATTG TGGTGCTGGT TTTGCCATGG TTATGGCTGC AACGCCTGCC TGCCCCGCCG CGTTGGCCGC AACGTTTGGC AGGTTGGTTT GCTCCAAGTT TGCTTTTAGC CGGATTTTTG CTTGGGTCAT CGCCGCAGTG GGTGCTCGAT ATTCGCGATA CAGGGCGACC ATTTTTCTCA CAACAATATG AAAACATCTG GCAAGCTGCC TACAATCGGG TTGATGCGGT AGTAGCTGCC GATAGCCCCG AAGCGATTGC CACCGCGCCC AGCGATACAG GCTTATACGA TATTGTGGCG TTTGATCCAT ATGGCCTATT TCGTCATTGG CAAGCTAATT TAAGCCAATT TTTTAGCTTT ACCTTGCACA CAATCTTTAT TTGGCCATTT GGCTTATTGA TGCTTTTGGG ATTGGGCTTA GCGGTATTGA AACGGGCTGA CCCGCGTTTG AGTTTGTTGG CATGGCTGAG TTTAAGCTAT ATTCCAATTA TTGCCCTAAC CTGGAACAAA GATCGTTTTT ATCTACCGAT TGTGCCCTTG TTGTTGGTGC TTGGCGCGTA TTGGTTGGAG TGGTTGCGCG GGCAGGCCTG GCGTTGGCCA CGAGGCAGTC GTTGGTTGGC TGAGGCAGTT CAGGCTGCCA GTTTGGCTTG GGCTTTGAGC CACCTCAGCG CAATCGATCC GATTTTACGG GTGTATGGAA GCTTAAAATA G
|
Protein sequence | MRIGLIVQRR IIQISSFIAI LALCLAITLI WVLQRSPAQV TIGGKYDSPW LVEGFQTKER SELGAYRWTN GHGIIGSPAT HRSYMLGLSL VSPVTTTVVQ LNNAGHRVLE LPISNAPRYY QIFWRPNISL NWLRWASDQQ LTLNSELQVL EGSDQRQLGV VLQNLSWSPT GSISLLPFAW ITALVVSLAA LIRPQQRRDW LWFALSAIGL SLMLGGLSWL ANDQSVWSPL RFAPSLLILP LAGFALLRWP WQGWWQALPV LSLIGIAAIV MLLSRQWWAV EGPDFGWHAN HGSSAESVFR AHPFYPLGFP LILWLGLLWN GDQLAIGQTA GFISMLLSLL LTGLLAYRIL ALRGAIVALI LALATPLLLA FGVVASSDSV QLPAYLAALL ILVWQPELTR RRVALAGLCL GLAYLFRFQS IVIVVLVLPW LWLQRLPAPP RWPQRLAGWF APSLLLAGFL LGSSPQWVLD IRDTGRPFFS QQYENIWQAA YNRVDAVVAA DSPEAIATAP SDTGLYDIVA FDPYGLFRHW QANLSQFFSF TLHTIFIWPF GLLMLLGLGL AVLKRADPRL SLLAWLSLSY IPIIALTWNK DRFYLPIVPL LLVLGAYWLE WLRGQAWRWP RGSRWLAEAV QAASLAWALS HLSAIDPILR VYGSLK
|
| |