Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1811 |
Symbol | |
ID | 5733669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2103908 |
End bp | 2105893 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278954 |
Product | hypothetical protein |
Protein accession | YP_001544582 |
Protein GI | 159898335 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.032061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGAA AACGTTCATC GCTGGTGATC GTTGCAGCCT TGGTGGTCGC ATTATTGAGT AGCTTTGCCG CCGCCACTCT CGTCACTCCT ACCGCCGCTT CGTCCGATGG TTTTTGGCAA GATGTGAAGG AAAGTCGGAT TGCCCAAAAA GGTGCTCGCC AAATTGTGCC AACGATCTAT CGCACCGTTG CACTTGATGT CGCTGGATTG AGCCAATATT TGGCCAAAGC ACCACTAGAG CAAGATCAGT CGGTGGCAAA CTCGTTGTAT ACCCTGCAAT TGCCCATGCC CAATGGCAAA TTCGAGCAAT TCCGCGTTGT CGAATCGCCG ATTATGGAGC CAGAACTAGC GGCAAAATTC CCCGAAATTC GCACCTACTT AATCCTTGGG GTGGATACTC CCAATCTGAG CGGGCGCTTG GATCTGACCC CCGCCGGCTT TCATGGCTTA ATTCTAGGCG ATCAAGGCCG GATTTTCATT GATCCCTACA GCGCTGGCGA TACTCAACAT TACATCGTTT ACGATAGCAA AAATTTTGTG CCCAGCAGCC AAAAATTGGC TGATCGCCAA GTCGAAGATT ATGTGATCGA CCTACCATTG ACCGATGATG GCTTGGCAGC ACCCAAAGCG GTTGGCGATA AATTGCGCAC CTACCGTTTG GCCATGGCCG CCACTGGCGA GTACACTGCC TATCATGGTG GAACCGTTGC CAAAGGCTTG GCGGCAATCA CCACCAGCGT CAATCGGGTT AACGCAGTGT ACGAACGCGA AGTTGCGGTA CGCATGGTAC TTGTTGCCAA CAACAGCAAT ATCATTTACA CCAACGCCAG CACCGACCCC TACACCAACA ACAATGGGGT TACGATGCTC AGCCAAAATC AGACCAATGT TCGCAATGTA ATTGGCAATG CCAATTATGA TATTGGTCAC GTGTTCAGCA CTGGCGGCGG CGGGGTAGCC TCGTTAGGCT CAGTCTGCTC GACCAACTAC AAAGCTCAAG GCGTAACTGG CTCATCGGCT CCAGTTGGCG ATCCATTTGA TATTGATTAT GTGGCCCACG AAATTGGTCA TCAATTTGGC GGCAACCATA CCTTCAATGG CACAACTGGT AGTTGTGGTG GTGGCAATCG CGCTAGTAGC GCCGCCTACG AGCCAGGTAG CGGCTCGACA ATTATGGCCT ACGCAGGGAT CTGTGGCGCT GAAAACTTAC AATCCAACAG CGATCCGTAT TTCCATTCCA AGAGCTTGAA CGAAATTACG ACCTTCATCA CAACTGGCGG CGGCTCAAGC TGTGGCACAG CAACCAACAC TGGCAACACC GCTCCGGTGG CCAATGCTGG GGCCGATTAC ACGATTCCAC GCAGCACACC ATTCGAATTA ACCGGCACTG GCAGCGATGC CAACGGCGAT AGCATGACCT ACAACTGGGA GCAATATAAC TTGGGCACGG CTGCGCCACC CAACACCGAC AACGGCTCAC GCCCAATTTT CCGTAGCTTC AACCCAAGCA CCAACCCTAA GCGCTCGTTC CCTAAATTGA GCGATATTTT GAACAATACC GCGACAATTG GTGAATCATT GCCAACCACC AGCCGCACCA TGGTTTTCCG GCTGATTGTG CGCGATAACC GGGCAGGCGG CGGCAGCTAT GCGATTGATA GCGCCAATGT GACGGTCAGC AGTGCGGCTG GGCCATTCGC CGTAACCTCG CCGAATACCG CCGTCACTTG GACTCGCAAC AGCAGCCGTA CCATCACTTG GAATGTTGCC AGCACCACCG CTGCACCAAT TAGCTGCGCC AATGTTGAAA TTTTGTTCTC AAGTAATGGT GGCAGCAGCT TCAGCAGCTT GGTTAGCTCA ACCCCCAACG ATGGCAGCCA AGCCGTTACC ATCCCCAACA CCGCAACCAC CCAAGCACGA ATCAAAGTAC GCTGTGCCAA CAACGTCTTC TTTGATCTTT CAAATGTGAA CTTTACCGTA AACTAG
|
Protein sequence | MPGKRSSLVI VAALVVALLS SFAAATLVTP TAASSDGFWQ DVKESRIAQK GARQIVPTIY RTVALDVAGL SQYLAKAPLE QDQSVANSLY TLQLPMPNGK FEQFRVVESP IMEPELAAKF PEIRTYLILG VDTPNLSGRL DLTPAGFHGL ILGDQGRIFI DPYSAGDTQH YIVYDSKNFV PSSQKLADRQ VEDYVIDLPL TDDGLAAPKA VGDKLRTYRL AMAATGEYTA YHGGTVAKGL AAITTSVNRV NAVYEREVAV RMVLVANNSN IIYTNASTDP YTNNNGVTML SQNQTNVRNV IGNANYDIGH VFSTGGGGVA SLGSVCSTNY KAQGVTGSSA PVGDPFDIDY VAHEIGHQFG GNHTFNGTTG SCGGGNRASS AAYEPGSGST IMAYAGICGA ENLQSNSDPY FHSKSLNEIT TFITTGGGSS CGTATNTGNT APVANAGADY TIPRSTPFEL TGTGSDANGD SMTYNWEQYN LGTAAPPNTD NGSRPIFRSF NPSTNPKRSF PKLSDILNNT ATIGESLPTT SRTMVFRLIV RDNRAGGGSY AIDSANVTVS SAAGPFAVTS PNTAVTWTRN SSRTITWNVA STTAAPISCA NVEILFSSNG GSSFSSLVSS TPNDGSQAVT IPNTATTQAR IKVRCANNVF FDLSNVNFTV N
|
| |