Gene Information |
Locus tag | Haur_0827 |
Symbol | |
ID | 5732728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 935902 |
End bp | 937839 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277959 |
Product | hypothetical protein |
Protein accession | YP_001543603 |
Protein GI | 159897356 |
COG category | |
COG ID | |
TIGRFAM ID | |
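The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that contributes no residue; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 935902
end_bp = 937839
reported_gene_length = 1938      # bp
reported_protein_length = 645    # aa


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # 1938 645
```

Both computed values match the reported Gene Length and Protein Length.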
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.123059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGGTG CTGAGATTGT CAGCGTCGCA ATTTATCCTC CGGTTGGCTT TGCCCGCGTC GGAAATAGCC CAAATGAATT TTTTTATGGC CCCGAAGTGC TGGGTGCGCC CCAAGTTGAC CCCGATTTGT TTCGCGATCC TAGTGGCGCG ATCAAACGCC AAGCGGCGCG GTTTCGAGTT TATGGCTTGA ATGCCGCTGG TGAGGTGATT TGCGAGTTAA ACCAAACCAA TCCTGATCTG CAAGAAATTG TCTGGACAGC GCAGCTCGCC AATCAAAAAG CCGCGTGGTA TGAATTTATC CAAGCGCTGG ATATTCCGGC CTCTGCTGAT GGCAAAATTG TTAGCAAGCG CCGCAATGCC GATGTAACCG AGCGCGATCA ATTGACGATC GATGGTGGCA CACGCAGCAT TTGTGGTTTG AATGTTAACC CTCAGGGCCA AGAGGCGATC TATGCCTTTG ATCAGGGCAC ATTCTTTGGT AAGCACGTTT ATTTGGGCGA ATTGCGCACC GACGAACAGG GCAATTTGAT TGTGCTGGGC GGGCGTGGTC ACTCAGCTTC GGCCCATGGC AAGCCCTTAA CCACCTTTGC CGATAACGAT GATTGGCATG ATGATACTGC CGATGGCCCG ATTGAGGCCG TGGTCAAATT GAGTGATGGA CGTGTTTTGC AGGCGACTCA TGCGTGGGTG ATTGTTGCGC CGCCTGATTA TGCGCCAGGC ACAAATTCGA TTATGACAGG CTACGATTTG CTGTTTGAAG TGGCGATGCA GCTTGATCCG ACGCTTGCGC CAGCTAAGCC CCGTTTTAGC AGCGAAATTT ACCCGTTGCT CTCACGTTTT AGCCAATTGC AATGGGTTAA TGCTGGCTTT GCGCGGAGTT TTGGTTGGGG CAGCCCCAGC GATTTCAATA ATCCTGAGTT GATCGCTCGT TTGGGCGATC CTAGTGCTGC GAGCGAACCA TTGCGCCAAG CGGTTTTGGC TCAAATGCGC AATCCCAGCA ATCCCTATAT CGATCCCGAA GGCTTGCCAT TGTTTTATGG CGATGCGATT ACGCTCAACA CCAAAACCAC CGATCCCCAA GAGTGGCAAG CAATTTTGCC CAGCCAATAT CGTTGGCTAG AGCAATGGGC CAATGGCGAT TTTATTGCCG ATGGTTTGCC CGTGGTCAGG CCGTGGGCGC AGCTTGATCC TGCTGAACAG GCCTACAACC TAACGCGGGC GGCGCTTGAT GCAACCTTGG GCGGGCCGTT TCACCCGGGC TGCGAATTCA CTTGGCCGCT ACGCCATACC TCGATGTATG CAGCACCATT TCGGCTCAAA CGTCGTACTA GCCCTGAGCC AGATTGGGGC GATACGCTCG ATGCGGCTAC CGCCTTAGCA CCCGATGGGA TGTTGTATGC CAGCGATGCG GGCGCGATCA CCCGTTGGAT GGCCGTGCCT TGGCAAACCG ATACCTCAAG CTGTCTCTCG GCCTATATGG GCTATGCCGG AACCTATCTG CCAACTTTTT GGCCCGCTCG TGTGCCCAAC GATGTGCTTA GCCAAGAGAG CTACCAGATT ATTATGAACC CTGATGCTAC GCCTGAAGAG CGTGCCCAAG CCTTCGCGCC GACGGCCCGC CGCAAGTGGC TGCGCGGCAT GGTCTACACC GAGAGCATTC CACCAGGCCA CATTCCAACC GTGCCAGCAA TTACTAAATT TACCAACGAA TGGGATACCG TGGGCATTAT CATCGCCCAA ACTGGGCCTG AACATAGCGC TAGCTTCCCG CAAACGCTTT GGGTTGAAAC AGGCCGACAC 
GTTGAATCCG AGGCTCCCCA ATTAACCCGT AGCCTAGCCG CCGAGCCAGA AACTGATCTA GTTGATAGCG ACCTCAATTG GCCGCAACAA CGCCTCAAGC GAGGACGCAA AAATGCCGCC GAAGCCAATA ACGATTGA
|
Protein sequence | MAGAEIVSVA IYPPVGFARV GNSPNEFFYG PEVLGAPQVD PDLFRDPSGA IKRQAARFRV YGLNAAGEVI CELNQTNPDL QEIVWTAQLA NQKAAWYEFI QALDIPASAD GKIVSKRRNA DVTERDQLTI DGGTRSICGL NVNPQGQEAI YAFDQGTFFG KHVYLGELRT DEQGNLIVLG GRGHSASAHG KPLTTFADND DWHDDTADGP IEAVVKLSDG RVLQATHAWV IVAPPDYAPG TNSIMTGYDL LFEVAMQLDP TLAPAKPRFS SEIYPLLSRF SQLQWVNAGF ARSFGWGSPS DFNNPELIAR LGDPSAASEP LRQAVLAQMR NPSNPYIDPE GLPLFYGDAI TLNTKTTDPQ EWQAILPSQY RWLEQWANGD FIADGLPVVR PWAQLDPAEQ AYNLTRAALD ATLGGPFHPG CEFTWPLRHT SMYAAPFRLK RRTSPEPDWG DTLDAATALA PDGMLYASDA GAITRWMAVP WQTDTSSCLS AYMGYAGTYL PTFWPARVPN DVLSQESYQI IMNPDATPEE RAQAFAPTAR RKWLRGMVYT ESIPPGHIPT VPAITKFTNE WDTVGIIIAQ TGPEHSASFP QTLWVETGRH VESEAPQLTR SLAAEPETDL VDSDLNWPQQ RLKRGRKNAA EANND
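The sequences above are displayed in space-separated blocks of ten characters. A minimal Python sketch of how such a record might be processed follows: it strips the spacing, computes a GC fraction, and translates with the standard codon assignments used by translation table 11. Only the first three blocks of the gene sequence are used, so the GC value printed is for that excerpt and not the 54% reported for the whole gene. The helper names are hypothetical, and special handling of alternative start codons in table 11 is deliberately left out because this gene starts with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Tables 1 and 11
# share these assignments; they differ only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; a stop codon is rendered as '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, copied from the record.
excerpt = clean("ATGGCTGGTG CTGAGATTGT CAGCGTCGCA")
excerpt_protein = translate(excerpt)

print(len(excerpt))                    # 30
print(round(gc_fraction(excerpt), 3))  # 0.567
print(excerpt_protein)                 # MAGAEIVSVA
```

The translated excerpt matches the first block of the protein sequence, MAGAEIVSVA.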