Gene Information |
Locus tag | Haur_4309 |
Symbol | |
ID | 5736168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5502197 |
End bp | 5504134 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281469 |
Product | hypothetical protein |
Protein accession | YP_001547069 |
Protein GI | 159900822 |
COG category | |
COG ID | |
TIGRFAM ID | |
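
The length fields above are consistent with the coordinates, and this can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above (Start bp, End bp).
length_bp = gene_length(5502197, 5504134)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")
```

With the values in this record, the computed lengths match the listed Gene Length (1938 bp) and Protein Length (645 aa).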
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGCGG CGTTTTCGTG GACTAATTTT TTGCTTGCAG CCAATGACCT AACGCTGGCA GCAATTGTGG TCGTGGGGTT TTCGTTATTT GCCTATATTG CGCTGCACAA TTGGCGCAAT GGGGTAGCTC GTTCGTTTTG TTTGTTGATT GTGGGGTTGA TGATTGTGCT GGGGGGCGCA ATTCTCCAAC GTCAAGCGCA AACCGAAGCT ACTCGTCATG TGCTTTGGCG CATTCAATGG GCAGGTATTA GCCTTGTGCC TGCGGCCTAC TACCATTTTG CCGAATCTCT GCTTCGTAGC ACGGGTGATC CACGTATGTG GACGCGAGTG CTCTTGCCTT TATTCTATAC ATTTAGTGTT GGCTTTTGGC TGGTCGCGCT GACGAGTAAC ATTTTGGTGA TTGATGTGCC AAGTCAGCCG TATGTTGGCT TTGGCAAAGG GCCATTGTTT TGGTTCTTCA TTAGCTATTT CGTCACCGTG TGTTTGCTCG GGGTTTGGTG TATTCGCCAA GCCCATCGGC GCTCGATTAC TCCGGCCAAT CGGCGGCGCT TATGGTATTT AAGTACTTCT TTTTTAGCCC CATTTCTGGG TGTATTTCCC TATTTGATTA TCGCCGCCAA TACTAAGGGC GTGCCATCCT GGCTTTCGTT GATGCTACTA GGCGCAAGCA CCACCGGCGT GGGAGTGATG ATGACCCTCA TGACCTATAG TGTGGCTTTC CATGGGGTGA TTGTGCCAGA TCGTTTGGTT AAGTATAATT TCTTACGTTA TGTATTATAT GGGCCATTCG TTGGCGTAGC CTTGATTATT TGTTTGCAAT TGGTTGAGCC GATCAGTGCC GCCACCGGCT TGCCGCGTGC AACGATCACG ATTTTTGGGG TAATGTTGAT GACGGTGATG ATGCCAATTT TTATTGGGCG AATTCGGCCA ACCGTCGATA CCTTGATTTA TCGCCAAGAT AGTGATGAAG TGCGTTGGAT GCGCCGCTTC GAGGAGCGAG CTTTCACCCG CCAAGATTTA CGCCAATTGC TCGAAAATAC CTTGGTGGCG GTTTGTGGCT CGTTGCGGGT TGAATCAGGC TTTGTGCTGG CTCCCAATGA CGAGCATTTT ACCGTTCAAG CATCGTGTGG CCCGCGCCGC ACCATCAAGC AATTTTTGAA TGTTCATGAT ATCAACGAGC TATTGCAAAA TCTGCCCCAC TTGGCCTTCA GCAACGATCG CATTCCCGAA GTTGAGGATT TTAGCATTCG CGATGGCTTT TGTTTGTTGC CATTGTACAA TAGCCAGCAG GAATTATTGG GCGCAATTGG GATTGGTTGT CGGCCAGAAC AACTGACCAT TCCCACTCGC CAGTTAATTG CAACCTTGGC GCATCAGATG GAATTGGCGC TAACCCATAT GCAATTGCAG CAAAACCTAT TTAGCTCGTT GCGTGGGCTA GCCCCCCAAA GCGCTTCGCT GTTACAATTA ACCAGTGAAA TTGAAACGCC AGTCACCGAA AAAAACGATG CGCTGGCTGA TGTGGCCTTG CACCCTGAGT TTCCACAGTT GGTCAAGGAT GCACTTTCAC ATTATTGGGG TGGCCCAAAA CTGAGCGATA GCCCATTGCT CGATTTGCGC ACAGTGCGCC AATTGCTCGA TACCCAAGGT GGCAGCCCAA CGCGGGCTTT GCAAGGCGTG TTACGCCAAG CGATCGAAAA CATTCGGCCT GAAGATCAAC TTGATCCAAC GGCTCCCGAA TGGATGATTT ACAATATTTT AGAATTACGC TTTCTCAAAG GCTTACGGAT ACGCGAGATT 
ATAGATAAGC TCGCAATGAG CGAATCGGAT TTTTATCGAA AACAACGGGT GGCGGTGGAA GAAGTGGCCC GTCAGTTGGC GCTGATGGAA GACCAAGGCG ATCGCCCTTC CGGCTCGGTT GAGCGACAAC GTCCCTAA
|
Protein sequence | MVAAFSWTNF LLAANDLTLA AIVVVGFSLF AYIALHNWRN GVARSFCLLI VGLMIVLGGA ILQRQAQTEA TRHVLWRIQW AGISLVPAAY YHFAESLLRS TGDPRMWTRV LLPLFYTFSV GFWLVALTSN ILVIDVPSQP YVGFGKGPLF WFFISYFVTV CLLGVWCIRQ AHRRSITPAN RRRLWYLSTS FLAPFLGVFP YLIIAANTKG VPSWLSLMLL GASTTGVGVM MTLMTYSVAF HGVIVPDRLV KYNFLRYVLY GPFVGVALII CLQLVEPISA ATGLPRATIT IFGVMLMTVM MPIFIGRIRP TVDTLIYRQD SDEVRWMRRF EERAFTRQDL RQLLENTLVA VCGSLRVESG FVLAPNDEHF TVQASCGPRR TIKQFLNVHD INELLQNLPH LAFSNDRIPE VEDFSIRDGF CLLPLYNSQQ ELLGAIGIGC RPEQLTIPTR QLIATLAHQM ELALTHMQLQ QNLFSSLRGL APQSASLLQL TSEIETPVTE KNDALADVAL HPEFPQLVKD ALSHYWGGPK LSDSPLLDLR TVRQLLDTQG GSPTRALQGV LRQAIENIRP EDQLDPTAPE WMIYNILELR FLKGLRIREI IDKLAMSESD FYRKQRVAVE EVARQLALME DQGDRPSGSV ERQRP