Gene Information |
Locus tag | Haur_0812 |
Symbol | |
ID | 5732712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 920575 |
End bp | 921750 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277943 |
Product | hypothetical protein |
Protein accession | YP_001543588 |
Protein GI | 159897341 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
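The length fields in this record are consistent with its coordinates. A minimal Python sketch of the arithmetic follows. It assumes Start bp and End bp are 1-based and inclusive (an assumption, though it matches the reported Gene Length) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: both are 1-based and inclusive.
start_bp = 920575
end_bp = 921750

# An inclusive interval contains (end - start + 1) positions.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1176 391"
```

Both values agree with the Gene Length (1176 bp) and Protein Length (391 aa) fields.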
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.217299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGTACG TTGAGAACAT TCCAATCTGG GGCGATCCAA TCGATCAAGG AGCCTTGAGC CAGATTAAAA CCTGTGCGCT TGAGGCCGAT GCAGTGGCGT TGATGGCAGA TCATCATAAG GGCTATGCGG TGCCAATTGG CGGCGTAGTC GCTTATCGTG ATGCAATTAG TCCTTCCGGC GTTGGCTACG ATATCGCTTG CGGCAACAAA GCCGTCTTGC TCGATCTGCC AGCCAGCGAA GTTCGCGCTA AAATCAAGCC GATTATGGAC GACATCTGGC GTTCGTTGTC GTTTGGGGTT GGCCTGAACA ACCGCCAAAG CGTTGATCAC GACTTGTTTG ACGACCCAGC TTGGCAAGTT CCAGCGGTCA AAAGCTTGAA ACAAACTGCC CGCAATCAGC TTGGCACAAT TGGCTCAGGC AATCATTATG TCGATTTATT CGCTGATGAG CTTGATCGGA CATGGATTGG CGTGCACTTT GGCTCGCGAG GCTTAGGCCA CAAAACCGCG ACCTATTTTT TAGAGGCAGG CGGGGCCAAA GATGGCATGG ATGTTGCGCC GTTGATTTTG CCAACTGCCT CGGATTTGGG CGAGCAATAT TTGAGTTGTA TGGCCTTGGC TGGGCGCTAT GCCTACGCTG GCCGCGATTG GGTTTGTGCT GAGGTTGCCC GCATTTTAGG CGCAACTATC CTTGAAGAAG TGCATAATCA CCATAATTTT GCTTGGCGCG AAACTCACAA CGGCGTGGAT TTGTGGGTAG TCCGCAAGGG TGCAACGCCC GCCTTTCCTG GTCAGCGTGG CTTTGTTGGC GGCTCCATGG GCGATATTTC GGTTATTTTA GAAGGCGTTG ACTCGCCCGA AGCCCAAACT GCCTTGCATT CAACCGTGCA TGGCGCTGGT CGGGTGATGA GCCGCACCGC TGCTCGCGGC AAAATTAATT ATCGCACGGG CAAAGTGATT GCACCTGGCA AAATTTCGCG CGAGATGATG AATGAGTGGA TTCAACAGCA AGGGGTTGAA TTACGTGGCG CAGGCACTGA TGAATCGCCC CATTGCTACA AACGATTGCC TGAGGTGCTC AACCACCACG CCAATACAAT CAAGATTGTA CATACCTTGC AGCCGCTGGG CGTAGCAATG GCTGGTGAAA ACGAGTTTGA TCCGTATAAG GATTGA
|
Protein sequence | MQYVENIPIW GDPIDQGALS QIKTCALEAD AVALMADHHK GYAVPIGGVV AYRDAISPSG VGYDIACGNK AVLLDLPASE VRAKIKPIMD DIWRSLSFGV GLNNRQSVDH DLFDDPAWQV PAVKSLKQTA RNQLGTIGSG NHYVDLFADE LDRTWIGVHF GSRGLGHKTA TYFLEAGGAK DGMDVAPLIL PTASDLGEQY LSCMALAGRY AYAGRDWVCA EVARILGATI LEEVHNHHNF AWRETHNGVD LWVVRKGATP AFPGQRGFVG GSMGDISVIL EGVDSPEAQT ALHSTVHGAG RVMSRTAARG KINYRTGKVI APGKISREMM NEWIQQQGVE LRGAGTDESP HCYKRLPEVL NHHANTIKIV HTLQPLGVAM AGENEFDPYK D
|
| |
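For working with the sequence fields, a short sketch may help. It assumes the sequences are pasted as displayed, in whitespace-separated blocks of 10 characters. `clean`, `gc_fraction` and `split_codons` are hypothetical helper names, not part of any database tool. The demo uses only the first 20 bases of the gene sequence above.

```python
def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C, in the range 0.0 to 1.0."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def split_codons(seq: str) -> list[str]:
    """Split into complete 3-base codons, dropping any trailing remainder."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return [s[i:i + 3] for i in range(0, usable, 3)]


# First two display blocks of the gene sequence in this record.
prefix = "GTGCAGTACG TTGAGAACAT"

print(clean(prefix))
print(gc_fraction(prefix))
print(split_codons(prefix))
```

The first codon, GTG, is one of the alternative start codons in translation table 11 and is read as methionine at initiation, which is consistent with the protein sequence beginning with M. Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field.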