Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2801 |
Symbol | |
ID | 5734682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3560973 |
End bp | 3562181 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279944 |
Product | hypothetical protein |
Protein accession | YP_001545567 |
Protein GI | 159899320 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.99719 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGC TTGCACGTGT TACCGCAACC ATCCTGTTTA CGCTGGCGCT TGTGGTATTA ATGTGGATGT TCAGCGATGC GGTGTTGCTT TTCTTAGTTT CGCTGGTGAT TGCCTCGGCT GTGCGGCCCT TTGCTCAGCG CTTGATTGAT CGGGGCATGA GCCGTGGAGC GGCCTATGGC TTGGTCTATT TTGCCAGTTT TGTGTTTTTT GGAGGCTTGT TTGCGGTGAT CAGCCGTGGT TTGCTGAGCG AATTACAGCT AGCCGGCGAC CAATTATTGA ATTTTTACAC TCGCGTCACG CTAACCTGGC CCCAAGGCCA AGAATGGCAA CAAACGATAG CTAGCGAACT GCCCTCGCGT CAAGCATTGC TCGATTATTT TGCAGGAACT GATGGGACAG CAGTAATCAA TAGCATCGTG GGGTTTGGCA GCGGCGTATT CGATTTTCTA GCACAGTTGG TGATTTTGCT CTCACTGAGC ACCTATTGGG CAGTTGATCA AAACCGCTTT GAGCGTTTAT GGCTTTCGTT GATCAGTGCC AAACATCGCC AACGCGCCCG CACCATTTGG CGGGCAGTGG AGTTAGAAGT TGGTGCCTAT ATTCGCAGCG AAGTTGCCCA AAGCTCGTTG GCAGGCATAA GTTTAGGGAT TATTTTTGAA TTGATTGGCT TGCCCTATCC AACGCTTTTG GCCTTGTGGA TTGCCATTGC TTGGCTGGTG CCATGGGTTG GGGTATTGTT GGCCTTGATT CCAGCGGTGG CCGTAGGCTT ATTGGTCAAC ATTCCTGTGG CCTTATTGGC AGGTTTATCG ACAATCGCCG TGTTTGCCTT TTTGCAAATT TGGGTCGAGC CACGCTTGTA TGGGGCCAAT CGGGTTAGTC CAGTTTTGGT GGTGCTGATA TTAATGGTGA TGGCGCAAGC CTTTGGGATT TTGGGCATGC TGATTGCCCC ACCCTTGGCA GCCACCATTC AAATTTTGGG GCGCGAACTG TGGTTTAGCA AATCGCCAGA GGAGCAATTG GCTGATGATT ATCGCAATCA GTTGCATACC CTACGCCAAC GGCTGGCGAC GATTGAGCGC GACGAGGTGA CAGCAGAAGT CGCAGCGACC GCCCAACATC AATCGTATAT TCGCCAAATT GAGGCGCTAA TGGATCAGAC TCAAGCTGCG TTGGGCAATG CTGGCATGCT CAATCGTGAG CAATCGTAG
|
Protein sequence | MKQLARVTAT ILFTLALVVL MWMFSDAVLL FLVSLVIASA VRPFAQRLID RGMSRGAAYG LVYFASFVFF GGLFAVISRG LLSELQLAGD QLLNFYTRVT LTWPQGQEWQ QTIASELPSR QALLDYFAGT DGTAVINSIV GFGSGVFDFL AQLVILLSLS TYWAVDQNRF ERLWLSLISA KHRQRARTIW RAVELEVGAY IRSEVAQSSL AGISLGIIFE LIGLPYPTLL ALWIAIAWLV PWVGVLLALI PAVAVGLLVN IPVALLAGLS TIAVFAFLQI WVEPRLYGAN RVSPVLVVLI LMVMAQAFGI LGMLIAPPLA ATIQILGREL WFSKSPEEQL ADDYRNQLHT LRQRLATIER DEVTAEVAAT AQHQSYIRQI EALMDQTQAA LGNAGMLNRE QS
|
| |