Gene Information |
Locus tag | Haur_4267 |
Symbol | |
ID | 5736121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5441919 |
End bp | 5442926 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641281422 |
Product | hypothetical protein |
Protein accession | YP_001547027 |
Protein GI | 159900780 |
COG category | [S] Function unknown |
COG ID | [COG1300] Uncharacterized membrane protein |
TIGRFAM ID | |
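The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Haur_4267.
length = gene_length_bp(5441919, 5442926)
print(length)                     # 1008
print(protein_length_aa(length))  # 335
```

Both results match the Gene Length (1008 bp) and Protein Length (335 aa) fields listed in the record.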
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000574123 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGCAG AAGATTTTAT CAATGCCAAA CATCATGCTT GGGAACGATT AACCCAGCTA ACCAGCCGTG CTCAGAGTAA TATTATTGCC ATGAATGCCA CTGAATTGCA AGAGCTAGGC CGACTTTATC GCCAAGCAAC CTCAGATTTA GCTCAAGCGC GGCGCGATTT TCCAGGCCAT CCCTTGACGA TCTATTTAAA CGATTTGGTG GCGAAGGGTC ATAGCAGCAT CTATCGTGAG CGCAATTCGC CAATTACCGG AATAAAAAAC TACTTTCTCT ATCAACTCCC CCAAGCTTTT CGTGAGCTAT TACCATTTAC AGGCATAGCA TTTTTAGCAT TTTTTCTACC AGCAGTGGTT GCATGGGTCA TCAGCTACCA AGATCCGGCG CGGGGAATCG CCTTAGCCCC TGAATTTCAG CCGGCGATTG ACAATATGCA AGACGACATC GAATGGTGGC GCGATTTAAA CGACAACAAT GCTGAAGGTG CAGTTATGAT TCTATCCAAC AACATTTTTG TCTCATTCCA GGCATTCGTA GGAGGCTTGA CCTTAGGTTT ACTAACCCTC TATGCCCTCT ATTACAACGG ATTAATGTTA GGAATTTTAG CGGGGTCAGC TCAAAATATA GGCTTTGCCG ATAACCTATG GGGCTTTATT GCGGCGCATG GACCAGTCGA ATTGAGCATT ATCTTTCTTG CTGGTGGTGC TGGCTTACAA CTTGCTTGGG CCATTTTGCG GCCTGGCATG GTTTCGCGCC GCGCTGCCTT GGCAATTGCT GCTCAACGGG CATTCAAAGT CTCGGGAGCA ATCGTCATGT TTCTAATCTT GGCAGGATTG ATTGAAGGCT TTATTTCGCC GCAATATCTA CCATTGTGGT TTAAGATTGC AGTTGGTATT ATCAGTGCTG GCAGTATGTA TGCCTATCTT TTATTAGCTG GACGCAATGT CCAACAACCA GAATCTGCTG AAGCAAGCGC CTTGAAGCTT GAAACCCCAG CACACTAG
Protein sequence | MVAEDFINAK HHAWERLTQL TSRAQSNIIA MNATELQELG RLYRQATSDL AQARRDFPGH PLTIYLNDLV AKGHSSIYRE RNSPITGIKN YFLYQLPQAF RELLPFTGIA FLAFFLPAVV AWVISYQDPA RGIALAPEFQ PAIDNMQDDI EWWRDLNDNN AEGAVMILSN NIFVSFQAFV GGLTLGLLTL YALYYNGLML GILAGSAQNI GFADNLWGFI AAHGPVELSI IFLAGGAGLQ LAWAILRPGM VSRRAALAIA AQRAFKVSGA IVMFLILAGL IEGFISPQYL PLWFKIAVGI ISAGSMYAYL LLAGRNVQQP ESAEASALKL ETPAH
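The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. A minimal sketch, assuming the spaces are purely cosmetic and that GC content is defined as the share of G and C bases in the sequence; the helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence rather than the full record.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove the block spacing (and any other whitespace) and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, used as a small sample.
sample = normalize_sequence("ATGGTAGCAG AAGATTTTAT")
print(len(sample))         # 20
print(gc_percent(sample))  # 30.0
```

Running the same two helpers on the full gene sequence gives a value to compare against the GC content field in the record.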