Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4436 |
Symbol | |
ID | 5736287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5676607 |
End bp | 5677767 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281599 |
Product | hypothetical protein |
Protein accession | YP_001547196 |
Protein GI | 159900949 |
COG category | [S] Function unknown |
COG ID | [COG0392] Predicted integral membrane protein |
TIGRFAM ID | [TIGR00374] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCGC AAACGCAAGC AAATCCTGCT AGTTTACGCC CTGAGCAAAC CAACCAAGGC TTGCCGCGTC CGGCTGAAAC CGAATCGCTC GAACAATTGG TTGAACAGGC CGCCGAAGCT GCACCGTTAC CTGCCGATGC ACTGCCCGAC GATTTAGCCG AGGTGCGTTC GAGCGGCTTT TCGCTACGTG ATAAATTTCT GAATGTTAAA TCCTTGGCAT CGTTTGGCAT TGCTTTTGCG ATTCTGGGCT TAGCATTTTG GCGAGCCGAT ATTAACCTCG CTGAGATGTG GCAGCAGATT CTTCAGACCA ATCTTTGGCT GTATAGCGCT GGTTTTATTG TTTTCTATGG TCTTTTTCCA ATTCGCGCCT GGCGCTGGCG GATTATCTTG CGTAGCGCTG GCTTTGAGGT TGATAGCCCG CAATCACGTA AAAATTGGTC GGGGATTGCG GCATTAAGCG AGTTTATTGG GCTTTCGTGG TTTGCCAACT GCGTCGTGCC TGCCAAACTG GGCGATGCCT ATCGGGGTTA TTTGGTTAAG AAAAACGGCA ACGCCTCATT CTCGCGCACC TTGGGTACAA TTTTCGCCGA ACGCATTGTC GATATGATTG TGCTGTTTGG CATGTTGGTA GTTTCGGGCT TATTGGTCTT TCAAGGCCAT CTCAATAGCT GGACCGAAAA ATTATTTATC ATCGGGATTG TCTTTACAAT TTTGCTGGTG ATTGGTTTGA TGTCGATGCG CTATCTGAGT CCGTTGATTC GCCGCGCCTT GCCCCAACGT TTCCACGATT TTTATGCTCG CTTCGAGGAA GGCACGCTTT CATCATTTCG ACCTTCACGC CTGCCAATTT TGCTGATTTT GACAATCATT GTTTGGCTGG GCGAGTCGAT GCGCTTGTTT TTCGTAATTG AGGCCATGGG TGGTTTAGGC TTATCGTTAT CAGCGATTAT TTTTGTGGCC TTAGCTAGCT CATTGTTGAC CGCCGTGCCA GCCTTGCCTG GTGGCTTGGG CTTGGTCGAG GTCGGGATTG CTGGTGTGAT GATGTTGTTT AGTGTCGGCC AAACCACTAG CACTGCCGTG GCGTTTCTTG ATCGGATCAT CAATTATTGG AGCATCGTGA TTTTGGGGTT GGTTTTGTAT CTGTTTAGCA AACGAAAGTG A
|
Protein sequence | MKSQTQANPA SLRPEQTNQG LPRPAETESL EQLVEQAAEA APLPADALPD DLAEVRSSGF SLRDKFLNVK SLASFGIAFA ILGLAFWRAD INLAEMWQQI LQTNLWLYSA GFIVFYGLFP IRAWRWRIIL RSAGFEVDSP QSRKNWSGIA ALSEFIGLSW FANCVVPAKL GDAYRGYLVK KNGNASFSRT LGTIFAERIV DMIVLFGMLV VSGLLVFQGH LNSWTEKLFI IGIVFTILLV IGLMSMRYLS PLIRRALPQR FHDFYARFEE GTLSSFRPSR LPILLILTII VWLGESMRLF FVIEAMGGLG LSLSAIIFVA LASSLLTAVP ALPGGLGLVE VGIAGVMMLF SVGQTTSTAV AFLDRIINYW SIVILGLVLY LFSKRK
|
| |