Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4588 |
Symbol | |
ID | 5736433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5869383 |
End bp | 5870666 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281750 |
Product | hypothetical protein |
Protein accession | YP_001547347 |
Protein GI | 159901100 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0268669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCTG CATCTAGTTT AGAGCTGCTT GGGCTAGCGA TGTGTCTCAT TATTGTGAGT GTCGCCTCAG CAGCCGAAGC AGCGCTCTCT ACTATCTCAC GCCACCGAAT GAATACGCTG TTGGAGAACG GCGAACGCCG CGCCAACGTG GCTATGCGCC TGTTGGAAGA CCCCTATCAT CTGCGAATTG GGACGTTGTT GCTCAGTACC GTCGGGACGA TTGGCGCAAC GGCGCTGCTC TTGACGTTTG TCGCCAATCG CACTGGTTGG CAGCAAGCCT CGTTTCTGTT GATCTTTCTG CTGGTTTGGC TCACCATCGG CACAACCTTA CCCAAAACCC TCGCCACCGC CTATCCCGAT AGTATGGTGC TTTGGACGGT GCGTCCATTG CGGATTTTGC TGTGGCTGGT TGCGCCAATG CAATGGCTGT TGCGCCAAAT TGCCCGCCCA ATTGGCTTGG TAACTGGCCA GCATCCTCTC ATCGTTACCG AGGAAGAACT GAAATTAATG GTCAACGTCG GCGAAGAAGA GGGCGTGATC GAGGCCGAAG AACGCGATAT GATCGAGGGT ATTTTTGAAT TTAGCGATAC GGCTGTACGC GAAGTGATGG TCCCACGGAT TGATATCGTC GGTTTGGCTG CGACTGCCTC GATGGAAGAA GCACTGAATT TGTTTATTAG TGCTGGCCAC TCGCGGTTGC CGATCTACGA CGAGTCAATT GACCATGTGC TCGGAGTGTT GTACGCTAAG GATCTGTTTC CGCTGTTGCG TGATGGGTTG CGTGATGCGC CATTACGCTC CTTGGTACGC CAGTCCTATT TTGTGCCCGA TTCGATCAAA GTTGATGATT TGATGCGGGC TTTGCAAAGC CGCAAAGTGC ACATGGCGAT TATCGTCGAT GAATATGGCA GCACGGCGGG CTTAGTCACA ATTGAGGATT TGCTGGAAGA GATCGTTGGC GAAATTCAGG ATGAGTTCGA TAGCGAAGAA GCGCCGATTC AACAGGTTGG CCCTCACGAA TGGCTGTTCG ATGGTCGGGT CTCAATTGAT GCAGTCAATG ATGCCACAGA GTTAACATTA ATCAATGACG ATGTAGATAG TCTCGGTGGC TTCGTTTTGT CGATGTTAGG CTCAATGCCC AAAGTCGGCG ATGTTATCCA AGCTGGAGAT ACAACAATTG AGGTTGTTAC GATTCAAGGC TTGCGACCAC AACGCCTACG CCTAAGCTTA GCCCATGCCG AACACGAGTT CGCCGAGGTT GGGAGTAATA CCGATGATGG TTGA
|
Protein sequence | MDPASSLELL GLAMCLIIVS VASAAEAALS TISRHRMNTL LENGERRANV AMRLLEDPYH LRIGTLLLST VGTIGATALL LTFVANRTGW QQASFLLIFL LVWLTIGTTL PKTLATAYPD SMVLWTVRPL RILLWLVAPM QWLLRQIARP IGLVTGQHPL IVTEEELKLM VNVGEEEGVI EAEERDMIEG IFEFSDTAVR EVMVPRIDIV GLAATASMEE ALNLFISAGH SRLPIYDESI DHVLGVLYAK DLFPLLRDGL RDAPLRSLVR QSYFVPDSIK VDDLMRALQS RKVHMAIIVD EYGSTAGLVT IEDLLEEIVG EIQDEFDSEE APIQQVGPHE WLFDGRVSID AVNDATELTL INDDVDSLGG FVLSMLGSMP KVGDVIQAGD TTIEVVTIQG LRPQRLRLSL AHAEHEFAEV GSNTDDG
|
| |