Gene Information |
Locus tag | Haur_3761 |
Symbol | |
ID | 5735625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4730823 |
End bp | 4732277 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280913 |
Product | peptidase |
Protein accession | YP_001546525 |
Protein GI | 159900278 |
COG category | [S] Function unknown |
COG ID | [COG3182] Uncharacterized iron-regulated membrane protein |
TIGRFAM ID | |
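The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The helper names `gene_length_bp` and `expected_protein_length_aa` are hypothetical, introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the stop codon is counted in the gene length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record for Haur_3761.
length = gene_length_bp(4730823, 4732277)
protein_length = expected_protein_length_aa(length)
print(length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1455 bp and protein length of 484 aa.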
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACTA CCACAACCAC CAACGACGGA GCAATCACCG ACGAACGCCG TTCACAATCT GTTTTCTATC GCGCAATCTG GCGTTGGCAT TTCTATGCCG GTTTGTTTGT TGTGCCGCTG ATGATTGTCC TAGCCGTAAC TGGCAGCATT TATCTATTCA AGCCCCAACT TGACCGTTTG ATGTACGGCG ATTTGATGCA TGTCCAACAC ACCAGTGGCT CGGCGCAAAG CTACACTAGC CAATTGGCCG CTGCCCAAGC TGCTTATCCC GCTGCCAGCG TTGGCAAAAT TCGCCCAAGC GATGCCCACG ATCGTAGCAC TGAAATTAGT ATGAGCACCA GCGATGGCCG TAATCTGACG GTTTTCGTCA ATCCCTATAC CAATCAAGTG CTCGGCGAGC GCGATGAAGA TTGGAATTTA CAAACGATTG CCTTGAAATT ACACGGCGAG TTGCTGATTG GCACAACGGG CGATCGGATT ATTGAATTAG CCGCTTGCTG GGCGATTTTG CTGACTCTTT CGGGGTTGTA TCTCTGGTGG CCACGCTCGA AAAGTGGCAT CTGGGGCACG TGGCTGCCAC GCTTGCGCAG CAAAAACAAA CGGATTTTCT GGCGCGATTT GCATGCCGTG CCTGGCATGT ATGCCTCGTT GATTGTGCTG TTTTTGCTGA TTTCGGGCTT GCCGTGGACT GGCTATTGGG GTGATAAATT TGCTAATGTT TGGAGCGGTT ACCCCAATCA ACTTTGGAGC AATATTCCTG AATCTACAGT ATTGACTGGC AGCCTCAACA CCACAACCGA CAAAGTTGTG CCGTGGGCAG TTGAACAAGC GCCATTGCCT CAATCTGACC CTGATCATGC TGAGCATCGT GGTGATGGAG CGAGTGCGCC GGTGCCAAGC AGCGCTACCG AAGGCCCACA AGCCGCAACG CCCGTCACGC TTGATTCGGT GATTGAAGTC GCCAAAGCCC GTGGCGTGAT TGCCAGCTTT ACGGTTACGC CACCTGATGG AGAAAAAGGC GTGTACACCA TCGCTGCGGT GGCGAATGAT CCTGCTGATG AAGCCACAAT TCACGTTGAT CAATATAGCG GAGCGATTTT GGCCGACATT CGCTGGCGTG ATTATGCCAT GGTTCCCAAA GCTGTTAGCA TGGGTATTTC GCTGCACGAA GGTAAATATT TCGGGCTGGC TAACCAACTT TTAGCCTTGT TTGGGGCCAT GACGGTGCTG TTGCTATCGG TTTCGGGCGT GGTGTTGTGG TGGAAACGCC GCCCTGAGGG CCGCTTGGGT GCGCCCAATT TGCCAGCCAA CTTCCCCCAC TGGAAGCCTG TGTTGCTGAT GGTGGTGCTG GCGAGCTTGG CCTTCCCCTT GGTTGGCGCT TCGTTGTTGT TTATGCTGGT GTTGGACCTG ACGGTGTTTC GGTTTGCGCC AAGCCTCAAG CAACGCCTTG CTTAG
|
Protein sequence | MTTTTTTNDG AITDERRSQS VFYRAIWRWH FYAGLFVVPL MIVLAVTGSI YLFKPQLDRL MYGDLMHVQH TSGSAQSYTS QLAAAQAAYP AASVGKIRPS DAHDRSTEIS MSTSDGRNLT VFVNPYTNQV LGERDEDWNL QTIALKLHGE LLIGTTGDRI IELAACWAIL LTLSGLYLWW PRSKSGIWGT WLPRLRSKNK RIFWRDLHAV PGMYASLIVL FLLISGLPWT GYWGDKFANV WSGYPNQLWS NIPESTVLTG SLNTTTDKVV PWAVEQAPLP QSDPDHAEHR GDGASAPVPS SATEGPQAAT PVTLDSVIEV AKARGVIASF TVTPPDGEKG VYTIAAVAND PADEATIHVD QYSGAILADI RWRDYAMVPK AVSMGISLHE GKYFGLANQL LALFGAMTVL LLSVSGVVLW WKRRPEGRLG APNLPANFPH WKPVLLMVVL ASLAFPLVGA SLLFMLVLDL TVFRFAPSLK QRLA
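The relationship between the two sequences can be spot-checked by translating the ends of the gene sequence. This is a minimal sketch, not the database's own method: it assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on translation table 11 using the standard codon assignments for elongation (the table differs from the standard code in its permitted start codons, which this sketch ignores). The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding-strand sequence; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First 30 and last 15 nucleotides, copied from the gene sequence in the record.
first_30 = "ATGACGACTA CCACAACCAC CAACGACGGA"
last_15 = "CAACGCCTTG CTTAG"

print(translate(first_30))  # MTTTTTTNDG
print(translate(last_15))   # QRLA*
```

Both fragments match the start and end of the listed protein sequence, with the final TAG codon acting as the stop.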