Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2389 |
Symbol | |
ID | 5734270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3043570 |
End bp | 3044904 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279530 |
Product | hypothetical protein |
Protein accession | YP_001545157 |
Protein GI | 159898910 |
COG category | [S] Function unknown |
COG ID | [COG4325] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATAA CTCGCGAACT CGAAAATCAG TATCATAAGC TGGATTCATC GTTATGGTTT CGGCCCACGC TCATGGCAAT TGGCTCGGCT ATTTTGGCCT TTTTCACAGT TGAACTTGAT CGAGTTTTCG ATTTTGATCA TGTGGCTTTT CTACGGGCTG GGATCGACGA TGCGCGGGCG ATTCTCTCTT CTGTCACCAG TTCGATGCTC ACCGTCACCA CCGTTACCTT TTCGATCATT ATGGTAGCCT TGGTACTAGC GTCGCAGCAA TTTTCGCCGC GAATTATTCG TAATGTGATG CGCGATACGC CTTCGCAATA TGTGCTGGGC ACATTTATTG GCACATTTAT TTATAGTTTG CTGGTGCTGG GCCAGATTAA CGATCAAGCA TCGTTTGTCT TTGTGCCGAT TCTCTCACTT GCCACTAGCA TTATGTTAAC TCTCTTGAGT ATTGTGGCCT TTATTTATTT TGTACACCAC ATCGCCGAGA CGATTCAAGC CAGTGTCTTG ATTGCCCGCG CTGCCGAACG CACAATCGAT GTGTTGGATC GGCGCTTCCC CGAAACACTG GGCCATGCGA TGGAGCAAAT TCCGCCACCG CCAATCCCCA ATGAAACCCC AACCACGATT TACAATGCCA AGGGTGGCTA TATTCAGGCG ATCGATCCTG TGCCTTTGTT GGAGCTAGCT CAGCGCTTCG ATGTGGTGAT TTATATGGAT CGGGCGGTCG GCGATTTTGT GCCAACTGGC AATCCACTCT TACACATGGT TCCGCAACGT GAGCTTGATC CCGATAGCAT CGCTGAATTT CAAGATGTGT TTGAGATTGG TTTAGAACGA ACGTTGTTTG ATGATGTGTT GTTTGGCATT CGCCAGCTCG TGGATATTGC GCTCAAAGCG ATTTCGCCTG CGGTCAATGA CCCCAGCACC GCGATTAATG CGATCGATTT GTTGAGCGAT GTACTGGCGC AGGCCATTCG TCGCCCTGAG CAATCGCCAT GTCGCTACGA CGAATTTGAT CAGCTACGGG TGGTTGCGAA TACAATTACA TTTCGCCAGA TGTTAGGCAC TGCGCTCAAC CAAATTCGCC AATATGCCAA AGGCGAAATC GCCGTAACTG CCCGTTTGTT GGTGTTACTG AATGAAGTTG CGCTAGCATG TAACGATCAA GAACGTCGGG CCATGCTCTG GGAGCAAGCC TGCATCATCA CGCGGGGAGC CGATCAAGCC ATCACTGAGC CATTCGATCG CGCCTATATC AACGAACATT TGCTGACGCT TGCCAATACA CTAGCAATCG CCTCTGAGCA ACGCATCACG CTGAAAGTTG GCTAA
|
Protein sequence | MRITRELENQ YHKLDSSLWF RPTLMAIGSA ILAFFTVELD RVFDFDHVAF LRAGIDDARA ILSSVTSSML TVTTVTFSII MVALVLASQQ FSPRIIRNVM RDTPSQYVLG TFIGTFIYSL LVLGQINDQA SFVFVPILSL ATSIMLTLLS IVAFIYFVHH IAETIQASVL IARAAERTID VLDRRFPETL GHAMEQIPPP PIPNETPTTI YNAKGGYIQA IDPVPLLELA QRFDVVIYMD RAVGDFVPTG NPLLHMVPQR ELDPDSIAEF QDVFEIGLER TLFDDVLFGI RQLVDIALKA ISPAVNDPST AINAIDLLSD VLAQAIRRPE QSPCRYDEFD QLRVVANTIT FRQMLGTALN QIRQYAKGEI AVTARLLVLL NEVALACNDQ ERRAMLWEQA CIITRGADQA ITEPFDRAYI NEHLLTLANT LAIASEQRIT LKVG
|
| |