Gene Information |
Locus tag | Haur_1124 |
Symbol | |
ID | 5733016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1288723 |
End bp | 1289913 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278263 |
Product | hypothetical protein |
Protein accession | YP_001543900 |
Protein GI | 159897653 |
COG category | [S] Function unknown |
COG ID | [COG5282] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03624] putative hydrolase |
|
|
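The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either convention; they are assumptions that happen to reproduce the listed values.

```python
# Values copied from the Gene Information table above.
start_bp = 1288723
end_bp = 1289913

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1191 bp) and Protein Length (396 aa).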
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGACAA AACGTTTATC AATTAAGCAA CTTGGGAGTG TGCTATTGAT CAGTGCCGCC GCAGGCGCTG GCGCACGCTA TCTCGCCAAT AAAACCCGCC AACAAGGCCA AAAAATCTAC GATTATACAA CCCCACAAAA ACTGATCGAT TGGTCGGTTG CCCGCATGAT TGCCCTGCGA GTCTCGCAAT GGCAAGAGCA TCCCGTGATC AATCGGGCTG AGCGCCAAGC CGAATACGAT CAAATGGTTG AGCGCAGCCA GCCCTTAATC GACGATTATT TGAACGTCAA GCTGCCCGAA ACGCTCTCGC GGGTCAAAGT CGTTGATCGC AAAGAGTGGA TCGATGCCAA TTTGCGTTCA TTTGAGCGCT TGTTTGAGCC AATTGAGCAA CTTTATCGCC AAGCAGCTCA AAATGCAGGC CGTAATCCAT CAGTCAATGC GATCAATCGG GCATTTGTCG GCGCACAAGT TGGGGCAATG GTTGGGGTTT TAGCGCGAAA AGTCCTAGGT CAATACGATC TCAGTTTGCT TTCGCCGCAA GCTGAGCCAG GTATGCTCTA TTTTGTTGAG CCAAATATTG CCCGCGTGCA ACAAGGTTTG GGCGTTGATG ATCATGACTT CCGCTTGTGG ATTACGCTGC ATGAAACCAC CCATGCCTAT GAGTTTGAAG CCTATGGCTG GGTGCGCGAC CATTTCAGCA ACTTAATTCA GCGCTATTTC AACGAGCTTG GCGGCCAACT CGATGCGCTG CGCAATGGCG TTGGCAACTT CATCAACCGC ATTTTCAACA ATAGCAAAAC CCCCAACGAT GGCCATTGGA TGGAGCAGCT GTTGACTCCA ACCCAACGCC AAGTATTTAG CGAACTCCAA GCCTTGATGT CGTTGGTCGA AGGCTATAGC AACCACATTA TGAATGCGGT TGGCCGCGAA ATTCTGCCAA ACTTCGAGCA GATCGAAGCT CGCATGAGCG ATCGCAAAGA AAAACGCTCG ATCTTCGATG AATTGTTCAA TCGTATCACA GGTATGAGCC TCAAAATGCA GCAATACGAA CAAGGCGAAC GCTTCGTTAA TGCAATTGCT GATCATGGTG GCAAGGAATT AGCTGCCCGT GTTTGGGAAG GCCCAGCAAT GTTGCCAACC CTCGAAGAAA TTCGCAACCC ACAACTATGG ATCGCCCGCG TTGGCGCATA A
|
Protein sequence | MRTKRLSIKQ LGSVLLISAA AGAGARYLAN KTRQQGQKIY DYTTPQKLID WSVARMIALR VSQWQEHPVI NRAERQAEYD QMVERSQPLI DDYLNVKLPE TLSRVKVVDR KEWIDANLRS FERLFEPIEQ LYRQAAQNAG RNPSVNAINR AFVGAQVGAM VGVLARKVLG QYDLSLLSPQ AEPGMLYFVE PNIARVQQGL GVDDHDFRLW ITLHETTHAY EFEAYGWVRD HFSNLIQRYF NELGGQLDAL RNGVGNFINR IFNNSKTPND GHWMEQLLTP TQRQVFSELQ ALMSLVEGYS NHIMNAVGRE ILPNFEQIEA RMSDRKEKRS IFDELFNRIT GMSLKMQQYE QGERFVNAIA DHGGKELAAR VWEGPAMLPT LEEIRNPQLW IARVGA
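The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11 (the table named in this record), which are the same as those of the standard code, and it assumes the first codon is the initiation codon and is therefore read as methionine even though it is GTG here. The names `CODON_TABLE` and `translate_cds` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        # Assumption: the first codon is the start codon and encodes Met,
        # even when it is an alternative start such as GTG.
        aa = "M" if i == 0 else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate_cds("GTGAGGACAA AACGTTTATC AATTAAGCAA"))
```

The printed residues correspond to the first block of the listed protein sequence, MRTKRLSIKQ.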