Gene Information |
Locus tag | Haur_0702 |
Symbol | |
ID | 5732603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 807753 |
End bp | 808637 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277832 |
Product | hypothetical protein |
Protein accession | YP_001543478 |
Protein GI | 159897231 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
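The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 885 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # both end points are counted
    protein_length = gene_length // 3 - 1    # codons minus the stop codon
    return gene_length, protein_length


# Start bp and End bp copied from the record above.
gene_len, protein_len = cds_lengths(807753, 808637)
print(gene_len, "bp;", protein_len, "aa")
```

For this record the result agrees with the Gene Length and Protein Length rows.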
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00518069 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
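Rows in this record follow a "Field | value |" layout, with empty values and "n/a" standing in for missing data. A minimal sketch of reading such rows into a dictionary is shown here. The function name `parse_rows` is hypothetical, and the sketch assumes that values never contain a "|" character and that a line with a single trailing "|" and no value is a section heading.

```python
def parse_rows(text):
    """Parse 'Field | value |' rows into a dict.

    Empty values and 'n/a' are stored as None. Section headings and
    empty separator rows are skipped.
    """
    record = {}
    for line in text.splitlines():
        parts = [p.strip() for p in line.split("|")]
        if len(parts) < 2 or not parts[0]:
            continue                      # no delimiter, or an empty row
        if len(parts) == 2 and parts[1] == "":
            continue                      # section heading such as "Sequence |"
        value = parts[1]
        record[parts[0]] = None if value in ("", "n/a") else value
    return record


sample = """Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00518069 |
Fosmid clonability | n/a |
| |"""
fields = parse_rows(sample)
print(fields)
```

Values are kept as strings so that identifiers and accessions are not altered; numeric fields such as the p-value can be converted with `float()` where needed.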
Sequence |
Gene sequence | ATGAAACGTA AATTATGGCT TGGGGCTGGC CTGAGTGCCG CCGCCAGTGC ACTTGCTGGC ATCGCGGCCT ATGCCGCCCA TCGTTTAAGT GGCCCAACTG CCGCGCTCAA ACGTGAAACG CTGTTTGCTT TTACGCCGTT TGAAACCGGG GTTGATTGGG AAGAAGTTCG GTTTAATTCG GTCGATGGCT TGCAAATTCG AGCATGGTGG CTTGGCCGCC CTGAAAGCAA GCGGGTCGTG ATTGGCTGCC ATGGTCATCG GGGCCGCAAA GACGAGTTGC TTGGCATTGG TTCAGGCTTG TGGCGAGCAG GCATGAATGT GCTAATTTTC GATTTTCGCG GGCGTGGCGA GAGCGATGAT TCAATTTGTT CGTTGGCCTA TCACGAGGTT GGCGATTTGC ATGGCGCGAT CAAGTATGTT GAAGCACGCT TGCCCGAAGC GCAAATTGGG GTGATTGGCT ATTCAATGGG TGCGGCGGTC AGCCTGTTAG GCAGCGCCGA TCAGCCAGCG GTTAAGGCTG TCGTTGCTGA TAGTAGCTTT GCCGAAATGG CCAATTTGGT TGATTTTGCT TTGGCCAATC GGCGTTTGCC GCCTCGCCCA TTGCGAGCTT TAGCCGACCA GATTACTGCC CAACGTTATG GCTATCGCTT CGAGGCCGTG CGGCCAATCG AGGCCTTGAT TCGCTATGGG CAACGGCCAT TGCTGGTTAT TCATTGCACT GGTGATACGG TGATTCCCGT GGTTGATGCC TATGATTTGT ATGCCGCCGC CCAAGGCCCG AAAGAACTGT GGATTGTTGA AGACATGCCT CATTGCGGAG CGTATTTCGC TGATCGCCCA GCCTATGTTA AACGAGTCGC CGAGTTTTTC GAGCGCTATT TGTGA
|
Protein sequence | MKRKLWLGAG LSAAASALAG IAAYAAHRLS GPTAALKRET LFAFTPFETG VDWEEVRFNS VDGLQIRAWW LGRPESKRVV IGCHGHRGRK DELLGIGSGL WRAGMNVLIF DFRGRGESDD SICSLAYHEV GDLHGAIKYV EARLPEAQIG VIGYSMGAAV SLLGSADQPA VKAVVADSSF AEMANLVDFA LANRRLPPRP LRALADQITA QRYGYRFEAV RPIEALIRYG QRPLLVIHCT GDTVIPVVDA YDLYAAAQGP KELWIVEDMP HCGAYFADRP AYVKRVAEFF ERYL
|
| |
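The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that spacing, compute a GC fraction, and translate the nucleotide sequence. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the usual TCAG ordering (amino acid assignments shared
# by translation tables 1 and 11); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)


def clean(seq):
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
head = "ATGAAACGTA AATTATGGCT TGGGGCTGGC"
tail = "GAGCGCTATT TGTGA"
print(translate(head), translate(tail))
```

Applied to these fragments, the translation reproduces the first ten and last four residues of the protein sequence listed above. Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content row.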