Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4642 |
Symbol | |
ID | 5736489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5931478 |
End bp | 5932386 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281806 |
Product | sortase family protein |
Protein accession | YP_001547401 |
Protein GI | 159901154 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3764] Sortase (surface protein transpeptidase) |
TIGRFAM ID | [TIGR01076] LPXTG-site transpeptidase (sortase) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.068472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGCATA TAGAGTACGA GTTTGTAGTG GTACAACCGC CAACCATCGA TTCGACACAC GACGATTTGG GCTTGCTTCA AGAATTGCTG GCAGCACCAC CACCAGTGCG TTCGTCATTG CATCGGCCAG CCCAACTCCG TTCGCTCAAA GAAGAGCGGA AAACTGCACT TCAGGGTTAT CGTTTCCGTG GGGTGCTTGA CGCTATTTTA GTGCGCTCGG AACGTTTGTT GATTGTTGGG GTAATCATCT TTTTTGGTTA TTGGGTGGTC AACATCTACG GACGCGATTG GTGGTATGCT CGTAACAATC CGGCGGCTCC AGCAGTGGCT TGGGAAGCAC TCGCACCAGG AGCTAGTGCC GCTGAACTTG ATCGAGTGCT AGGCCAACAA TTACCAGTGA TTGCACCCGC CATCAGCGCG GCTGCACCCG ATTATCTTGT GCCTGCTCAG GCGTTTATCT TGCCACCTGC ATCGCCAACC CCAACGCCTG ATCCCATGAG CTTTGTTCCC AAACGGATGA TCGTGCCAAC AATGGAGCTT GATAGTCCGG TGCGTGAAGT TTTTCTGCGC GATGGCATTT GGGAAGTTGC CGATTATGCT GTGGGTTATC ATCATGGCAC GGCCTTACCA GGCAAAGGGA ATAGCGTTTT TGCGGGGCAT GCAGGGATTC GCGGTAGTGT TTTCGCCCGC CTGAATGAAC TTCAGATTGG TCAAGATATC TACGTCGAAA CTGCCGATAC CCGTTATCAT TATCAAGTCC AGACAATTCA ACAAGTTTGG CCAAACCAAG TCGAAGTTAT GTATCCAACC GAGCAAGCAA TTATCACCAT GATCACATGT ACCGCTTGGG ACACCCAGCG GTTGGTAGTT AAGGCTCGGC TGATTGATCA GGCAGCGCTC TCCTCTTAA
|
Protein sequence | MKHIEYEFVV VQPPTIDSTH DDLGLLQELL AAPPPVRSSL HRPAQLRSLK EERKTALQGY RFRGVLDAIL VRSERLLIVG VIIFFGYWVV NIYGRDWWYA RNNPAAPAVA WEALAPGASA AELDRVLGQQ LPVIAPAISA AAPDYLVPAQ AFILPPASPT PTPDPMSFVP KRMIVPTMEL DSPVREVFLR DGIWEVADYA VGYHHGTALP GKGNSVFAGH AGIRGSVFAR LNELQIGQDI YVETADTRYH YQVQTIQQVW PNQVEVMYPT EQAIITMITC TAWDTQRLVV KARLIDQAAL SS
|
| |