Gene Information |
Locus tag | Haur_1613 |
Symbol | |
ID | 5733515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1873480 |
End bp | 1874544 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278752 |
Product | sortase family protein |
Protein accession | YP_001544384 |
Protein GI | 159898137 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3764] Sortase (surface protein transpeptidase) |
TIGRFAM ID | [TIGR01076] LPXTG-site transpeptidase (sortase) family protein |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCGAC GACTACGAGC CTATTTGGTG ATTGTTGGCC TGTTGGTAGC TACCGTAAGT GCTGGCTCAG CCGAAACTGC GGCGGGACAG CCGCGCTACT TTGCCGAAAC TGGCCATAGT TTGGCCTATA ATTTTCGGCT ATTTTGGGAG CGCAATGGTG GCTTGCCAAT TTTTGGCTAT CCAATTACTG AAGTGTTCGT TGAAAATGGT CGTCCAGTAC AATATTTTGA GCGAGCGCGG CTGGAGTGGC ATGCAACCAT TGGCTGGACG CTAGCCGGCC ATCTGGGGCA TTGGGCAGCC GAAGGCTCAG CTAAACATCC AGCCTTCACG CCGCGCAGCG AAGCCGCCTA TCCTGGTCAA ATCTTCTTCC CTGAATCGGG GCATACCCTA GGTGGGCTGT TTCGCCAGTA TTGGGAGCGC AACGGTGGGT TGCAAGCGTT TGGCTATCCG TTATCGGAAG AATTTCTCGA GCGCAATCAA CAAGATGGCC AAATTTATAC GGTGCAATAT TTTGAGCGCA CACGCTTTGA ATATCACCCT GAATTGCCAG CAGCTTTTCA AGTCTCGTTG GGCCATTTAG GTCGCCAATA TTTGAATGCT ACTAAGGCTG CGCCGGAATG GGCTACCCGC AAAGTCAATA ATGCTGATGC AGCGTGGCAA GCATTACGGC CAACTCGTAT CAGCATTCCA CGAATTGGGC TTGATAGTAC GATTGTTGAA GCAGGTTTTT CGTTGGGAAC ATGGGACGTA CCAACCGATG CTGCGGCCCA TTATTGGCCA GTGGCAAGTT TTCCAACAAC GGCTGGGAAT ATAGTACTAG CAGGTCATGT CGGCTATCAT GGTATTATCT TCAGTCAGTT ACCGAATGCA GTCGTCGGCG ATCGCTTGAT CCTGACTGTT GATGGGGTAG AACACCGCTA CCAAGTAACT GACATAAGTA CTGTGACCCC CGACCAAACA TGGGTAATGG AGCCAACCGC TGAAGAAACG GTGACGCTAA TTACCTGTGT GCCGATCGGT GTGTATTCGC ATCGCCTAAT TGTGCGTGCG AAGCCCCAAC CGTAG
|
Protein sequence | MLRRLRAYLV IVGLLVATVS AGSAETAAGQ PRYFAETGHS LAYNFRLFWE RNGGLPIFGY PITEVFVENG RPVQYFERAR LEWHATIGWT LAGHLGHWAA EGSAKHPAFT PRSEAAYPGQ IFFPESGHTL GGLFRQYWER NGGLQAFGYP LSEEFLERNQ QDGQIYTVQY FERTRFEYHP ELPAAFQVSL GHLGRQYLNA TKAAPEWATR KVNNADAAWQ ALRPTRISIP RIGLDSTIVE AGFSLGTWDV PTDAAAHYWP VASFPTTAGN IVLAGHVGYH GIIFSQLPNA VVGDRLILTV DGVEHRYQVT DISTVTPDQT WVMEPTAEET VTLITCVPIG VYSHRLIVRA KPQP
|
| |
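The fields above can be cross-checked against each other: the gene length should follow from the start and end coordinates, and the protein should follow from translating the gene sequence with translation table 11. The following is a minimal sketch of such a check, not part of the record or of any database API. All function names (clean_sequence, gene_length, gc_percent, translate) are hypothetical and introduced here for illustration. It assumes the sequence uses only the bases A, C, G and T, and it relies on table 11 assigning the same amino acids to codons as the standard code (the two differ in which codons may act as start codons).

```python
# Sketch: sanity-check a gene record's coordinates and sequences.
# Function names are illustrative only, not from any database API.

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
# NCBI translation tables 1 and 11 share these assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop.

    Caveat: an alternative start codon (e.g. GTG or TTG) is not converted
    to M here; this record starts with ATG, so that case does not arise.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Values copied from the record above.
length = gene_length(1873480, 1874544)
print(length)            # 1065
print(length // 3 - 1)   # 354 (codons minus the stop codon)

# First three blocks of the gene sequence give the first protein block.
first_blocks = clean_sequence("ATGCTGCGAC GACTACGAGC CTATTTGGTG")
print(translate(first_blocks))  # MLRRLRAYLV
```

Passing the full Gene sequence field through clean_sequence and translate should reproduce the Protein sequence field, and gc_percent on the cleaned sequence can be compared with the reported GC content.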