Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1651 |
Symbol | |
ID | 5733535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1918183 |
End bp | 1919148 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278790 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001544422 |
Protein GI | 159898175 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3591] V8-like Glu-specific endopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000137982 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCTAT CTGTTCGGCG GTACGCACGG GTCGCGGTGG TTTTGAGCGT TTTAGGCTTG AGCTTGGCCC AAGGCGATGT GACCTTAGCC AAGAAGGAAG TCGAAGCTGG GGTTGACCCA CATACCATCG TGGCTAGTGA TGGGAAGCCT GTAGACGTTA TTACTGATGG CATTGTTTAT GATCAATTTG GGGCCTATAT CGCTGGTAAC GAAGGAACTG GTGTTTTAGC CGAAGATGAT CGAGCCGAGG CCGTTGATCC CAATGCCTTG CCTGCAACCC AAAGTGGCAG CCAAGAAGGC TTAAATTCAG TGATTGGTAC TGATAATCGC GTGCGAATCA CCGCGACGAC CTCAGACCCC TATCGCCGGA TCGGCCAAAT TACCTTTAGC AGTGGCGGCG GCAACTACAT TTGTACTGGT TGGTTGATCA GCGCCAACAC CGTGGCAACC GCAGGCCACT GTCTCTGGAG CAACAACGCT TGGGTCACCA ACGTTAAGTT CTACCCAGGT CGCAATGGCA CATCGAACCC TTACGGCGGC TGTAACGCCA CCAAACTCTT TACGGTTTCA CAATGGCAAA CCAGTGGCAG CCCCAACTAC GATTATGGTG CATTCAAAAT TAATTGTAGT GTTGGCAGCC AAACTGGCTG GTTCGGCTTG CGTGCCCCAA GCAACACCGG CTTAGTTGGC CAAGTAACCA ACATTGCTGG CTACCCAGGC GATAAAACCT CGGGCACGAT GTGGTTCCAC GCCGATACCG TGCGCAGCTA CACCAGCCTA CGACTCTCGT ATGCCAACGA CACCTATGGC GGCCAAAGTG GCTCACCAAT TTGGAACAGC AGCGGCAGCT GCACCAACTG TTCGATTGGC GTACACACCA ATGGCGGTAC CACCACCAAC TCTGGTACAC GCATCACCTC AACCGTCTTG AGCAACTTCA ACACCTGGAT CAATACTGCC CCATAA
|
Protein sequence | MHLSVRRYAR VAVVLSVLGL SLAQGDVTLA KKEVEAGVDP HTIVASDGKP VDVITDGIVY DQFGAYIAGN EGTGVLAEDD RAEAVDPNAL PATQSGSQEG LNSVIGTDNR VRITATTSDP YRRIGQITFS SGGGNYICTG WLISANTVAT AGHCLWSNNA WVTNVKFYPG RNGTSNPYGG CNATKLFTVS QWQTSGSPNY DYGAFKINCS VGSQTGWFGL RAPSNTGLVG QVTNIAGYPG DKTSGTMWFH ADTVRSYTSL RLSYANDTYG GQSGSPIWNS SGSCTNCSIG VHTNGGTTTN SGTRITSTVL SNFNTWINTA P
|
| |