Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0489 |
Symbol | |
ID | 5732388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 568929 |
End bp | 571535 |
Gene Length | 2607 bp |
Protein Length | 868 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277615 |
Product | lysyl endopeptidase |
Protein accession | YP_001543268 |
Protein GI | 159897021 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000398143 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGAT CCCGTTGGGT ATTTGCCTTA CTTGCGATTA CCGTTTTATT ATCGGTCATC GGGGCTAAGG TGCCGTTCAC CGCTGCGCAA AATCAAGCCA GCACTTCCGA CCGTCCATTA GCATTGGCGC AAGGCTTGAA GGATTTGAAC ACAGTAGCCC AGATTGCTGT GCCAGAACTT GACCTTGCCA AAGAACGTGC TGATGCCGCT AAGCGCCCAA GTAATTTGCC AACGCGTTTC GCCAAAGATT ACCAAACCAC GCTTGACATC AAGCAAGTGG CTTCAATCGA AATTGTTGGC AATCGTACCG TCGCTCGTTT ACGCATCGAC GCACCAAAGG CCTTGTCGAT CAACGTAGGC TTTACCAGCT ACAACCTCCC CAAGAGTGGC CAATTGTTTC TCTATAGCCC TGATTATCGC TCAATTCTTG GGCCATACAG TGCTGCTGAT AACGAAGAGC ACGGCCAATT ATGGACTCCA ATCGTCGCTG GCGATCAAAT GGTTATCGAA TACAGCGCCG ATAGTGGCGA GTTTGCCCTT GCCGATTTGA CCTTGAGCGC AATTAACCGT GGCTTCAGTG GCTTCGGTAT TCCCCGCGAT CTGTTGGTCG ATAAATCTGG CTCGTGTAAC GTCGATGTGG TCTGTCCTGA GGGTGATGAC TGGCGGGCCG AAATTAACTC AGTTGCCGCT TATACCCGCA ATGGCTTGGA TATGTGTAGT GGTGCATTGA TCAACACCAC GGCCAACGAC CAAAAACCCT ACTTCCTAAC CGCGAATCAC TGTGGCATCA CCGCAGAAAA CGCAGCAACG GTCGTAACCT ATTGGAATTA CGAATCAACC CTGTGTCGCA CGGTTGGCTC AGCCGAGAAC GGCACGCCAT TGCCCAAGCC CAACACCACG ATGACTGGGG CAACTTTACT GGCCAACTAT GCCGCATCCG ACTTTGCCTT GATTGAACTT GATGACGAAG TTCCAACCGA ATATGCCCCA TTCTGGTCTG GTTGGAATGC CCAAAGTGGC GACTTCCCCA GTGTTGTGGC GATTCACCAC CCTGGCGTTG AAGAAAAACG CATCAGCTTC GAAAACCAAG CAACCCAAAC CACCGATTAC CTTGGCACAA CCGTCCCAGG CGGCGGCACC CACATTCGGG TCATCGACTG GGATCTGGGG ACAACTGAAG GTGGTTCATC AGGCTCACCA TTGTATGATC CCAATCACCG GATCGTTGGT CAGCTCCATG GTGGTTATGC CGCCTGTGGT AACGACCGCG ACGACTGGTA TGGCCGGATT TCGGTTTCGT GGAACGGTGG CGGCTCAAGC ACAAGTCGCT TGAAAGATTG GCTTGATCCA ACTAACTCAG GCTCATTAGT GCTTGATGGC ACAGGTGGAA CGCCAGCCTT CACTATGAAC GTCAACCCTG CGAGCGTGGC AGTTTGTGCG CCTGTTTCAG CCCAAACAAC GGTTAACCTC GGTTGGGTTC AAGGCTTCAG CGAGCCAGTA ACCCTCTCGG CCAGCAACTT GCCTGCTGGG GCAACTGCCA GCTTTACGCC AAATCCAGTT ATTTCGCCAA CCTTAAGTAG CCAACTCACG ATTGGCAACC TCAGCACAGC AATGGCAGGC GATTATAGCG TGGTTATCCG TGGCGATAGC ACGACGATTA GCCGCACCAC CGACCTCGAT CTTAGTATCT CAGGCGGTTT GCCAAGTGCT GCCCCAAGCT TGACGGCTCC TGCTAATAAT GCGACCAACG TTAGCGAAAC CCCTGCATTC AGTTGGTCAA GCGCTGCTGG TGCAACCAAC TACGTGCTGG AAGTTGCTAG CGATGCCAGC TTCAGCAACT TGGTTTACAC CGCAACCACT GAGTTGACCA GCTTGACCAG CGCTCCATTG AGCACCAACA CCAAGCACTA CTGGCGCGTC CGTGCTGCCA ATGCTTGTGG AGTCAGCGCC AATAGCAGCA TCTTCAGTTT CACCACCGAA GCTGCGCCAG GCGATTGTCC AATTGGCACC GAAACTGTGG TTGCCTTCAA CGAAACCTTT GATAGCGACC CAAGCTGGAC TCACGGCGGC ACTGGTGATA CCTGGGCCCA TGGCGCGTTC GGCTATAACG GTGGCAACGG CATCAAGGCA ATTGACCCTG ACTCGACATC TGATCAATGG ATTACGACTC CAGCTATCAG CTTGACCGAA GGGTTGACTC CAACGTTGCA ATTCTGGAAC TCGCAAACCA TCGAAGATCG CAATGCTGGT GGCTGTTATG ACGGTGCATT GGTTGAAGTT TCAACTGATG GTGGCTCAGC TTGGAACCAA ATTCCAAACT CAGCCTTGTT AACCGACCCG TATAACGGTG CAATTAATGC ATCAACCAAC CCACTCAATG GTAGCCAAGC TTGGTGTGGA GACCCCCAAG ATTGGTTGAA GAGCGTGGTT GATTTGAATG CTTACAACGG CCAAAGCGTG ATGTTCCGCT TCCGCCTTGC ATCCGACGAT TCGGTTGGCC GCCCTGATGG CTGGAAGATC GATAACGTCA GTGTTAAGGC TTGTGTTGCC GAAGCTGAGC CAGAAGGCCC ATCGTTGGTC TTCTTGCCAG CAATCACCAA GAACTAA
|
Protein sequence | MVRSRWVFAL LAITVLLSVI GAKVPFTAAQ NQASTSDRPL ALAQGLKDLN TVAQIAVPEL DLAKERADAA KRPSNLPTRF AKDYQTTLDI KQVASIEIVG NRTVARLRID APKALSINVG FTSYNLPKSG QLFLYSPDYR SILGPYSAAD NEEHGQLWTP IVAGDQMVIE YSADSGEFAL ADLTLSAINR GFSGFGIPRD LLVDKSGSCN VDVVCPEGDD WRAEINSVAA YTRNGLDMCS GALINTTAND QKPYFLTANH CGITAENAAT VVTYWNYEST LCRTVGSAEN GTPLPKPNTT MTGATLLANY AASDFALIEL DDEVPTEYAP FWSGWNAQSG DFPSVVAIHH PGVEEKRISF ENQATQTTDY LGTTVPGGGT HIRVIDWDLG TTEGGSSGSP LYDPNHRIVG QLHGGYAACG NDRDDWYGRI SVSWNGGGSS TSRLKDWLDP TNSGSLVLDG TGGTPAFTMN VNPASVAVCA PVSAQTTVNL GWVQGFSEPV TLSASNLPAG ATASFTPNPV ISPTLSSQLT IGNLSTAMAG DYSVVIRGDS TTISRTTDLD LSISGGLPSA APSLTAPANN ATNVSETPAF SWSSAAGATN YVLEVASDAS FSNLVYTATT ELTSLTSAPL STNTKHYWRV RAANACGVSA NSSIFSFTTE AAPGDCPIGT ETVVAFNETF DSDPSWTHGG TGDTWAHGAF GYNGGNGIKA IDPDSTSDQW ITTPAISLTE GLTPTLQFWN SQTIEDRNAG GCYDGALVEV STDGGSAWNQ IPNSALLTDP YNGAINASTN PLNGSQAWCG DPQDWLKSVV DLNAYNGQSV MFRFRLASDD SVGRPDGWKI DNVSVKACVA EAEPEGPSLV FLPAITKN
|
| |