Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4478 |
Symbol | |
ID | 5736329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5730739 |
End bp | 5735208 |
Gene Length | 4470 bp |
Protein Length | 1489 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641281641 |
Product | peptidase domain-containing protein |
Protein accession | YP_001547238 |
Protein GI | 159900991 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0387173 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAAGA GAATGCGTCC TAAGAGTCGG CTACTCCTGC TTTTTATTGG TTTTGCAAGC CTATTAAGTA GTCTACCAAC TACCCTTACA GCCGCAGCTT CCCCTCATTC AATTACAACT ACCAATGCTA GTGATGACTA TCAAATTTTG ATAAGTGGCC AAACTCTAAC CGAATTATCA ATTGAGCCAG CAACCGACCA CGATTATTTC TATATTGATG TTAATGCAGG TCAAACGATT ACTGTCGTAA TGGTGCGAAC AAGCGCCGAT CAAGCATTAA ATGGACTTTT AGATTTATTT GATCCAACAG GCGCGATGAT TGCCTCTGAT GATAATAGTG GTCATCGCGG CAACCCTTTA ATCCAAGAGG TTGAGGCCAT AACAACAGGT CGTTATATGC TTCGGGTACG TGATTATAGT GGAACCCGTA CCGGAACCTA TCAAATTACA GCGACTGTCG CTGAGCCATT AGTTTCAGGA CCAAGTTTTG CAATAACTGA TCTTGAGACC GATCCCTGTC CATTACCAAT GGCCGCAATC ACCGTACCAG AGATCGCAGA TCCGACTACA GGGCCAAAAA ATCCACAATT CGAAATAGGG AATCAAGCCT TACCTCGGAC TGGTTATCCC TTAAATTGTG AATTTGCGCG GGATCGGCAG ATTCTGGCAA CCGCCATTAC GCCTACAGCA TTCACCATTC AAAATGCTGG CCAAGTGCAA ACGACTGATG CCTTTGTGGT TGATAGCAGC GCTCAATATC TGTTGTTTGA CTATATGGTT GGGCGAAAAA CAGCGGATAA ACCAACACGA TTACATGTTG AGGTCTTGAG TGGGCCTACG TTTGCGACGA TCACTGATCT TAGTCAACAT ACGATTCGCG GACAGTATCT TGATGGTTGG AAACGTGGTG TGTTGGCGGT CGATGCATTT CGAGGGCAAA CCATCAAGCT GCGATTTATT AACGACAGTT CATCAGCTGC TCAGCCATCA GCACAAGTAC GAGCTATTAA GCTGTCGATT GAAGTTCCCG ATTGGCAACC ATCGCCTGTT GGCACAACCG CAATTGAGTA TAACGATAGC ATGGGAGCAC ATGCTGTTAT TACTGGGGCA AGCGCATTTT TGATTTCGGC TCCATTTACA ATTCCTCTGC AAACCCAAAG TCTAAGTTTC AATTACCAAA CTGGCCGACG CTTGAATAAT GCCAGTGCCC CAATTGAGGT GCAAGTGCTC AATGGGCCAG ATTTCATTAC CGCAACTCCG ATTGACAATA ACACAGTCAC AGGGCGCTTG AGTGATGGTT GGAAACGAGC GACACTTGAT ATCCAAGCAT TCCGTGGTCA GGTTGTCAAA CTCAAAATTG TGAATGATTG GTGGCCCAAC GAGCCCCAAA CAACCGCAAT TGATAGCTTT AAGTTAAATC GGGCTGTGCC AGGGTGGGAA GTCACCAATG CCAATTATGT CTCGATCGAG AGCTTGCAGA TTCCGCCAAG TACACTAACG AATGTATTAA CCAACACCGA TTTTGAAATT GGTTTTACTC CTATACCAGA CATTATCCCT AATCAAACAT TTGAGCTTCC AAGCAATCCG CTTCAAACCC TCAGTTTTTC AACAATGAGC TTGAATGGTA CGAATGTGGT AACTACGACT CCCCCAATCG TTGTGCCTGA ACAAGCCACC AGTTTACAAT TTGAGGCATT AATTGGCGAT AGTAGCAATC CAAGCTTGAT TAAACCAGTG ACGGTCGCGA TCCTCAGTGG TGATGCATTT GATATTCGCG AATTGCCGAT TGATCATCAA ATACGTGGCA CAATCCAAAC CGGCTTACAA ACAGCAGTAA TTGATATCAA ACGCTATCAA GGCAAAACGA TTAAGCTTCA GTTTACCAAT CACACAACCA ATGCGCCAAC GAGCCAATTG AGCAATTTTC GTTTAGTTGA TCATGTGCCG CAATGGCAAG CAAATAGCCA AACCAGGTTA AACTTAATCA ATGAATCATC AGCCAATCCA ACCCATGCCT TTTTGGTTGG AACCCAAAGT AGCCTTCTAT CGGCCCCATT TACGCTACCC ACTGAGGCTC AACAAATTCG TTTTGAGTAT CGTACTGGCC ATACAGATAA TGCCACGCGC CAAAGTCGGA TTCAATTAAC CGTCTTAAGC GGTCCAGACT TTGGGATTCG CACGCGGATT GATCAGAATC GGCTTGTTGG AACTGATGTG ATCGGTTGGC AATCAATTGC ATTTGATCTC CAACGATTTC AAGGCATGCC AATTAAACTC GAATGGGTGA CTGAGTTAAC CAATCAACCA TATCTGCGCT TGGATAATCT CCAGGTAGGG GTGGCAATGA CTGGTTGGCA AGCCAGTGAG TCCAGTGATA TTCTGATTGA GCCAACAACC CCGACGTTAG GTCAAAGCAT GCGGATTAAT GGAAATGCTG CAACCATTAC GAGCCAGCCA TGGACTGTAC TCAGTAATAC CGTCAGTCTT AGTTTCGATT ATAAAGTGCT AAGAATTAAC GAAACCGGCA ACGCGAACTT GTATGTGGAT GTCTTGAGTG GTCATAACTT TGAGGTGATA ACCCGCATTG ATGCGAATGG CTTGGTTGGC TCGATCACTA CCCCCAATAA TGGATGGCAA CGCGCAACAT TAAATGTATC CCAATTCCAA GGGCGGACGA TTAAGCTGCA ATTTAAAAAT GCGGGATATG CGATGGCTCA ATCGTGGATT GATAATCTGA CGCTCAATCA CGGGCAACCA AGCGCGAGCC ATGGTTCTGA TGAAGCCCCT GACGGCAGTT TCCTAACACT GCTGAATACT GGTACTGCCC AATCTGCCCT CTCATCGAGC TTTGTGGTTG CTACGGATAC CCAATTTTTG CGCTTCGAGT ATCAAACCGG GACATTTGAG CATGGCAACG AGCAACGCTC GTTTGTCGTC GATATTCTCT CAGGCAATAA TTTTGCAACT ATCACCACGA TTAACCAATC CTTGCCAAGC CGTTCGTTAA ATGATGGCTG GCAAGTTGCC AAACTACCAA TCAGTCAGTT CCAAGGGCAA ACGGTCAAAC TACGGCTAAC CATGCCTTTT GTTACCAAGC GTTCGGTTGT TCGCATTGAC AAACTGGCGT TGTTGAGCCC GCGAGCGCAG TTGACCACGC CAATCGCTGT TGATGGTACA ACCTATTTAA ATGTACCACT GACTGAATTA GGTGGCATTA CGACAACCGC TAGCATGACC ATTACCGCGC TGGTTGTTTA TGATGAGTAT GTCGATTTGG AGGGTATAGT ACGATATGAT AATGCTAGCT ATCAATTAGT ATCGACCGGA ACGACATATC GTTCTATGTT GGGCAGTCCC AACGATAAAG TAGTGGATAG TATTGATCAA TCGAATACTT TTAATCTGCT CCACTTTGCC GTACGAGATA ACCTGCCAAC GACGGCATCT CTTCAGCAGA TGGCTATTGC AGATCCTGTA ATTGCACTCT ATCTCCAGAA GAAGAATACT CGTCAATTAA CCGCCTTTGA ATTCAGTATT GATAATACGC TTTTGATTAA TGAGCTAGCG AGCGCAATGG CACGGTACGA TACTGATTAT TACCATGATA TATGGTTTAA GAAAATAATT AAACCCTATT TAATTGAAGA TATTGATATT ACAGACTATT CAGCTAATAC CAGCAATCAT TCAGAACTAT TACTGAAACA TAGGGTAAGA ACAAAAAATA TTGCCTGTGA AGTTATCGAA GAAGTTGTTT ATGCTATCGT TTTTAATGGG GACGCTGATA TTGAAAAGCA TCAGGAGCAT GATATTTATT CAAGTATTGA AATTATGTAT GAGCAAAGTA GTGGTAAACC AATTAATCCA AATAGTAATC CAGCAATATG TGATGGTTAC TATGATTTAC CAAGTAGCTC ATCAATTCAG ATAGGAACAA TTTATCCTAA CCCGAAGCAG ACATTGAGTG TGAAAATTGC GACATATGGA ACCGAAACAA ATGGAGAAAA AGCCGATTAT CTTACCAGAG GACGATATAA TGGTAGCTTT TATGTAGAAA AGAAAAAATA TACACCTCAA ATTTATCTGA ATGCCGCTGT TTCTCTTCCT AAAGTTCCAA TGTTGGCGTT TCTTACTGGC AATATAGCAT CAACTGTTGT TTCCGAAGGT CGAGAGGAAT TTGTAGCCAA AAACCAAGAA AAGGTATTTA AGCATGTATT TAAAGGGATT ATTGATCTTA ATAGTTTTGC TACCCGCTCT GCAAATCATT CATTTGATTA TGCAGTACTT GATACTGCTG GATATGATGC TAATGCTTTT TGGGGGATTA AGCACTGGAA GGGTGGTCGA TTCAGTAGTA GTGCTATGGT TTCTTGGCAT ATCCCTTTTA CTAATAATAT CTATGATGCT GAAAGCCATT TTTCCGAAAC AAATCATACA ATCATTATCT ATTTTATGAG CGGTAATTAA
|
Protein sequence | MRKRMRPKSR LLLLFIGFAS LLSSLPTTLT AAASPHSITT TNASDDYQIL ISGQTLTELS IEPATDHDYF YIDVNAGQTI TVVMVRTSAD QALNGLLDLF DPTGAMIASD DNSGHRGNPL IQEVEAITTG RYMLRVRDYS GTRTGTYQIT ATVAEPLVSG PSFAITDLET DPCPLPMAAI TVPEIADPTT GPKNPQFEIG NQALPRTGYP LNCEFARDRQ ILATAITPTA FTIQNAGQVQ TTDAFVVDSS AQYLLFDYMV GRKTADKPTR LHVEVLSGPT FATITDLSQH TIRGQYLDGW KRGVLAVDAF RGQTIKLRFI NDSSSAAQPS AQVRAIKLSI EVPDWQPSPV GTTAIEYNDS MGAHAVITGA SAFLISAPFT IPLQTQSLSF NYQTGRRLNN ASAPIEVQVL NGPDFITATP IDNNTVTGRL SDGWKRATLD IQAFRGQVVK LKIVNDWWPN EPQTTAIDSF KLNRAVPGWE VTNANYVSIE SLQIPPSTLT NVLTNTDFEI GFTPIPDIIP NQTFELPSNP LQTLSFSTMS LNGTNVVTTT PPIVVPEQAT SLQFEALIGD SSNPSLIKPV TVAILSGDAF DIRELPIDHQ IRGTIQTGLQ TAVIDIKRYQ GKTIKLQFTN HTTNAPTSQL SNFRLVDHVP QWQANSQTRL NLINESSANP THAFLVGTQS SLLSAPFTLP TEAQQIRFEY RTGHTDNATR QSRIQLTVLS GPDFGIRTRI DQNRLVGTDV IGWQSIAFDL QRFQGMPIKL EWVTELTNQP YLRLDNLQVG VAMTGWQASE SSDILIEPTT PTLGQSMRIN GNAATITSQP WTVLSNTVSL SFDYKVLRIN ETGNANLYVD VLSGHNFEVI TRIDANGLVG SITTPNNGWQ RATLNVSQFQ GRTIKLQFKN AGYAMAQSWI DNLTLNHGQP SASHGSDEAP DGSFLTLLNT GTAQSALSSS FVVATDTQFL RFEYQTGTFE HGNEQRSFVV DILSGNNFAT ITTINQSLPS RSLNDGWQVA KLPISQFQGQ TVKLRLTMPF VTKRSVVRID KLALLSPRAQ LTTPIAVDGT TYLNVPLTEL GGITTTASMT ITALVVYDEY VDLEGIVRYD NASYQLVSTG TTYRSMLGSP NDKVVDSIDQ SNTFNLLHFA VRDNLPTTAS LQQMAIADPV IALYLQKKNT RQLTAFEFSI DNTLLINELA SAMARYDTDY YHDIWFKKII KPYLIEDIDI TDYSANTSNH SELLLKHRVR TKNIACEVIE EVVYAIVFNG DADIEKHQEH DIYSSIEIMY EQSSGKPINP NSNPAICDGY YDLPSSSSIQ IGTIYPNPKQ TLSVKIATYG TETNGEKADY LTRGRYNGSF YVEKKKYTPQ IYLNAAVSLP KVPMLAFLTG NIASTVVSEG REEFVAKNQE KVFKHVFKGI IDLNSFATRS ANHSFDYAVL DTAGYDANAF WGIKHWKGGR FSSSAMVSWH IPFTNNIYDA ESHFSETNHT IIIYFMSGN
|
| |