Gene Haur_4478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4478 
Symbol 
ID5736329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5730739 
End bp5735208 
Gene Length4470 bp 
Protein Length1489 aa 
Translation table11 
GC content43% 
IMG OID641281641 
Productpeptidase domain-containing protein 
Protein accessionYP_001547238 
Protein GI159900991 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0387173 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAAGA GAATGCGTCC TAAGAGTCGG CTACTCCTGC TTTTTATTGG TTTTGCAAGC 
CTATTAAGTA GTCTACCAAC TACCCTTACA GCCGCAGCTT CCCCTCATTC AATTACAACT
ACCAATGCTA GTGATGACTA TCAAATTTTG ATAAGTGGCC AAACTCTAAC CGAATTATCA
ATTGAGCCAG CAACCGACCA CGATTATTTC TATATTGATG TTAATGCAGG TCAAACGATT
ACTGTCGTAA TGGTGCGAAC AAGCGCCGAT CAAGCATTAA ATGGACTTTT AGATTTATTT
GATCCAACAG GCGCGATGAT TGCCTCTGAT GATAATAGTG GTCATCGCGG CAACCCTTTA
ATCCAAGAGG TTGAGGCCAT AACAACAGGT CGTTATATGC TTCGGGTACG TGATTATAGT
GGAACCCGTA CCGGAACCTA TCAAATTACA GCGACTGTCG CTGAGCCATT AGTTTCAGGA
CCAAGTTTTG CAATAACTGA TCTTGAGACC GATCCCTGTC CATTACCAAT GGCCGCAATC
ACCGTACCAG AGATCGCAGA TCCGACTACA GGGCCAAAAA ATCCACAATT CGAAATAGGG
AATCAAGCCT TACCTCGGAC TGGTTATCCC TTAAATTGTG AATTTGCGCG GGATCGGCAG
ATTCTGGCAA CCGCCATTAC GCCTACAGCA TTCACCATTC AAAATGCTGG CCAAGTGCAA
ACGACTGATG CCTTTGTGGT TGATAGCAGC GCTCAATATC TGTTGTTTGA CTATATGGTT
GGGCGAAAAA CAGCGGATAA ACCAACACGA TTACATGTTG AGGTCTTGAG TGGGCCTACG
TTTGCGACGA TCACTGATCT TAGTCAACAT ACGATTCGCG GACAGTATCT TGATGGTTGG
AAACGTGGTG TGTTGGCGGT CGATGCATTT CGAGGGCAAA CCATCAAGCT GCGATTTATT
AACGACAGTT CATCAGCTGC TCAGCCATCA GCACAAGTAC GAGCTATTAA GCTGTCGATT
GAAGTTCCCG ATTGGCAACC ATCGCCTGTT GGCACAACCG CAATTGAGTA TAACGATAGC
ATGGGAGCAC ATGCTGTTAT TACTGGGGCA AGCGCATTTT TGATTTCGGC TCCATTTACA
ATTCCTCTGC AAACCCAAAG TCTAAGTTTC AATTACCAAA CTGGCCGACG CTTGAATAAT
GCCAGTGCCC CAATTGAGGT GCAAGTGCTC AATGGGCCAG ATTTCATTAC CGCAACTCCG
ATTGACAATA ACACAGTCAC AGGGCGCTTG AGTGATGGTT GGAAACGAGC GACACTTGAT
ATCCAAGCAT TCCGTGGTCA GGTTGTCAAA CTCAAAATTG TGAATGATTG GTGGCCCAAC
GAGCCCCAAA CAACCGCAAT TGATAGCTTT AAGTTAAATC GGGCTGTGCC AGGGTGGGAA
GTCACCAATG CCAATTATGT CTCGATCGAG AGCTTGCAGA TTCCGCCAAG TACACTAACG
AATGTATTAA CCAACACCGA TTTTGAAATT GGTTTTACTC CTATACCAGA CATTATCCCT
AATCAAACAT TTGAGCTTCC AAGCAATCCG CTTCAAACCC TCAGTTTTTC AACAATGAGC
TTGAATGGTA CGAATGTGGT AACTACGACT CCCCCAATCG TTGTGCCTGA ACAAGCCACC
AGTTTACAAT TTGAGGCATT AATTGGCGAT AGTAGCAATC CAAGCTTGAT TAAACCAGTG
ACGGTCGCGA TCCTCAGTGG TGATGCATTT GATATTCGCG AATTGCCGAT TGATCATCAA
ATACGTGGCA CAATCCAAAC CGGCTTACAA ACAGCAGTAA TTGATATCAA ACGCTATCAA
GGCAAAACGA TTAAGCTTCA GTTTACCAAT CACACAACCA ATGCGCCAAC GAGCCAATTG
AGCAATTTTC GTTTAGTTGA TCATGTGCCG CAATGGCAAG CAAATAGCCA AACCAGGTTA
AACTTAATCA ATGAATCATC AGCCAATCCA ACCCATGCCT TTTTGGTTGG AACCCAAAGT
AGCCTTCTAT CGGCCCCATT TACGCTACCC ACTGAGGCTC AACAAATTCG TTTTGAGTAT
CGTACTGGCC ATACAGATAA TGCCACGCGC CAAAGTCGGA TTCAATTAAC CGTCTTAAGC
GGTCCAGACT TTGGGATTCG CACGCGGATT GATCAGAATC GGCTTGTTGG AACTGATGTG
ATCGGTTGGC AATCAATTGC ATTTGATCTC CAACGATTTC AAGGCATGCC AATTAAACTC
GAATGGGTGA CTGAGTTAAC CAATCAACCA TATCTGCGCT TGGATAATCT CCAGGTAGGG
GTGGCAATGA CTGGTTGGCA AGCCAGTGAG TCCAGTGATA TTCTGATTGA GCCAACAACC
CCGACGTTAG GTCAAAGCAT GCGGATTAAT GGAAATGCTG CAACCATTAC GAGCCAGCCA
TGGACTGTAC TCAGTAATAC CGTCAGTCTT AGTTTCGATT ATAAAGTGCT AAGAATTAAC
GAAACCGGCA ACGCGAACTT GTATGTGGAT GTCTTGAGTG GTCATAACTT TGAGGTGATA
ACCCGCATTG ATGCGAATGG CTTGGTTGGC TCGATCACTA CCCCCAATAA TGGATGGCAA
CGCGCAACAT TAAATGTATC CCAATTCCAA GGGCGGACGA TTAAGCTGCA ATTTAAAAAT
GCGGGATATG CGATGGCTCA ATCGTGGATT GATAATCTGA CGCTCAATCA CGGGCAACCA
AGCGCGAGCC ATGGTTCTGA TGAAGCCCCT GACGGCAGTT TCCTAACACT GCTGAATACT
GGTACTGCCC AATCTGCCCT CTCATCGAGC TTTGTGGTTG CTACGGATAC CCAATTTTTG
CGCTTCGAGT ATCAAACCGG GACATTTGAG CATGGCAACG AGCAACGCTC GTTTGTCGTC
GATATTCTCT CAGGCAATAA TTTTGCAACT ATCACCACGA TTAACCAATC CTTGCCAAGC
CGTTCGTTAA ATGATGGCTG GCAAGTTGCC AAACTACCAA TCAGTCAGTT CCAAGGGCAA
ACGGTCAAAC TACGGCTAAC CATGCCTTTT GTTACCAAGC GTTCGGTTGT TCGCATTGAC
AAACTGGCGT TGTTGAGCCC GCGAGCGCAG TTGACCACGC CAATCGCTGT TGATGGTACA
ACCTATTTAA ATGTACCACT GACTGAATTA GGTGGCATTA CGACAACCGC TAGCATGACC
ATTACCGCGC TGGTTGTTTA TGATGAGTAT GTCGATTTGG AGGGTATAGT ACGATATGAT
AATGCTAGCT ATCAATTAGT ATCGACCGGA ACGACATATC GTTCTATGTT GGGCAGTCCC
AACGATAAAG TAGTGGATAG TATTGATCAA TCGAATACTT TTAATCTGCT CCACTTTGCC
GTACGAGATA ACCTGCCAAC GACGGCATCT CTTCAGCAGA TGGCTATTGC AGATCCTGTA
ATTGCACTCT ATCTCCAGAA GAAGAATACT CGTCAATTAA CCGCCTTTGA ATTCAGTATT
GATAATACGC TTTTGATTAA TGAGCTAGCG AGCGCAATGG CACGGTACGA TACTGATTAT
TACCATGATA TATGGTTTAA GAAAATAATT AAACCCTATT TAATTGAAGA TATTGATATT
ACAGACTATT CAGCTAATAC CAGCAATCAT TCAGAACTAT TACTGAAACA TAGGGTAAGA
ACAAAAAATA TTGCCTGTGA AGTTATCGAA GAAGTTGTTT ATGCTATCGT TTTTAATGGG
GACGCTGATA TTGAAAAGCA TCAGGAGCAT GATATTTATT CAAGTATTGA AATTATGTAT
GAGCAAAGTA GTGGTAAACC AATTAATCCA AATAGTAATC CAGCAATATG TGATGGTTAC
TATGATTTAC CAAGTAGCTC ATCAATTCAG ATAGGAACAA TTTATCCTAA CCCGAAGCAG
ACATTGAGTG TGAAAATTGC GACATATGGA ACCGAAACAA ATGGAGAAAA AGCCGATTAT
CTTACCAGAG GACGATATAA TGGTAGCTTT TATGTAGAAA AGAAAAAATA TACACCTCAA
ATTTATCTGA ATGCCGCTGT TTCTCTTCCT AAAGTTCCAA TGTTGGCGTT TCTTACTGGC
AATATAGCAT CAACTGTTGT TTCCGAAGGT CGAGAGGAAT TTGTAGCCAA AAACCAAGAA
AAGGTATTTA AGCATGTATT TAAAGGGATT ATTGATCTTA ATAGTTTTGC TACCCGCTCT
GCAAATCATT CATTTGATTA TGCAGTACTT GATACTGCTG GATATGATGC TAATGCTTTT
TGGGGGATTA AGCACTGGAA GGGTGGTCGA TTCAGTAGTA GTGCTATGGT TTCTTGGCAT
ATCCCTTTTA CTAATAATAT CTATGATGCT GAAAGCCATT TTTCCGAAAC AAATCATACA
ATCATTATCT ATTTTATGAG CGGTAATTAA
 
Protein sequence
MRKRMRPKSR LLLLFIGFAS LLSSLPTTLT AAASPHSITT TNASDDYQIL ISGQTLTELS 
IEPATDHDYF YIDVNAGQTI TVVMVRTSAD QALNGLLDLF DPTGAMIASD DNSGHRGNPL
IQEVEAITTG RYMLRVRDYS GTRTGTYQIT ATVAEPLVSG PSFAITDLET DPCPLPMAAI
TVPEIADPTT GPKNPQFEIG NQALPRTGYP LNCEFARDRQ ILATAITPTA FTIQNAGQVQ
TTDAFVVDSS AQYLLFDYMV GRKTADKPTR LHVEVLSGPT FATITDLSQH TIRGQYLDGW
KRGVLAVDAF RGQTIKLRFI NDSSSAAQPS AQVRAIKLSI EVPDWQPSPV GTTAIEYNDS
MGAHAVITGA SAFLISAPFT IPLQTQSLSF NYQTGRRLNN ASAPIEVQVL NGPDFITATP
IDNNTVTGRL SDGWKRATLD IQAFRGQVVK LKIVNDWWPN EPQTTAIDSF KLNRAVPGWE
VTNANYVSIE SLQIPPSTLT NVLTNTDFEI GFTPIPDIIP NQTFELPSNP LQTLSFSTMS
LNGTNVVTTT PPIVVPEQAT SLQFEALIGD SSNPSLIKPV TVAILSGDAF DIRELPIDHQ
IRGTIQTGLQ TAVIDIKRYQ GKTIKLQFTN HTTNAPTSQL SNFRLVDHVP QWQANSQTRL
NLINESSANP THAFLVGTQS SLLSAPFTLP TEAQQIRFEY RTGHTDNATR QSRIQLTVLS
GPDFGIRTRI DQNRLVGTDV IGWQSIAFDL QRFQGMPIKL EWVTELTNQP YLRLDNLQVG
VAMTGWQASE SSDILIEPTT PTLGQSMRIN GNAATITSQP WTVLSNTVSL SFDYKVLRIN
ETGNANLYVD VLSGHNFEVI TRIDANGLVG SITTPNNGWQ RATLNVSQFQ GRTIKLQFKN
AGYAMAQSWI DNLTLNHGQP SASHGSDEAP DGSFLTLLNT GTAQSALSSS FVVATDTQFL
RFEYQTGTFE HGNEQRSFVV DILSGNNFAT ITTINQSLPS RSLNDGWQVA KLPISQFQGQ
TVKLRLTMPF VTKRSVVRID KLALLSPRAQ LTTPIAVDGT TYLNVPLTEL GGITTTASMT
ITALVVYDEY VDLEGIVRYD NASYQLVSTG TTYRSMLGSP NDKVVDSIDQ SNTFNLLHFA
VRDNLPTTAS LQQMAIADPV IALYLQKKNT RQLTAFEFSI DNTLLINELA SAMARYDTDY
YHDIWFKKII KPYLIEDIDI TDYSANTSNH SELLLKHRVR TKNIACEVIE EVVYAIVFNG
DADIEKHQEH DIYSSIEIMY EQSSGKPINP NSNPAICDGY YDLPSSSSIQ IGTIYPNPKQ
TLSVKIATYG TETNGEKADY LTRGRYNGSF YVEKKKYTPQ IYLNAAVSLP KVPMLAFLTG
NIASTVVSEG REEFVAKNQE KVFKHVFKGI IDLNSFATRS ANHSFDYAVL DTAGYDANAF
WGIKHWKGGR FSSSAMVSWH IPFTNNIYDA ESHFSETNHT IIIYFMSGN