Gene Haur_0544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0544 
Symbol 
ID5732402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp630304 
End bp633585 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content42% 
IMG OID641277671 
Producthypothetical protein 
Protein accessionYP_001543320 
Protein GI159897073 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGCTC AGCGTGTGGC TAATTTTGTT CAGGAATATC AACGATATAT CAATAATTCA 
GCGAATGAAG AGGAGCTTCG GAGCGGGTTT TATCTGGCTG CTACTAATGC TTTGGGTATT
CCAAATTTTA CCCTAGAACG AGGCCGGCAG GATATCCGCC GGAATCGTGT CATTCTAGAG
TTTAAGAATA AAGGCCTATT TCGAGGTCAA TCCACCAGCC TTAAATTTAA AGAGGCGCAG
GATCAACTCA TTAATAGGTA TATCCCTCAG CAATCGCTTC GTGATGGGAG AAGGCCCTCA
GATTATATTG GCGTGTGTTT TGATGGAGAA CATCTTGCCT TTACCTTTGT TGAACCAGAT
CAAACGGTTC GGGTCACGAA ACTATTGCCA TTTGATGAAC ATAGTGCTGG GGCTTTGGTT
ATGGCGCTTG ACGTTGATGA TCGGCGTGAA CTAACCCCAC AAAATGTTAT TGATGATTTT
GGTCCATCTT CCTTTATTGC GCGACAGACC CTACAAGCAT TATGGCATCA CTTAAATGTA
TCTCTTGATA TGGGTGTCAA GCGGATACAC ATGCTCTTTA CCGAATGGAA AAGTCTTTTC
CAACAAGGAA CAAGCATTGG TACACGAGGG CGACAGAAAC TTCAGGCGTA TCTTCAGTCA
GTGGGTTTAC CTGAATCTGC TGATATCACC CGTCTCCTTT TTGTCCTTCA TACCTATCAT
GCACTTTTTT TCAAATTGCT TGCGGCTGAG GTTGTGCTGA CAAATGCTAT TATCCCGGGT
ATGACCCAAA CCGATTTTTG TTTTTCTACC GCAGGACTTC CTGATCGATC CCTTATTAAG
TTGTTGGTAG ATGATATCGA GGAATCAAAA GTATTTCGCC GTGTCAACAT CCTCAATTTT
GTTGAGGGGA CTTTTTTTAC CTGGTATACA CATGAGGCTC CCGCAGGATT GATTGAAGCA
ATTCGACAGA TTATGCAGCG GCTTAACCTT TATCGGCTAA GCGATCTTCA ACTGGAACGA
ACACGCGATA TTGTGAAGTA TGTTTACGAG CAAATAGTTC CTGAACCGCT TCGTCATAGT
CTGGGGGAAT ATTTCACGCC AGAATGGCTG GTTGAGTTTA CTCTTGATCG TGTTGGATAT
CAAGGCTCCC AAATTTTGGA TCAGAAGATT CTCGATCCAT GCTGCGGCTC AGGCAATTTC
CTTATTCATG CTATTGAGCG CTATAAACAG GCGGCTCACG CCCAAGGATG GGATGATTCT
GCTATTCTTC ATGGCATTAC CAATCACATC TTTGGATTTG ACCTAAACCC TCTTGCCATG
TTAACTGCAC GGGTTAACTA TCTTATTGCG ATTTCGGATC TCTTGAAAAC TTCATCTGCG
GTTGAAATCC CTGTATATCA AGCCGATGCC GTTTACATTC CAAAGCCCCG ACCTGATGAT
CCGACTATTT ATCGCTATGA TATTTCAACC CGTCTAACAG GGTGTCCTAT ACTCGCCCTA
GATATCCCTG TATCCCTAAT TCACAAACAA CATTTGTTTG CACGAGTGCT TGAGGAGATG
GAAGAGTCGA TAAATGAGCA CCAGCATACC TCTGCATTTA TTACACATCT TAAGAGAAAT
CCTGAATTCT GTAGAGTAGA GGATCATCTT GACTGGATTC CATATCTGGA AAAAATGTTT
AATGACATCC AGTTTCTTGA ATCACGCTCA TGGAATCGGA TATGGTGTCG GATTGCGCGA
AATTACTTTG CCTCCGTTGC AATAGGGTCT TGCGATATCG TTGCTGGTAA TCCACCCTGG
GTTCGTTGGT CTGAATTACC CCAGCTTTAT GCGGAGAAAA TTAAACCGAT TTGTGATGAA
TATGGAATTT TCTCTGATGA ACGTTTTTTT GGGGGGAATG AGCTTGATAT ATCGGGGATG
ATAACCTATA CCGTTATTGA CCAGTGGTTG GATGAGGGTG GTAAGCTGGC CTTTATTCTT
CCACAAAATC ACCTGCAATC TCAGTCATCT GGTGGATTTC GTCAGTTTCA GATTCGCGAA
ACCCCGTTAG AGGTTTTACA GGTTGATGAT TTTAGCGAGG TTAAACCATT CCGCCGCGTA
GGTAATCGTC CCGCAGTGAT AACCATCTAC AAGGGGCGAG CTACAACCTA TCCTGTTCCC
TATAACATAT GGAAACGTAT AACCCCAACA ACTATCTCTG AAAATGTAAG CTATGCGACT
GCGGCTGAGA ATCTTCTATC GATACCTCAT GAAGCTTATG CATTAGAGGA CGCAGGAAAA
CGTTGGAGTA TTCTTCCGCT AGGACGATTT CCTCTCCTTT CCATGTTGAA TGGTGAGGAC
AGGTCAATTG AAGGCCGTAA GGGTATTGTT ACCGATTTGA ATGGGGCATA CTTTATTCGA
ATTCTTGGCC ATGGTTATCG GACTGGAACT CTTCGTTTTA AAACTTCTCC AGATCAGGGT
CAAAAACCTG TACCAGAGCG AACATATGAG ATTGAGGCTG ACCTTGTATA TCCTCTGATA
AAAGGGGCAA AAAATGTTCA ACCATTTTAT GCCACCACAA GCGAGTTAGC CGTAATTGTT
CCTAATAAGG GGATTAATTC ATCCTCTATG CCATCCATGT CTCGACTGAC TTATCAGGGG
TATCCACTAG CAGCACGTTA TTTCCAGTCT CTGAATCGCG ATGTGCTCAT TGATGGTGTT
GGCCTACTCG ATCAACGATC AACTTGGCGA ACCCGAATGC GGCCATTTTT GGAAAAACAA
TATGCCGATA ATCTTGCCGA TATACCCTTT TATGCAATCT ATAACGTAGG AGACTATACA
TTTGCTCCCT ATAAGGTTGT ATGGGCAGAA ATGTCTGGAA GTCTTAAGGC GGCAGTGATT
TCTGATGGTC TTGTACCCTA TATTGAGCAG CGAAAGATTA TTATCCCTGA TCATAAAATA
TACTTTGCAT CCTTTTCGAG TGAAAAATAT GCTCACTTTA TTTGTGCGCT ACTCAATTCA
TCAATTATAC GTGAATTTGT GGATAGCTTC ACTATAAAAT TGCAGGTTGG AACAATATTT
AAGCATCTTC GATTGCCTAA GTTTGATTTA GAAAATGCCG AACATCTATC ACTCGTTGAC
TTATCCATTA CGGCACATAA CACTATGAAT AAAACAAATG GCAGTGGTAA TATTGAAGCA
TTTATAGAAC AAATTGATAT CATTGCTGTA AGATTATTAG CTAAATTTGC TGCCACTATT
ACCGATCAAC TACAACAAGA AGGGTTTAAT TTAGAATTTT AA
 
Protein sequence
MYAQRVANFV QEYQRYINNS ANEEELRSGF YLAATNALGI PNFTLERGRQ DIRRNRVILE 
FKNKGLFRGQ STSLKFKEAQ DQLINRYIPQ QSLRDGRRPS DYIGVCFDGE HLAFTFVEPD
QTVRVTKLLP FDEHSAGALV MALDVDDRRE LTPQNVIDDF GPSSFIARQT LQALWHHLNV
SLDMGVKRIH MLFTEWKSLF QQGTSIGTRG RQKLQAYLQS VGLPESADIT RLLFVLHTYH
ALFFKLLAAE VVLTNAIIPG MTQTDFCFST AGLPDRSLIK LLVDDIEESK VFRRVNILNF
VEGTFFTWYT HEAPAGLIEA IRQIMQRLNL YRLSDLQLER TRDIVKYVYE QIVPEPLRHS
LGEYFTPEWL VEFTLDRVGY QGSQILDQKI LDPCCGSGNF LIHAIERYKQ AAHAQGWDDS
AILHGITNHI FGFDLNPLAM LTARVNYLIA ISDLLKTSSA VEIPVYQADA VYIPKPRPDD
PTIYRYDIST RLTGCPILAL DIPVSLIHKQ HLFARVLEEM EESINEHQHT SAFITHLKRN
PEFCRVEDHL DWIPYLEKMF NDIQFLESRS WNRIWCRIAR NYFASVAIGS CDIVAGNPPW
VRWSELPQLY AEKIKPICDE YGIFSDERFF GGNELDISGM ITYTVIDQWL DEGGKLAFIL
PQNHLQSQSS GGFRQFQIRE TPLEVLQVDD FSEVKPFRRV GNRPAVITIY KGRATTYPVP
YNIWKRITPT TISENVSYAT AAENLLSIPH EAYALEDAGK RWSILPLGRF PLLSMLNGED
RSIEGRKGIV TDLNGAYFIR ILGHGYRTGT LRFKTSPDQG QKPVPERTYE IEADLVYPLI
KGAKNVQPFY ATTSELAVIV PNKGINSSSM PSMSRLTYQG YPLAARYFQS LNRDVLIDGV
GLLDQRSTWR TRMRPFLEKQ YADNLADIPF YAIYNVGDYT FAPYKVVWAE MSGSLKAAVI
SDGLVPYIEQ RKIIIPDHKI YFASFSSEKY AHFICALLNS SIIREFVDSF TIKLQVGTIF
KHLRLPKFDL ENAEHLSLVD LSITAHNTMN KTNGSGNIEA FIEQIDIIAV RLLAKFAATI
TDQLQQEGFN LEF