Gene Haur_4827 details


Gene Information

Locus tag: Haur_4827
Symbol:
ID: 5736672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6154274
End bp: 6156031
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 51%
IMG OID: 641281992
Product: peptidase M61 domain-containing protein
Protein accession: YP_001547585
Protein GI: 159901338
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
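
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(6154274, 6156031)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 1758 585
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields in the record.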


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGTGT CTGTTGTCCT TGATTACACC CTCGCGATTC CTCAACCTCA ACGCCATCTC 
ATCAACGTTA CATTGCGGAT TGAGGGTTTA ACGGGTTCTG AAACGTCGTT GCAATTGCCA
GCATGGACAC CAGGCTCGTA TCTGCTCCGC GAATATGCCC GCCACGTGCG CTTTGTCGAG
GCTCATACCG ACGGCCAAGC GCTAGCCATC CACAAAACTG ATCGCTTGAC ATGGCAGGTG
CAAACCAATG GGGCCAGTGC GATCACGGTA ACCTATCAAG TATATGGCTA TGAATTGACC
GTGCGCACCA ATCACATTGA TGCAACGCAC GCGCATATTG TGCCAGCCGC CACCTTGCTT
TATTTGCCCG AATATCAGGA TTGTCGCTAT AACGTACATC TGGAACTGCC AAGTAGCTGG
GAAGTTGCCA CGGGCTTGCC CAAATTGGCC GATGGTTGTT ATGGTACCAA CGATTTAGAT
GAATTGATGG ATTGTCCGTT TGAGGTCGGT GAATTGCGCC GCTACCGCTT TGACGTAGCC
GAAAAGGCTC ACGAGGTGGT GGTTTGGGGC CATGGCAACG AAGATATCGA GCAAGTGCTG
GCCGACACGA AGCAAATTGT CGAAACTGAG TATGCCTTCT GGGGCGATTT ACCCTACGAT
TATTATTTGT TTATTGTGTT GCTAGCTGGA GCCAATACTT ATGGCGGCTT AGAGCATCGT
AATTCAACCT CGCTCTTGCT ACCGCGCCAT GTCTTCAAAC CAAGCAAAAC CTATGAACGC
GCTCAAGGCC TAATTGCCCA TGAATTTTTT CATACCTGGA ATGTCAAACG GCTACGGGCT
GCCCCACTCG GCCCCTTCGA TTACACCCGC GAGAATTACA CTCGCTTGTT GTGGGTGATG
GAAGGCTTCA CCGAATACTA CACCGATTTG ATGTTGGTTC GCGCAGGCTT GATGACTCCC
CAACGCTACC TTGAGCGCTT GGCCGATGAT ATTAGCACGC TGCAAAATAC TCCTGGCCGC
TTGGTTCATA GCCTCAGCAG TTCCTCATTC GATGCTTGGA TCAAGTTCTA TCGACCTGAT
GAATCAACGC CTAACACGAC AGTTTCCTAT TATCTCAAGG GTGGGTTGGC GGCATTGGTG
CTCGATATGC AGTTGCGCGA GCAGAGTAAC GGTCAGCAAT CGCTTGATGA TTTGATTCGC
TATCTCTATC AGACGTATCC AATCACTGGC CCGGGGATTC CCGAAGCCGA TGGCATGCAG
CAAGCCCTGC AAGCACTCAC AGGCAGCGAT TGGAGTGACT ATTTCGCCAA GTATATCGAT
GGATTAAGCG AATTACCCTA CGCCGAAGCC TTTGCCACCG TTGGCTTGCA AATGCAATGG
AACTACAAAG ATCGTGATGC GCAAGGCAAT CCACGGCCCC AATTAGGGAT TCGCAGCAAA
GCTGTTGATG GTCGGGTGCA AATTACCCAT GTGCTCGATG GCGGCGACGC TGATCGGGCT
GGCTTGGCCG CTGGCGATGA ATTAATTGCG CTCGATGGTT GGCGCATCGA TGAAGATGGC
TTGAACAAGC GGCTAGCCGA TTATCCAATT GGCGCAACCG TGCAATTAAG TTTCTTCCGC
CGTGATGAAT TATTGCACGT GCCCGTTACA TTTAGCCAAC CCAACCCTGA TTTCTTGAGC
CTAACCTTGG TGAGCCAACC AAGCGCCAGC CAACGCCAAC AAGCAGCGAC ATGGCTCGGT
ACGCCATTGT TTAAATAA
 
Protein sequence
MPVSVVLDYT LAIPQPQRHL INVTLRIEGL TGSETSLQLP AWTPGSYLLR EYARHVRFVE 
AHTDGQALAI HKTDRLTWQV QTNGASAITV TYQVYGYELT VRTNHIDATH AHIVPAATLL
YLPEYQDCRY NVHLELPSSW EVATGLPKLA DGCYGTNDLD ELMDCPFEVG ELRRYRFDVA
EKAHEVVVWG HGNEDIEQVL ADTKQIVETE YAFWGDLPYD YYLFIVLLAG ANTYGGLEHR
NSTSLLLPRH VFKPSKTYER AQGLIAHEFF HTWNVKRLRA APLGPFDYTR ENYTRLLWVM
EGFTEYYTDL MLVRAGLMTP QRYLERLADD ISTLQNTPGR LVHSLSSSSF DAWIKFYRPD
ESTPNTTVSY YLKGGLAALV LDMQLREQSN GQQSLDDLIR YLYQTYPITG PGIPEADGMQ
QALQALTGSD WSDYFAKYID GLSELPYAEA FATVGLQMQW NYKDRDAQGN PRPQLGIRSK
AVDGRVQITH VLDGGDADRA GLAAGDELIA LDGWRIDEDG LNKRLADYPI GATVQLSFFR
RDELLHVPVT FSQPNPDFLS LTLVSQPSAS QRQQAATWLG TPLFK
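
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the spacing, compute GC content, and translate the nucleotide sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Standard genetic code laid out in TCAG order; assumed to match the
# codon-to-amino-acid assignments of NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last blocks of the gene sequence shown above.
first_block = "ATGCCTGTGT CTGTTGTCCT TGATTACACC"
last_block = "ACGCCATTGT TTAAATAA"
print(translate(first_block))  # MPVSVVLDYT
print(translate(last_block))   # TPLFK
```

The two translated fragments match the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.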