Gene Haur_2675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2675 
Symbol 
ID5734540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3429561 
End bp3431936 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content50% 
IMG OID641279817 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001545441 
Protein GI159899194 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.300574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGAC GCATCAGCTT ACTGCTGGCT TTGTTGGCGA TCTTGTTTAC GACAATGCCT 
GCCTCGGCAG AAATTGTCAA TGTCCCTTAC TTTGAAGAAC CGGAATTGCT GCTTGCTGAA
GATGAGTTGC TCAAACTCGC CCAACTGCAA GACAACACCT ACCCCAGTGT TGGCACCAAA
ATCAGCCCCG ATGACACGAC GGTGGTCATT GGCAACTATC GTTATAGCGA CACTGGTTCA
GCTTTCTTGA ATGTTGTTGA TGGCTCGATT GTGCCAATTC AGCCATTACA ACTCCCCGAA
GATAGCGATT TCTTCCCTTT GGCAGCCACC GAAATGGTTT GGCTTGATAA TGACAACATT
GGTCAAGTCT TATACGACCT GTTTATGGGT GGGATGGTGC TGTCCATTAA TCGCTACAGT
GGCCAAATTA GCTTGTATCC AGTTAATTTG CCATTCTTAC CGTTATCGAT TGCGCCCAAT
GGCTCGCGCT TGTTGGTGGT CACCTTCGAA GCATCCGAAC TTGAAGCAAT GCGCCAATCG
CCCGATTCGG TGAAGTTGCC ATTCAACATC GAAGCCCCAA AAACCACCAT GGAACGCACC
ATGCCCAAGG ATCGGATTGC CTATTACAGT CATACCGATT CACGCCGCCA CATGAGCGAA
GAAACCCTTG ATTTGGCAAT CTTCGATCTA ACGACTGGCG CATTAACGCC ACTCTACAGC
GTGCCTGATC ATACCTTGCT CTACGATTAT GCATGGTCAA AAGATGGCTC AAAATTCGCT
TTGATTCGCG ATACGGTGAT TTTGGGCGAA GGTTTTGGTG AAAAGCGCTT GGTTGACGTG
ATGACCCAAG ATGCACTTGG CGGTCTTTCA CCCAAGGATA ACCCACTCTT CACCGAAAAT
GTGCTCGATA TCTTCGATCT GACGACTGGC AACTTCCAGC CAGAAGCATG GCGAGCGGTT
GATGGCGATG GTCGCGTTGT TCGCGACATC GAATGGAGCA CCGACGGCCA ACGCTACATC
GTGCGCTTGG AACGTCCAGC TCAAATCGCT GGTCGCCCAC ACCCAACCTA TATCTTCCCA
GATATGGCCA GCTATCAATT CCGTAGCGTC GATGGCACGT TGCAACGCGA ATTGTATGCA
CCTGAATTGC AAACCCCTGA AGCTTCAGGC TTCTTCTATC TCTCACCAGA TGAAGTCTTG
TTCATCACTG CCAATGGCAC CAACCAAGCC TTGTACTACT TCAACCAAGG CTCAGGCGAG
TTCCGCAAAG TGTCGAACAT GGATGGCACC TATTTTGGTG TAACCACCAC GAATATGAGC
CGTCAGTTGA TCTTCAGCTA CATGTCGTTC AGCCAGCCAG CCGATATTTA TCGTTTGAAC
TGGGATGGCC AAGCGCTCAG TCGCTTGACC TGGGCCAACG CCGAGCTTGA AAAAATTAAT
AATGTTCGAG TTGATAGCGT TTCGTTTACC GTCAGCAGTG GCGCACAACG CAATGGCTTC
TTAATTCAAC CTGCTGGCGC TGAGTTCCCA CCAAAAGATG TGCCAATCGT GATGTGGCAA
GAAGGTGGAC CACGCGCTAC AATGACCCAA TTCTTCGCGA CCAACACTGA AAATCCCTAC
AACCTGTTGC CAAACTTTGG CATCGCGGTG TTGTATGTGC CACTGCCTGG TCGCTTGGGC
TTCGGGCCAG AATTCTTGAA CGCCTTGGCT GATAATGACA ACTTCGGCAA GATCGATATC
GACGAAGGTG CCGAAATTAT TGGCCAAGCA ATTTCACGCG GTTGGACCTC ACAAAATAAG
GTTGGGGTAA CTGGCTGTTC ATACGGCGGC TATTTCAGCG CCCAAAGTAT CACCCGCCAC
CCAACTCGCT ACGCTGCTGC CAACCCACAA TGCACCTTGC TCAACAACGC CAATGAATTC
CACTTTGGCT TGGGGCCATT AATTGCCTAC CTCGAAGGTG GCACACCAAT GGATAAGCCC
GCTGAATATG CCGCTGATTC GCCATTGAAT CGCGCTGATC GCGTGCGCAC TCCAACCCTG
TTGTTCCATG GCGAATACGA CTTCTTGCCA GTCAAGTATG CCGTTGACTT CCATGACCAA
ATCGAAATTC AAAAGCACCG CGTCAAGTTG GTGACCTATG AACTCGAAGG CCATGGTTTG
AGCGACCCTG CCAACCAATA TCGCGCTGCC CAAGAGCAAA TCTTGTGGTT CCGCCAATAT
TTGAGCGGTA GCCCAAGTGT CGCTGCCGAG CCAGTCGTGA CCGATGCCGC AACCATGACA
GTGCCTGAAA CTACTGATGT AATCGTGTTT ACGGAAACCG CTACGTTTGC AGCACCAAGC
TTGCAATTCG GCAAGAATCT GATTACTGCT GAATAA
 
Protein sequence
MRRRISLLLA LLAILFTTMP ASAEIVNVPY FEEPELLLAE DELLKLAQLQ DNTYPSVGTK 
ISPDDTTVVI GNYRYSDTGS AFLNVVDGSI VPIQPLQLPE DSDFFPLAAT EMVWLDNDNI
GQVLYDLFMG GMVLSINRYS GQISLYPVNL PFLPLSIAPN GSRLLVVTFE ASELEAMRQS
PDSVKLPFNI EAPKTTMERT MPKDRIAYYS HTDSRRHMSE ETLDLAIFDL TTGALTPLYS
VPDHTLLYDY AWSKDGSKFA LIRDTVILGE GFGEKRLVDV MTQDALGGLS PKDNPLFTEN
VLDIFDLTTG NFQPEAWRAV DGDGRVVRDI EWSTDGQRYI VRLERPAQIA GRPHPTYIFP
DMASYQFRSV DGTLQRELYA PELQTPEASG FFYLSPDEVL FITANGTNQA LYYFNQGSGE
FRKVSNMDGT YFGVTTTNMS RQLIFSYMSF SQPADIYRLN WDGQALSRLT WANAELEKIN
NVRVDSVSFT VSSGAQRNGF LIQPAGAEFP PKDVPIVMWQ EGGPRATMTQ FFATNTENPY
NLLPNFGIAV LYVPLPGRLG FGPEFLNALA DNDNFGKIDI DEGAEIIGQA ISRGWTSQNK
VGVTGCSYGG YFSAQSITRH PTRYAAANPQ CTLLNNANEF HFGLGPLIAY LEGGTPMDKP
AEYAADSPLN RADRVRTPTL LFHGEYDFLP VKYAVDFHDQ IEIQKHRVKL VTYELEGHGL
SDPANQYRAA QEQILWFRQY LSGSPSVAAE PVVTDAATMT VPETTDVIVF TETATFAAPS
LQFGKNLITA E