Gene Haur_2468 details

Gene Information

Locus tag: Haur_2468
Symbol:
ID: 5734349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3155562
End bp: 3157448
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 51%
IMG OID: 641279608
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001545234
Protein GI: 159898987
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
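
The coordinates and lengths listed above are mutually consistent, which can be checked with a short calculation. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon, which is not counted in the protein length.

```python
# Values copied from the Gene Information record above.
START_BP = 3155562
END_BP = 3157448


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1887
print(protein_length(length_bp))  # 628
```

Both results match the Gene Length (1887 bp) and Protein Length (628 aa) fields.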


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACGCA TCGAAGCCTT GCTTGCCGCT CGTCAATTTG TTGTTCCACA ACGCGCTGGC 
GATTATCTTT ATTTTATTAG CGATCTGAAT GGCCGCCTAA GTTTATATCG GATGCTGCTG
ACTGGCAGTG TGCCTGAGCC GTTGCTACCG CCCGATATTG CCTTGCAAAC GCCGCACCAT
ATGGGCGGAA AATCGTTTGT AGTGCTGGCC GAATACAATC AAATTGTGGT TATGATCGAT
AAAGATGGCG ATGAAAACTA CCAACCGCTG CGCATTCCCC TGACTGGGGG CTTCCCTGAG
CCAGTTTTTG GCGATCAATT TGCTGATGCC CAAACCAACC TCTCCAAGCT TGACCCAAGC
ACAGGGATTG GCTATTTGAA TGTTGCTTCA CGCGTGCGGC CCGAACTTAG TTGCTATCAA
ATTAATGTGT TGACTGGCAC GAGCACTTTA CTGCATACTG GCCCAGATGG CCCATTTTAT
GCGACCTCAG CTCCCGATCA ACAAACAATT ATCACGGTCG ATGGCTATGG CATTGGTGAT
AGCGTGATTT ATCGCCAGCA GCTTGGTAGC ACCGAGCGCT CGGTAGTTTT CGGTACGCCG
ATGGATCAAC GTACAACGCC AGTTGAGCCA AATGGTATGG GCTTTGGCGA ATGGGTCAAT
GATCAGGTGG CCTTGGTGAG CACGAGCCTC TTTGATGATT GTTACAGTTT AGCCTTGTTG
CGCCTTGATG GCGCGCAAAG CTTGGATTCC GTGACGATCG AGGGCTTGGT GCATAGCGGC
CAAGGCGAAT TTGATCGCTT GTTACATCTG ACTGAGCAGC GCTTTTTGAT TGGCTACAAT
ATCGATGGCT GTTCGTGGTG CTACGAAGCA GAGTTTGATC TAGCTGGCAA ACGTATGTTG
GTTACCAAGG TTTTGGTTGG CCAAGCACCG CTCGACAATG GCGTTTTAGA GTCGATTGAC
TATGATCAAG CGAGTGATAG CTTTGCGCTT TCGTTCTCGA CTGCGATTGC TCCAACCCAA
ATCTACACGA TTAAGTCCAG CCAAGAGCTG CAACAGCACA CCACCGAACG AGTTTTGGGC
ATCCCCGTTG AGCATTTAGC GGCTGGCGAA GATGCCTCAT TCAACTCACA TGATGGCCTG
CGCATTTCGG CACGACTTTA TCGCCCAGCT CCAGCTTTGG GCTATGAAGG CCCACGCCCC
TTGGTGTATT ACATCCATGG TGGCCCGCAA GGCCAAGAAC GCCCCGATTT TGCCTGGTTC
TCGATGCCCT TGATTCAATT TTTGACCTTG AAGGGCTTTG CAGTCTTTGT GCCTAATGTG
CGTGGCAGCA GTGGCTATGG CTTTAAGTAT ATGAACCACG TTACCCACGA TTGGGGTGGC
CAAGATCGGC TTGATCATGT GCATGCCATG ACTAAGGTTT TAGTCAATGA CCCGTTGATC
GATATCAAAC GAACTGGGGT GATGGGGCGT TCGTATGGCG GGTTTATGAC CCTGACCTTG
CTGGGCCGTC ACCCTGAGCT TTGGCGAGCA GGCATCGATA TGTTTGGCCC CTACGATTTG
CACACCTTTT CGGCGCGAGT GCCTGAAACT TGGAAGAGTT ACATGGCAAC CCAAGTTGGC
GATCCTGTAA CTGAGCATGA TTTCCTAGTC GAGCGCTCGC CCAAAACCTA TATGCACAAC
TTAGCTTGCC CATTATTGGT GACTCAAGGA GCCAACGATC CACGGGTGAT TGAGCGTGAA
TCGAGCGAAG TGGTGCACGA ATTGCAAGCC TTGGGCAAAA ATGTTGATTA TCTGTTGTTC
AGTGATGAAG GCCACGATGT TTTGAAGTAT GCCAACAAAG TGACATGCTA TAACCGGATC
ACCGACTTTT TCAGCCAGCA TCTCTAG
 
Protein sequence
MPRIEALLAA RQFVVPQRAG DYLYFISDLN GRLSLYRMLL TGSVPEPLLP PDIALQTPHH 
MGGKSFVVLA EYNQIVVMID KDGDENYQPL RIPLTGGFPE PVFGDQFADA QTNLSKLDPS
TGIGYLNVAS RVRPELSCYQ INVLTGTSTL LHTGPDGPFY ATSAPDQQTI ITVDGYGIGD
SVIYRQQLGS TERSVVFGTP MDQRTTPVEP NGMGFGEWVN DQVALVSTSL FDDCYSLALL
RLDGAQSLDS VTIEGLVHSG QGEFDRLLHL TEQRFLIGYN IDGCSWCYEA EFDLAGKRML
VTKVLVGQAP LDNGVLESID YDQASDSFAL SFSTAIAPTQ IYTIKSSQEL QQHTTERVLG
IPVEHLAAGE DASFNSHDGL RISARLYRPA PALGYEGPRP LVYYIHGGPQ GQERPDFAWF
SMPLIQFLTL KGFAVFVPNV RGSSGYGFKY MNHVTHDWGG QDRLDHVHAM TKVLVNDPLI
DIKRTGVMGR SYGGFMTLTL LGRHPELWRA GIDMFGPYDL HTFSARVPET WKSYMATQVG
DPVTEHDFLV ERSPKTYMHN LACPLLVTQG ANDPRVIERE SSEVVHELQA LGKNVDYLLF
SDEGHDVLKY ANKVTCYNRI TDFFSQHL
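
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the database's own method: the helper names are hypothetical, and it assumes the space-separated 10-base blocks shown above are stripped before use. Translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), so a standard codon table is used here.

```python
# Standard codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments are the same for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon; '*' marks a stop codon.

    Codons containing unexpected characters are reported as 'X'.
    Any trailing partial codon is ignored.
    """
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE.get(s[i:i + 3], "X") for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


# First 18 bases of the gene sequence, with block spacing as displayed above.
print(translate("ATGCCACGCA TCGAAGCC"))  # MPRIEA
```

Applying `translate` to the full gene sequence and dropping the trailing `*` should give the protein sequence listed above, and `gc_percent` can be compared against the GC content field.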