Gene Haur_3115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3115 
Symbol 
ID5734987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3929497 
End bp3931308 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content50% 
IMG OID641280259 
Productoligoendopeptidase F 
Protein accessionYP_001545881 
Protein GI159899634 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR00181] oligoendopeptidase F 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.111844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATGG TTGAAGAACA GGTACCAACC CGCGAGGAAG TAAGCGCCGA AGACACTTGG 
GATATTAGTA GTCTCTATGC AGACCAAGCG GCTTGGGAGG CTGATGTTGA ACGAATTAGC
AGCGATTTGC TGCCAGCCTT GACCAATTTG CAAGGCACGC TCGCCAATGG TCCTGAGGCG
TTGTTGGCAG TGTTTCAAGC CCAAGAAGCC CTTGGCATGG TGCTCGAACA AATTTATGTC
TATGCCAGTT TACGAGCCGA TGAAGATACG GCCAACCAAC ATTACCAAGC CCTCGAAGAA
CGGGCCACCG CCCTCTCGAT TAAGGCAAGC GCCGCTACCT CTTGGATTGA GCCAGAGCTT
TTAGCCCTTT CCGATGAGCA AATTTTGGGC TATGTGAGCA GTTTGCCCGC CCTCGAACTT
TATCGCCGCG CCTTAGAAGA GCAAATTCGT TTGCGCCAAC ACACCCGCTC TGGCGAAGTT
GAAGAATTAT TGGCCCAAAC TGGCGAGATT AGCCGTGGCG CTCAAACCAC CTTCAACATG
TTTAGCGATG CTGACCTCAA ATTCCCGCCG ATTGAAGATG AACAGGGCAA GCCGCTCGAA
GTGACGATGG GCCGCTACGC AGTGTTGCTG GAAAACCCCA ACCAACGCAT TCGCCGCGAT
ACTTTTATGA GCATTCACCG CACCTATCGC CAATTTCGTA ATATGTTGGC GGCCAATTAT
GCGACCAATG TGCGCAGTAA TATTTTTTAT GCCAAAGCGC GGGGCTACGA TTCAGCCTTA
GATGCCAGCC TAAAACCCAA AGAAATTCCT ATGAGCGTCT ACGATAATTT GATCAGCACG
GTGCACGAGC ACTTGCCCAA ATTGCATCGT TATGGCGCAG TGCGCAAGCG CATTTTGGGG
GTTGATAGCC TGCATGCCTA CGATTGGTTT GTGCCATTAA ACGGCGCAGC CCCAACCAAA
ATCGACTTTG AACAAGGTGC TTCGTTGATT TTGAGCGCCT TGGAGCCACT TGGCGCTGAA
TATAGCTCCA ACCTCGGTCA TGGGCTGGAA TCGCGCTGGG TTGACCGCTA CGAAAATAAA
AATAAACGCT CAGGAGCCTA TTCATGGGGT TGTTACACCT CACAGCCCTT TATTTTGATG
AACTACAAAA ATAACTTGAA TAGTCTCTTT ACCCTGGCCC ATGAGCTTGG TCACTCGATG
CACTCGTTGA TGACCCGTAA ATATCAACCC TATACCTATG GCCATTACAC CTTGTTTGTG
GCCGAAGTCG CTTCAACTTT AAACGAAGCT TTGCTGGCCG AATATATGCT CAAAACCAGC
GATGACCCAG CCTTGCGCTT GCAATTGGTC ACCCAGCAAA TTGATGATAT TCGCGGCACG
TTGTTGCGTC AAACCTTGTT TGCCGAGTTC GAGCGCGAAA CCCATCGCAT GGTCGAGCAA
GGTGAAGCGC TAACTGCCGA TAACCTCAGT GCCTTGTATC GCCGCTTGAT CGAGCAATAC
TATGGCCCCG AATTGGTCAT CGATGAAGAA TTGGATATTG AATGGGCACG GATTCCTCAC
TTCTACCGCT CGTTCTATGT CTATCAATAT TCCACTGGCA TTTCAGCTGC CTTGGCCTTG
GCCGATAAGA TTTTGACCGA AGGCGCTGGT GCTGCTGAAA ACTACGTCAA CTTCTTGCGA
GGTGGTAATT CCAAATCATC AATCGATCTA CTCAAGGGTG CTGGGGTCGA TATGACCACC
CCCGACCCAA TTCATCGAGC CATGAATCGC TTTGGCGATT TGGTGACCAA ACTCGATGAA
TTAACCGCCT AA
 
Protein sequence
MTMVEEQVPT REEVSAEDTW DISSLYADQA AWEADVERIS SDLLPALTNL QGTLANGPEA 
LLAVFQAQEA LGMVLEQIYV YASLRADEDT ANQHYQALEE RATALSIKAS AATSWIEPEL
LALSDEQILG YVSSLPALEL YRRALEEQIR LRQHTRSGEV EELLAQTGEI SRGAQTTFNM
FSDADLKFPP IEDEQGKPLE VTMGRYAVLL ENPNQRIRRD TFMSIHRTYR QFRNMLAANY
ATNVRSNIFY AKARGYDSAL DASLKPKEIP MSVYDNLIST VHEHLPKLHR YGAVRKRILG
VDSLHAYDWF VPLNGAAPTK IDFEQGASLI LSALEPLGAE YSSNLGHGLE SRWVDRYENK
NKRSGAYSWG CYTSQPFILM NYKNNLNSLF TLAHELGHSM HSLMTRKYQP YTYGHYTLFV
AEVASTLNEA LLAEYMLKTS DDPALRLQLV TQQIDDIRGT LLRQTLFAEF ERETHRMVEQ
GEALTADNLS ALYRRLIEQY YGPELVIDEE LDIEWARIPH FYRSFYVYQY STGISAALAL
ADKILTEGAG AAENYVNFLR GGNSKSSIDL LKGAGVDMTT PDPIHRAMNR FGDLVTKLDE
LTA