Gene Haur_1817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1817 
Symbol 
ID5733675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2113344 
End bp2115137 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content45% 
IMG OID641278960 
Productpeptidase M3A and M3B thimet/oligopeptidase F 
Protein accessionYP_001544588 
Protein GI159898341 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000237555 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACCA CCCAAAGTTT TCTTGTAGCC CGTTGGATAC GAGACGATAT TCTACCAACC 
GATACTGAAA CGAATACATA CCAATCGTAT CAGCAAACAA TTGCCGATCT TGATCGTTGT
GTGGCACAAT TCGAACTCCT GCGTTCATCC CTCGATAAGT CCCTTTCGTC CGAGGCGGTG
CTCCAAGCTA TTCGCGATTT TGAAACGATT ACCACCTTCA TAAAACGACT TAGCGGGTAT
GCCGAGCTTT GGATAGCAGA GGATACGCAA AATCCGCATG CCCAAGCATG CGCAACCCTG
ATTGATATTG TGATTACAAA AGCAACGAAT AAAACACTTT TTTTTCCCCT ATGGTGGAAG
AATTTGCCTG AGGATGTGGC TGCATCGATT CTTGGAGACA TCCCCCAGTA CGCCTATTGG
TTGCGACAGA TGCGAAGTGC TGTTATCCAC ACGCTGCCAG AACCTGTCGA GCAAGCTATT
AATCTCAAGA ACAGTACTGG TGTCACAGCG CTTCGTGCCT TGTACGATGC GATAACGAGT
CGGTATAGCT TTACCTTAGA AGCCGATGGA CAGATTCACC ACTTAACTGA TAGTGGTATT
TGGGGCTATG CATCGCATCC TGATCCTGCT GTTCGCGATC GTGCTTTTGT TGAGCTGTAC
CGCGTTTACA GCCAAGATGC CTCGCTGCTT GGTCGTATTT ATTTCACCCT CGTGCAGGAT
TGGTATCAAG AATATATTGA ACTGCGTCGT TATAGCAGTC CCTTAGCGGT ACGAAATCAG
ATGAATGATA TCCCCGACGC GCTGATTGAA ACCCTCTTTC GCGTTTGCCG CACCAATACC
CCACTCTTTC ACCGCTACTT CCGATTAAAA GCACGTTGGC TTGGAATGGA ACGAATGCGC
CGTTGTGATT TAGCGGCCCC GATTATTACC AAGAAACAAT CCTATACCTG GAAACAAGCG
GTTGAGATGA CACTTGCAAC CTTTGAGTCC TTTGATCCGC TTTTTTACCA GCTTGCGCAT
CGCATTTTTC AAGCAAATCA TGTGGATAGT GAAGTGCGGA GTGGTAAGCG TCGTGGTTGT
TGGTGTTTAG ATTTTGGACC TACGATAACT CCATGGGTTC AGATTGACTT TAATGGTCGC
GTCGATGACA TTGCTGCCCT CATACATGAG TTTGGGCATG CTATTCATGG CATGCTCGCT
GAACACCAAT CGGTATTGCA ATATGCTCCA TCAATTCCCC TTGCTGAAGT TGGAGCATTG
TTTTGTGAGT TACTTTTTGC CGATCATCTC TTACAACAAA CCCATGATCC TGAGGTACGG
ATTGGGTTAC GATTTAAACA GTTGAACGAT TCTTTTGCCT TTCTCCATCG TCAAATCTAT
TTTACCTTTT TTGAATGCAC AGCCCATGAT TTGATTCAGC AGGGTGCTTC GATTGACGAT
GTTGCCCAAG CCTATCTTGA TACGGTCAGA GAGGAATTTG GTGATACCAT TGACATTCCT
GATGCAATGC GCTGGGAGTG GACATTAATT TTCCATCTCT TCCATTATCC ATTCTATATG
TATAGTTATG CATTTGGACA ACTGCTTGCT TTAGCGCTCT ATCAGCAATA TCGCCAAGAA
GGAAATTCTT TTAAAGACCG CTTCTTTGAA ATATTGAGGG CTGGAAGTTC TGATCATCCT
GTCGCCATTT TGTCTAAGGC CGGGGTTAAT ATTGCTGATC CACTATTTTG GCAAGGTGGG
TATGATGTGA TTCAGATAAT GCTCGAGGAT ATTGAACAGA TACCAATTCC CTAA
 
Protein sequence
MNTTQSFLVA RWIRDDILPT DTETNTYQSY QQTIADLDRC VAQFELLRSS LDKSLSSEAV 
LQAIRDFETI TTFIKRLSGY AELWIAEDTQ NPHAQACATL IDIVITKATN KTLFFPLWWK
NLPEDVAASI LGDIPQYAYW LRQMRSAVIH TLPEPVEQAI NLKNSTGVTA LRALYDAITS
RYSFTLEADG QIHHLTDSGI WGYASHPDPA VRDRAFVELY RVYSQDASLL GRIYFTLVQD
WYQEYIELRR YSSPLAVRNQ MNDIPDALIE TLFRVCRTNT PLFHRYFRLK ARWLGMERMR
RCDLAAPIIT KKQSYTWKQA VEMTLATFES FDPLFYQLAH RIFQANHVDS EVRSGKRRGC
WCLDFGPTIT PWVQIDFNGR VDDIAALIHE FGHAIHGMLA EHQSVLQYAP SIPLAEVGAL
FCELLFADHL LQQTHDPEVR IGLRFKQLND SFAFLHRQIY FTFFECTAHD LIQQGASIDD
VAQAYLDTVR EEFGDTIDIP DAMRWEWTLI FHLFHYPFYM YSYAFGQLLA LALYQQYRQE
GNSFKDRFFE ILRAGSSDHP VAILSKAGVN IADPLFWQGG YDVIQIMLED IEQIPIP