Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1817 |
Symbol | |
ID | 5733675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2113344 |
End bp | 2115137 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641278960 |
Product | peptidase M3A and M3B thimet/oligopeptidase F |
Protein accession | YP_001544588 |
Protein GI | 159898341 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000237555 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACCA CCCAAAGTTT TCTTGTAGCC CGTTGGATAC GAGACGATAT TCTACCAACC GATACTGAAA CGAATACATA CCAATCGTAT CAGCAAACAA TTGCCGATCT TGATCGTTGT GTGGCACAAT TCGAACTCCT GCGTTCATCC CTCGATAAGT CCCTTTCGTC CGAGGCGGTG CTCCAAGCTA TTCGCGATTT TGAAACGATT ACCACCTTCA TAAAACGACT TAGCGGGTAT GCCGAGCTTT GGATAGCAGA GGATACGCAA AATCCGCATG CCCAAGCATG CGCAACCCTG ATTGATATTG TGATTACAAA AGCAACGAAT AAAACACTTT TTTTTCCCCT ATGGTGGAAG AATTTGCCTG AGGATGTGGC TGCATCGATT CTTGGAGACA TCCCCCAGTA CGCCTATTGG TTGCGACAGA TGCGAAGTGC TGTTATCCAC ACGCTGCCAG AACCTGTCGA GCAAGCTATT AATCTCAAGA ACAGTACTGG TGTCACAGCG CTTCGTGCCT TGTACGATGC GATAACGAGT CGGTATAGCT TTACCTTAGA AGCCGATGGA CAGATTCACC ACTTAACTGA TAGTGGTATT TGGGGCTATG CATCGCATCC TGATCCTGCT GTTCGCGATC GTGCTTTTGT TGAGCTGTAC CGCGTTTACA GCCAAGATGC CTCGCTGCTT GGTCGTATTT ATTTCACCCT CGTGCAGGAT TGGTATCAAG AATATATTGA ACTGCGTCGT TATAGCAGTC CCTTAGCGGT ACGAAATCAG ATGAATGATA TCCCCGACGC GCTGATTGAA ACCCTCTTTC GCGTTTGCCG CACCAATACC CCACTCTTTC ACCGCTACTT CCGATTAAAA GCACGTTGGC TTGGAATGGA ACGAATGCGC CGTTGTGATT TAGCGGCCCC GATTATTACC AAGAAACAAT CCTATACCTG GAAACAAGCG GTTGAGATGA CACTTGCAAC CTTTGAGTCC TTTGATCCGC TTTTTTACCA GCTTGCGCAT CGCATTTTTC AAGCAAATCA TGTGGATAGT GAAGTGCGGA GTGGTAAGCG TCGTGGTTGT TGGTGTTTAG ATTTTGGACC TACGATAACT CCATGGGTTC AGATTGACTT TAATGGTCGC GTCGATGACA TTGCTGCCCT CATACATGAG TTTGGGCATG CTATTCATGG CATGCTCGCT GAACACCAAT CGGTATTGCA ATATGCTCCA TCAATTCCCC TTGCTGAAGT TGGAGCATTG TTTTGTGAGT TACTTTTTGC CGATCATCTC TTACAACAAA CCCATGATCC TGAGGTACGG ATTGGGTTAC GATTTAAACA GTTGAACGAT TCTTTTGCCT TTCTCCATCG TCAAATCTAT TTTACCTTTT TTGAATGCAC AGCCCATGAT TTGATTCAGC AGGGTGCTTC GATTGACGAT GTTGCCCAAG CCTATCTTGA TACGGTCAGA GAGGAATTTG GTGATACCAT TGACATTCCT GATGCAATGC GCTGGGAGTG GACATTAATT TTCCATCTCT TCCATTATCC ATTCTATATG TATAGTTATG CATTTGGACA ACTGCTTGCT TTAGCGCTCT ATCAGCAATA TCGCCAAGAA GGAAATTCTT TTAAAGACCG CTTCTTTGAA ATATTGAGGG CTGGAAGTTC TGATCATCCT GTCGCCATTT TGTCTAAGGC CGGGGTTAAT ATTGCTGATC CACTATTTTG GCAAGGTGGG TATGATGTGA TTCAGATAAT GCTCGAGGAT ATTGAACAGA TACCAATTCC CTAA
|
Protein sequence | MNTTQSFLVA RWIRDDILPT DTETNTYQSY QQTIADLDRC VAQFELLRSS LDKSLSSEAV LQAIRDFETI TTFIKRLSGY AELWIAEDTQ NPHAQACATL IDIVITKATN KTLFFPLWWK NLPEDVAASI LGDIPQYAYW LRQMRSAVIH TLPEPVEQAI NLKNSTGVTA LRALYDAITS RYSFTLEADG QIHHLTDSGI WGYASHPDPA VRDRAFVELY RVYSQDASLL GRIYFTLVQD WYQEYIELRR YSSPLAVRNQ MNDIPDALIE TLFRVCRTNT PLFHRYFRLK ARWLGMERMR RCDLAAPIIT KKQSYTWKQA VEMTLATFES FDPLFYQLAH RIFQANHVDS EVRSGKRRGC WCLDFGPTIT PWVQIDFNGR VDDIAALIHE FGHAIHGMLA EHQSVLQYAP SIPLAEVGAL FCELLFADHL LQQTHDPEVR IGLRFKQLND SFAFLHRQIY FTFFECTAHD LIQQGASIDD VAQAYLDTVR EEFGDTIDIP DAMRWEWTLI FHLFHYPFYM YSYAFGQLLA LALYQQYRQE GNSFKDRFFE ILRAGSSDHP VAILSKAGVN IADPLFWQGG YDVIQIMLED IEQIPIP
|
| |