Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A3039 |
Symbol | |
ID | 3624836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 3917315 |
End bp | 3918835 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637701881 |
Product | aminoacyl-histidine dipeptidase |
Protein accession | YP_306511 |
Protein GI | 73670496 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000107892 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.196612 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCCTA AAACTCAACA GATCCTCGAA GTCTTTGAAG AAATCAATAA AATCCCGAGA CGTTCAAAGC ATGAAGAACA GATTTCAACC TGGCTGCAGG AGTGGGGCCG GTCCAGAGGT TTTGAGGTAA AAGCCGATTC CTTAAACAAT GTGCTTATCA AAGTCCCTGC AACAACTGGA TATGAGGACT CTCCCACACT TATCTTGCAA GGGCATATGG ATATGGTCTG CGAGAAATGT AAAGACTCCA TACACGACTT TTCCAGAGAC CCTATAAGAT GTATTTGCGA CGGGGACTGG ATGCGGGGAG ATGGAACTTC CATAGGTGCG GATGATGGAA TTGCCCTTGC ACTTGGACTG GTAATGGCAG AAGCAGGGAA GAAAGGAGAA ATCGGGCATC CACCACTTGA GCTGCTTTTT ACGGTAGACG AGGAAACAGG TCTGACAGGA GCGAGAGGAC TTGAAACTGG CTTTTTTGAA GGAAAAATAC TTCTGAACCT TGACTCAGAG GATGAAGGCA TTTTTGTAAT AGGGTGTGCA GGCGGGCAGA ATTCACAGAT TACACTTCCT GTTGAGTGGG AACTGCTTGA TTTCAAAGAA GAAAATTTAT TTAGGTTCTT CAGGCTCTCG GTTGAAGGAC TGGAAGGTGG GCATTCCGGG ACTGAAATCA ACAAACAACG CGCAAACGGG ATACAGTTGC TCTCCCTGGC ACTGGTCAAA CTCAGGGGAA AACTTGGGAT AGAAAATGTA AGGCTTGTCC TGTTGAACGG AGGCACTGTC CATAATGCAA TTCCCAGTAC TGCAGAAGCT TTTATTGCCC TCCACGGGGA AAAACTCGAA AAAGCCGCAG AAACAATGTC AGATATCAGG CACGCATTTA AAGCTGAATA TGCAAAAACT GATCCTGGAC TTATTTTGAA TTTTGAAGAA ATTAGCAGAG AAATAATTTT TGAAGTTGCA GAAGATAAGA TCCCCAGAGA GGAAGTGTCT CATGTTCGGG TCTTTTCGCT AGAGACCGAA GAAAAACTTC TAGAACTGAT CCTGGGGCTG CCGCACGGGG TTTACAGGAT GTCGGAGACT ATTCATGGGC TTGTCGAAAC CTCAAATAAT CTTGCAACAG TCCGGACGGC TGGAAATGAA GTAATAATCG TGTCAAGCCA ACGCAGTTCA AACAATCTCA GGCTGGCTGA AATTAGCGGA AAAGTGGAAG CAATCTCAAA GCTTGCAGGC GCATGGGTTG AACATGAGCC CGGATATCCT GCATGGGAGC CCAACCTGAA GTCCGAACTG ATCTTAAAAT GCAAGCAAGT TTATACCGAG ACATTTGGGA AAGAGCCTGA GATTGAAGTG ATTCACGCAG GGCTTGAATG CGGAATAATC GGTTCGGCGC ATGAAGGTAT GGAAATGATT TCTTTTGGGC CAACGATCAA AGATGCGCAT TCTCCTGCTG AAAAGATCTT CGTTCCTTCG ATTGAGAAAG TCTGGATATT CCTGGAAAAC CTGTTTAAAA CTTATTGCTG A
|
Protein sequence | MHPKTQQILE VFEEINKIPR RSKHEEQIST WLQEWGRSRG FEVKADSLNN VLIKVPATTG YEDSPTLILQ GHMDMVCEKC KDSIHDFSRD PIRCICDGDW MRGDGTSIGA DDGIALALGL VMAEAGKKGE IGHPPLELLF TVDEETGLTG ARGLETGFFE GKILLNLDSE DEGIFVIGCA GGQNSQITLP VEWELLDFKE ENLFRFFRLS VEGLEGGHSG TEINKQRANG IQLLSLALVK LRGKLGIENV RLVLLNGGTV HNAIPSTAEA FIALHGEKLE KAAETMSDIR HAFKAEYAKT DPGLILNFEE ISREIIFEVA EDKIPREEVS HVRVFSLETE EKLLELILGL PHGVYRMSET IHGLVETSNN LATVRTAGNE VIIVSSQRSS NNLRLAEISG KVEAISKLAG AWVEHEPGYP AWEPNLKSEL ILKCKQVYTE TFGKEPEIEV IHAGLECGII GSAHEGMEMI SFGPTIKDAH SPAEKIFVPS IEKVWIFLEN LFKTYC
|
| |