Gene Mbar_A3039 details


Gene Information

Locus tag: Mbar_A3039
Symbol:
ID: 3624836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 3917315
End bp: 3918835
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 45%
IMG OID: 637701881
Product: aminoacyl-histidine dipeptidase
Protein accession: YP_306511
Protein GI: 73670496
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01893] aminoacyl-histidine dipeptidase
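
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive, and that the last codon of the coding sequence is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is an untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3917315, 3918835

length = gene_length_bp(start_bp, end_bp)
residues = protein_length_aa(length)

print(length)    # 1521
print(residues)  # 506
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption for this record.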


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000107892
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.196612
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCCTA AAACTCAACA GATCCTCGAA GTCTTTGAAG AAATCAATAA AATCCCGAGA 
CGTTCAAAGC ATGAAGAACA GATTTCAACC TGGCTGCAGG AGTGGGGCCG GTCCAGAGGT
TTTGAGGTAA AAGCCGATTC CTTAAACAAT GTGCTTATCA AAGTCCCTGC AACAACTGGA
TATGAGGACT CTCCCACACT TATCTTGCAA GGGCATATGG ATATGGTCTG CGAGAAATGT
AAAGACTCCA TACACGACTT TTCCAGAGAC CCTATAAGAT GTATTTGCGA CGGGGACTGG
ATGCGGGGAG ATGGAACTTC CATAGGTGCG GATGATGGAA TTGCCCTTGC ACTTGGACTG
GTAATGGCAG AAGCAGGGAA GAAAGGAGAA ATCGGGCATC CACCACTTGA GCTGCTTTTT
ACGGTAGACG AGGAAACAGG TCTGACAGGA GCGAGAGGAC TTGAAACTGG CTTTTTTGAA
GGAAAAATAC TTCTGAACCT TGACTCAGAG GATGAAGGCA TTTTTGTAAT AGGGTGTGCA
GGCGGGCAGA ATTCACAGAT TACACTTCCT GTTGAGTGGG AACTGCTTGA TTTCAAAGAA
GAAAATTTAT TTAGGTTCTT CAGGCTCTCG GTTGAAGGAC TGGAAGGTGG GCATTCCGGG
ACTGAAATCA ACAAACAACG CGCAAACGGG ATACAGTTGC TCTCCCTGGC ACTGGTCAAA
CTCAGGGGAA AACTTGGGAT AGAAAATGTA AGGCTTGTCC TGTTGAACGG AGGCACTGTC
CATAATGCAA TTCCCAGTAC TGCAGAAGCT TTTATTGCCC TCCACGGGGA AAAACTCGAA
AAAGCCGCAG AAACAATGTC AGATATCAGG CACGCATTTA AAGCTGAATA TGCAAAAACT
GATCCTGGAC TTATTTTGAA TTTTGAAGAA ATTAGCAGAG AAATAATTTT TGAAGTTGCA
GAAGATAAGA TCCCCAGAGA GGAAGTGTCT CATGTTCGGG TCTTTTCGCT AGAGACCGAA
GAAAAACTTC TAGAACTGAT CCTGGGGCTG CCGCACGGGG TTTACAGGAT GTCGGAGACT
ATTCATGGGC TTGTCGAAAC CTCAAATAAT CTTGCAACAG TCCGGACGGC TGGAAATGAA
GTAATAATCG TGTCAAGCCA ACGCAGTTCA AACAATCTCA GGCTGGCTGA AATTAGCGGA
AAAGTGGAAG CAATCTCAAA GCTTGCAGGC GCATGGGTTG AACATGAGCC CGGATATCCT
GCATGGGAGC CCAACCTGAA GTCCGAACTG ATCTTAAAAT GCAAGCAAGT TTATACCGAG
ACATTTGGGA AAGAGCCTGA GATTGAAGTG ATTCACGCAG GGCTTGAATG CGGAATAATC
GGTTCGGCGC ATGAAGGTAT GGAAATGATT TCTTTTGGGC CAACGATCAA AGATGCGCAT
TCTCCTGCTG AAAAGATCTT CGTTCCTTCG ATTGAGAAAG TCTGGATATT CCTGGAAAAC
CTGTTTAAAA CTTATTGCTG A
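
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be joined into one string before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and only the first line of the listing is used as sample input, so the printed values describe that line and not the whole gene.

```python
def clean_sequence(listing: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing above, used as sample input.
first_line = "ATGCACCCTA AAACTCAACA GATCCTCGAA GTCTTTGAAG AAATCAATAA AATCCCGAGA"

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG

# GC fraction of the sample line only; pass the full listing to
# compare against the GC content field of the record.
print(round(gc_fraction(seq), 2))
```

Running `clean_sequence` on the complete listing and checking that its length matches the Gene Length field is a quick way to detect lines lost during copying.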
 
Protein sequence
MHPKTQQILE VFEEINKIPR RSKHEEQIST WLQEWGRSRG FEVKADSLNN VLIKVPATTG 
YEDSPTLILQ GHMDMVCEKC KDSIHDFSRD PIRCICDGDW MRGDGTSIGA DDGIALALGL
VMAEAGKKGE IGHPPLELLF TVDEETGLTG ARGLETGFFE GKILLNLDSE DEGIFVIGCA
GGQNSQITLP VEWELLDFKE ENLFRFFRLS VEGLEGGHSG TEINKQRANG IQLLSLALVK
LRGKLGIENV RLVLLNGGTV HNAIPSTAEA FIALHGEKLE KAAETMSDIR HAFKAEYAKT
DPGLILNFEE ISREIIFEVA EDKIPREEVS HVRVFSLETE EKLLELILGL PHGVYRMSET
IHGLVETSNN LATVRTAGNE VIIVSSQRSS NNLRLAEISG KVEAISKLAG AWVEHEPGYP
AWEPNLKSEL ILKCKQVYTE TFGKEPEIEV IHAGLECGII GSAHEGMEMI SFGPTIKDAH
SPAEKIFVPS IEKVWIFLEN LFKTYC