Gene Mhun_1301 details

Gene Information

Locus tag: Mhun_1301
Symbol:
ID: 3922586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanospirillum hungatei JF-1
Kingdom: Archaea
Replicon accession: NC_007796
Strand:
Start bp: 1484368
End bp: 1486404
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 47%
IMG OID: 637896939
Product: peptidase S9, prolyl oligopeptidase active site region
Protein accession: YP_502761
Protein GI: 88602583
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
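
The start, end, gene length and protein length values above are mutually consistent, which can be checked with a short calculation. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1484368, 1486404)
print(length_bp)                  # 2037
print(protein_length(length_bp))  # 678
```

Under these assumptions the result matches the 2037 bp and 678 aa reported in the record.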


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.32869
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0583688
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGAC TGAACGTAAT TTTTATCCTG ATGGTAATAA TTGGTTGTGG AGCGGTGGTA 
TTCCCGGTTG TTTCAGCTGA GCCTTTTGGA AATCTCGATT ATGATGATAT CCTGAAGATG
GATTCTCTTC GTCATTTTGC ACTCACATCT GATGAAGAAA ATATTGTGTT CATGCTCATA
ACCGGAGATG ATTTTACACC CCCGGCTGAT AACGGGACCC TGATGATGGT GAATACATCA
ACCGGAAAAG TGGTCACCCT TACCGGTCCT GATGAATCGG TTACGGTATG GGCATTATCT
TCATTGAGAC CGCTTCTTGC CTACGCATCA ATGCCCCGGA ATGGCGGGAA AGAGATATTG
ACTCTCCTTG ATCTCTCCAC GATGCAGCGG GAGAAGAAAC AAAAGGTATC TGATGAGCTC
CTCTCCGGTT TTGCATGGCT CGGAGATCAT TCTCTGGTAT ATGCAGGAGC TTCTCCTGAT
ACTCCGGAGG ATACCCGGCC TGGTGATGTC ATCATCATGG ATGAGATCCC TGACCCGGTG
ATTTTGAAGT CCTATGACAT CAGGAGCGGA GCAGTAACAG AGCTGACATC CAATACTGAT
ATTATCTACG CATATCATCC GTCTCCTGAC GGGAGGTATA TTGCATATAA ATCGTCCATC
TATCCTGAAG TCTGGACAGA GAAACCATCA TTTTCCTATT ATGTGCTTGA CACGACGACC
GGGACTGAAG ATGAAGTCAT GACCCGTATT GAGGGATACC AGGATGAGAA TGAGTTTGCC
TGGTCTCCGG ATAGTTCAAT GGTCTATATC GGACGAAATC TGAATGGCGG CCTTCGATAT
CCTGTTTCGT ATGCGAGTGA TATCGTGGTA TATACTCCTG CAACCCGGAT ATTGGAAGAG
ATCCCTCTCC AATGGGAGAA AAAAATGCAC AAAGATCTCT TCAATGATGA TGTAGAGATG
AGACCTTTTG ACGGAGGTGT GTATGTCCTT CTTGCTGACG GAACAAATCC ACAGTTGGCA
AGGTATGATA AGAACGATAC GGGCTGGACA AAGACCCTGC TTTCCGGTGA GCATCAGGGG
AACATCTTTG CCCTGGAATC AAGCAGAGAT GGTTCACGGA TCTTCTATAA TTTTAATTCA
GCATCAGTGC CGCCACAGAT CTATGCAGCG GATGTTATCG CAGGGGAGAT ACGGAATCTG
AAAAGAATGA CAAGCCTGAA TGAAGACCTT CTGAAGAAAC CGCTCGGAAC CTCGGAGGTT
ATTGAATGGA CCGGTGCAAG GGGTGATACG GTTCAGGGTA TTCTCCGGTT CCCTCCGGGT
TACACACCAG GAACACCGTA TCCGCTCGTC TTTGTAATCC ATGGCGGACC GACATATACT
GATTTTGACA GTTGGCGTGA TACCTGGGAG TTTCCGTACC ATCTGATCAC CGACCGGGGA
GCAATTACCC TATCTACGAA TTATCATGGA AGCAGTAACT GGGGCTTTGA GTTTGCACAG
TCCATTGAGG GCGGTCATAT CCACGATTAC CCGACAGAAG ATTTCATGAA GGGTATTGAA
TATCTCTCAG AACAGGGGAT TATTGATAAG AACCGGGTTG GTGTGACCGG ATGGTCAAAT
GGAGGAATTC TCACTCTCTA TTGGATTACC CAGGATCCAT CCCTCAAGGT AGCGGTTGCT
GGTGCCGGAT ATGCAGATGA GAACTCACAG GTCTCAAATA CCAATGGTAT TGTGATGAAC
CTGATGTATC ATGAATATAC GCCCTTTGAG AATCCTGAAT ATTATATTCC GATCATGGGG
GTGTATAAAG CAGAGCATGT ACAGACTCCT CTTCTTATGC TTCAGGGAAC AGAGGATAAT
GCTGTTGCTC CGGCAAGTGC TCTGTCCACG TACCGGGCAT ATAAGATGGC AAGTAAAGCG
GATGTACGGA TGATTCTCTT CAAGGATCAG CCTCATCATA TGACAACGTA TCCAAATCAG
CTGAGAAAGG TGAGTGAAGA GATAGACTGG CTATCAAACG GCCTCGGCCT ATCTTAA
 
Protein sequence
MSRLNVIFIL MVIIGCGAVV FPVVSAEPFG NLDYDDILKM DSLRHFALTS DEENIVFMLI 
TGDDFTPPAD NGTLMMVNTS TGKVVTLTGP DESVTVWALS SLRPLLAYAS MPRNGGKEIL
TLLDLSTMQR EKKQKVSDEL LSGFAWLGDH SLVYAGASPD TPEDTRPGDV IIMDEIPDPV
ILKSYDIRSG AVTELTSNTD IIYAYHPSPD GRYIAYKSSI YPEVWTEKPS FSYYVLDTTT
GTEDEVMTRI EGYQDENEFA WSPDSSMVYI GRNLNGGLRY PVSYASDIVV YTPATRILEE
IPLQWEKKMH KDLFNDDVEM RPFDGGVYVL LADGTNPQLA RYDKNDTGWT KTLLSGEHQG
NIFALESSRD GSRIFYNFNS ASVPPQIYAA DVIAGEIRNL KRMTSLNEDL LKKPLGTSEV
IEWTGARGDT VQGILRFPPG YTPGTPYPLV FVIHGGPTYT DFDSWRDTWE FPYHLITDRG
AITLSTNYHG SSNWGFEFAQ SIEGGHIHDY PTEDFMKGIE YLSEQGIIDK NRVGVTGWSN
GGILTLYWIT QDPSLKVAVA GAGYADENSQ VSNTNGIVMN LMYHEYTPFE NPEYYIPIMG
VYKAEHVQTP LLMLQGTEDN AVAPASALST YRAYKMASKA DVRMILFKDQ PHHMTTYPNQ
LRKVSEEIDW LSNGLGLS
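
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC percentage comparable to the "GC content" field. The helper names clean_sequence and gc_percent are hypothetical, and the example runs only on the first line of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace and upper-casing it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGTCCAGAC TGAACGTAAT TTTTATCCTG ATGGTAATAA TTGGTTGTGG AGCGGTGGTA"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this fragment only
```

Applying the same two functions to the complete gene sequence would give a value to compare against the reported GC content; the fragment alone is not expected to match it.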