Gene Mhun_1160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_1160 
Symbol 
ID3922968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp1312122 
End bp1314983 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content48% 
IMG OID637896802 
Productexcinuclease ABC, A subunit 
Protein accessionYP_502627 
Protein GI88602449 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.216625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATA TCATCATCAA AGGTGCCAGG GAACATAACC TGAAGAACAT CTCTGTCACT 
ATTCCCCGTG ACCAGTTTAT CGTCATCACT GGATTATCCG GTTCAGGAAA ATCAACCCTC
GCATTTGACA CCCTCTATGC TGAAGGACAA CGGCGGTATG TTGAATCACT CTCCTCATAT
GCCCGCCAGT TCCTAGGCCT CATGAATAAG CCGGATGTGG AAGCAATTGA AGGATTGTCA
CCGGCCATAT CAATTGAACA GAAATCAACC TCGAAAAATC CCCGGTCAAC GGTCGGAACG
GTAACCGAGA TCTATGACTA TCTGCGGTTA CTCTACGCAC GGATTGGTAT CCCCTATTGT
CCTGAACATA ACCTTCCCAT CATGGGACAG TCACCTGAGC GTATAGCCGA ACGGATTGAA
GAGGAATGCA CCGGGCAGAT AACCATCCTT GCACCCCTTA TCAGGAAAAA AAAGGGGACA
TACCAGCAGC TCTTCAAGGA TCTGAATAAG GAAGGATTTG CACGGGTCAG GGTGAATGGA
GAGATATACC GGACGGATGA TGAGATCACC CTGGAACGGT ACAAGATGCA TGACATCGAT
CTCGTTATCG ACCGGCTCGA TGTATCAGAT CACTCACGGC TTGTTGAGGC ATGTGAACAG
GCATCTGCCA GATCCGAGGG ACTTATCATC ATCACCTGTG AAGACGGGGA AGATCGGACG
TATTCATCTA AAATGGCCTG TCCGATCTGT GGCATCACCT TTGAAGAACT CCAGCCACGA
ATGTTCTCCT TTAACAGCCC ATTTGGTGCC TGTCCGGACT GTAAGGGGCT TGGGATCAGA
ATGGACTTTG ATCCTGATCT CATTATCCCT GAAAAAGAAA AATCCATAGC CGAAGGTGCA
ATCGCAACCT ACCGGAATTT CCTTGATGGA TACCGGTCCC AATTTGTCGG AGCCGTTGCA
AAACATTTTG GCTTTACGGT CAATACCCCG ATTAAAGATC TGAACGAAAA ACAATACAAT
GCTCTGATGT ACGGTTCCAA TGAAAAGATC TCGTTTCATA TGAGTTACAA ACAGGGTGAA
GGAGAGTGGT CGCATAAGGG AACCTGGGAA GGGCTGCTCC CCCAGGCAGA ACGTCTGTAC
AGTCAGACCA ATTCAGAATA CCGGAAACGT GAACTTGAAA AGTTCATGAA AATTACTCCC
TGTCAGACGT GCAAGGGAAA AAGACTGAAG GATAAAATTC TCGCGGTCAG AATAAAAGAC
AAATCCATCA TCGACCTGAC CGATCTCTCT ATTACTGCAA GTATTGCATT CTTCAACAAC
CTGGAACTGT CAGAGAAAGA GACAGAAATT GCCAGGCAGA TACTGAAAGA GATACTTTCC
AGACTTACCT TCCTCGAACG GGTTGGTCTT GGATACCTGA CTCTTTCACG AAGTGCAGGC
ACACTCTCTG GTGGTGAAGC ACAGCGGATA CGTCTTGCTA CCCAGATTGG TGCCAACCTC
ATGGGGGTGC TCTATGTGCT TGATGAACCA TCCATCGGAC TGCATCAACG GGATAATAAC
CGGCTCATCG ACACCCTCAG GCAACTCCGC GATCTTGGAA ACACTCTGAT TGTAGTCGAA
CATGATGAAG ATACCATACG GGCGGCAGAC TGGGTTATCG ATATGGGGCC CGGTGCCGGC
CTTAATGGTG GTGAGATCAT CGCCGAAGGG ACACCCCACG AGATCGAACA GAATGAGCGG
TCACTTACCG GTGCGTACCT CTCCGGACGG ATGCAGATCG AAATCCCGGA GAAGAGGAGG
ATTCATGATC GGTATATATC CATCACCGGG TGTCAGGAAA ATAACCTGAA GAACATCACC
GCCCGAATAC CAATGGGTAC CTTAACCCTC ATCACCGGTG TATCCGGATC AGGGAAATCA
TCTCTCATTT ATGACACGCT CTATCCGGCC CTGCAAAAGA TGGTCTATCA TTCCAGAGTT
GAAGCAGGAA AACATACATC TGTCACCTGT GATGAACCCA TCGACAAGGT TATTGTCATC
GACCAGTCAC CCATCGGGAG AACTCCCCGG TCAAACCCGG CCACCTATAC GAAGGTCTTC
GATGAGATTC GGAATATCTT TGCAGAGACA AAAGAAGCAA AGATGAGGGG ATACAAATCC
GGACGATTCT CATTTAATGT CAAGGGAGGC AGGTGTGAGG CATGTCAGGG AGACGGCCTC
ATAAAAATTG AGATGAACTT TCTTCCTGAT GTCTATATTG AATGTGAAGA GTGTAAAGGG
ACCAGGTATA ACCGGGAGAC CCTTGAGGTC AAATATAAAG ACAAATCCAT TGCCGACGTC
CTGGCGATGA GTGTGGATGA GGCAATCGAA CTCTTCTCAG CCATCCCGAA GATCAGAAAT
AAATTGCAGA CTCTTATTGA CGTAGGACTG GGATACATCA AGCTTGGACA GAGTGCTACA
ACCCTCTCCG GTGGTGAGGC GCAGCGTATC AAACTTACCA GGGAGCTTTC AAAACGGGCA
ACCGGGCAGA CGGTCTATCT TCTTGACGAA CCGACAACCG GCCTGCATTT CCATGATGTC
AGAAAACTCA TCCAGGTCTT TTCAGAACTG GTTGCAAAAG GAAACACGGT AATTGTAATC
GAACATAACC TCGATGTTAT CAAATCTGCA GATTATATCA TCGATCTTGG TCCTGAAGGG
GGAGATGCCG GTGGAGAGAT CATCGCAACC GGGACACCTG AAGAGGTAGC AGGCAACCCA
GCCAGTTATA CCGGCATGTT CCTTGCCAAA CTCCTCCCTT CATCACCTCC AGATAAGAAG
AAACGAACCC CACGTAAAAA ATCTGAGCAG ACAAGTGGAT GA
 
Protein sequence
MKNIIIKGAR EHNLKNISVT IPRDQFIVIT GLSGSGKSTL AFDTLYAEGQ RRYVESLSSY 
ARQFLGLMNK PDVEAIEGLS PAISIEQKST SKNPRSTVGT VTEIYDYLRL LYARIGIPYC
PEHNLPIMGQ SPERIAERIE EECTGQITIL APLIRKKKGT YQQLFKDLNK EGFARVRVNG
EIYRTDDEIT LERYKMHDID LVIDRLDVSD HSRLVEACEQ ASARSEGLII ITCEDGEDRT
YSSKMACPIC GITFEELQPR MFSFNSPFGA CPDCKGLGIR MDFDPDLIIP EKEKSIAEGA
IATYRNFLDG YRSQFVGAVA KHFGFTVNTP IKDLNEKQYN ALMYGSNEKI SFHMSYKQGE
GEWSHKGTWE GLLPQAERLY SQTNSEYRKR ELEKFMKITP CQTCKGKRLK DKILAVRIKD
KSIIDLTDLS ITASIAFFNN LELSEKETEI ARQILKEILS RLTFLERVGL GYLTLSRSAG
TLSGGEAQRI RLATQIGANL MGVLYVLDEP SIGLHQRDNN RLIDTLRQLR DLGNTLIVVE
HDEDTIRAAD WVIDMGPGAG LNGGEIIAEG TPHEIEQNER SLTGAYLSGR MQIEIPEKRR
IHDRYISITG CQENNLKNIT ARIPMGTLTL ITGVSGSGKS SLIYDTLYPA LQKMVYHSRV
EAGKHTSVTC DEPIDKVIVI DQSPIGRTPR SNPATYTKVF DEIRNIFAET KEAKMRGYKS
GRFSFNVKGG RCEACQGDGL IKIEMNFLPD VYIECEECKG TRYNRETLEV KYKDKSIADV
LAMSVDEAIE LFSAIPKIRN KLQTLIDVGL GYIKLGQSAT TLSGGEAQRI KLTRELSKRA
TGQTVYLLDE PTTGLHFHDV RKLIQVFSEL VAKGNTVIVI EHNLDVIKSA DYIIDLGPEG
GDAGGEIIAT GTPEEVAGNP ASYTGMFLAK LLPSSPPDKK KRTPRKKSEQ TSG