Gene Mhun_2840 details


Gene Information

Locus tag: Mhun_2840
Symbol:
ID: 3923109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanospirillum hungatei JF-1
Kingdom: Archaea
Replicon accession: NC_007796
Strand:
Start bp: 3118100
End bp: 3120991
Gene Length: 2892 bp
Protein Length: 963 aa
Translation table: 11
GC content: 53%
IMG OID: 637898450
Product: PKD
Protein accession: YP_504251
Protein GI: 88604073
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
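
The length fields in this record are consistent with its coordinates. The sketch below checks that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 3118100
END_BP = 3120991


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming an unspliced CDS with one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 2892
    print(protein_length(length))  # 963
```

Both results match the Gene Length and Protein Length fields listed in the record.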


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0720897
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGATG GGATTCGAAT AATCAGGCCG GCGGCTATGC TGGCTGTTCT GCTCTGTCTG 
CTGATGATGA CGGCAGGGGC AGTACCACCG CTGCCTGCCG AATTTTATGG TAAAGTAACC
GTTGACGGAA CACCTGCTTC AATTGGTACT GCGCTGATCG CAAAGATCAA TGATCAGGTA
CGAGGAAAAA TTGCTCTTTC CACTGCCGGA ACATATGGTG GGACCGGCAT CTTTGATGAT
AAACTCGTTG TTGCTGCAAC CGAAGATGAT GTGAAATCCG GATCGGCAAC CATATCCTTT
TATATTGGGG ATAAGAAGGC AGACCAGACC GTTCCATTTG AACCTGGTGT TGCCAAAGAA
CTGGATCTGA CGGTTGGCAA TTTCGGAGCT GACTTTACGG CAAACCCGAC TTCAGGAGCG
GCACCACTTA CGGTTCAGTT TACTGATACA TCCACGACCG AGTGGTCAGT ATGGACCTGG
GACTTTGGCG ACGGAGGTAG CTCGGTCATT AAAAACCCAA GCCATGTATA CGAGACTCCG
GGAACCTATA CCGTTAAGAT GACCGTGGGC TCCATGAGCG GTACCTACAC CGTCACCAAG
GATAATTACA TTACCGTAAC CCAGTCCGGG GGCATTGTTG CTGACTTTAC TGCGACACCG
ACATCAGGAA CCGCTCCGCT CACCGTCCAG TTCACTGACA CATCAACCGG AAGCCCGACC
ATGTGGGCAT GGGACTTTGG GGACGGAACC ACAGAAGGAA TGATTGCAAA TCCCTCGCAC
ACATACCAGA ATGCCGGTAC ATACACCGTA AAACTGACCG CAAGTTCAGC AACCGGTGGT
TCCAGTACCA AAACAAAAGA GGGATACATT ACCGTAACCC AGTCCGGTGG CATTGTTGCA
AACTTTACCG CAGCACCGAC ATCAGGTACT GCTCCACTGA CAGTTCAGTT CACCGACACC
TCAACCGGAG GTCCGACCAT GTGGTCGTGG GACTTTGGTG ACGGTACAAC GGAAGGAATG
CTCGCAAATC CCTCACACAC ATATCAGAAT GCCGGAACAT ACACCGTAAA ACTAACGGCA
AGTTCAGCAA CCGGTGGCTC CAGTACCAAA ATTCGGGAAG GATATATCAC CGTCAGCCCC
TCGGGCTCCG GACCCACTGC AGCCTTTACC GTTGACAAGC GGAGTGGGCC AAAACCCCTG
ACTGTTCAGT TCACTGATCA GTCCACCGGA GGGCCGACCA TGTGGGCATG GGACTTTGGA
GATGGTGGAA CCTCAATGGT CGCATCACCA TCATACACCT ACCAGGAAGC AGGAGTGTAT
ACCGTCAGTC TGACCGCCTC CAACACGGCA GGATCTGATA CCAAGACAGA AAAGGACTAT
ATCTCTGTGA CCGGAGACAT ACCGCCTCCG GTAGCAATGT TTGAAGCAAC ACCACTCTCC
GGCTCTGCTC CACTGACGGT CCAGTTCACT GACCTGTCAA TCGGACCGCC AACTTCCTAT
GCATGGGACT TTGGGGATGG CGGAACTTCA ACCGAGGCAA ATCCAAGCCA CGTATATTCC
GCTGGTGGCA CATATACCGT CAAGCTCACG GTGAAGAATA GTGGCGGATC TCACACCATG
ACCCGTGAGA ATTACATATC TGTTGGTGGA TCAGGAATCA TTGCAGACTT CTCTGGAACA
CCGACCTCAG GAACCGTGCC ACTCACCGTC CAGTTCACTG ACCTCTCGAC CGGCGGGCCG
ACCATGTGGG CATGGGACTT TGGGGATGGC GGAACCTCGA CGGTTGCATC ACCTTCATAC
ACCTACCAGA CCCCAGGAAC CTACACGGTC AAACTTACCG CATCATCTCA GACTGGTGGC
ACGAGCACCA AGGTCAGAGA AGGATATATC ACGGTCTCCC CATCGGGCGG TATTATTGCA
GACTTTGTTG GAACCCCAAC GAGTGGAAAT GCACCTCTGA CAGTCCAGTT CAGTGACCGT
TCCCAGGGTG GGCCGACCAT GTGGTCATGG GTCTTTGGAG ACGGTGGAAC CGCACTGGTT
GCAAATCCGG TACATGTATA TCAGCAGCCT GGCAAGTACA CCGTCAGTCT TACGGCGAGC
AATCAGGCAT CATCCAATAC TGCAGTAAAG ACAGATTATG TCACGGTCTC TTCAGGACCG
GTTGGTTCAG GATCAATCAG GATAATTTAC GCACCTGATC GGTCGTCTGT TTATCTTGAT
AATGCCCTGA AGGGTGAGAC AAAATTCCTG CAGACGTTCA GAATAGAGAA TCTCCCGGCA
GGAAGCTACC AGCTGAAAGT TACCAAGCCA GGATTTTCAG ATTACTATGT GAACGTTCCG
GTTACTTCAG GCAGGGCAAC TGAGGTCGTT GCAGATATGA GACTGCAGCC AAGCCAGAAT
GGTATCCTGA GCGTGTATAC CTATCCGGCC GGATCAACGG TGTATGTTGA TGGCGTGGAG
GCAGGAACTG GTCCGCTCTG GCTTGCCGAC GTAACTCCAG GTATGCATCA GGTACGGGTC
TCCTCTGCAG GGTATCTTGA CTGGAACCAG GCTATTGATG TGAAAGGTGG CGGAAGTGTG
AACTATGTGA CCGCCGCTCT CTATCCGTCA TGGTGGACAC CCATTTACGG ATATGTGATG
ATCTCATCCA TGCCAGGGAA CGGAGTGGCC TACCTTGATG GAGTGGCTCA GGGTAAAACT
CCGGTTACCC TGTCACAGGT TTCTCCAGGG CAGCATACCA TCAGAATCGA ACTGCCCGGC
TATCAGCCTT GGGAACAGGT CGTGAATGTT ATGGAAGGAA GAACGTCCTA CGTCCTTGCC
CAGATGACCA CCGGTGGCAG CAGTGGAACG ACCCCGGTGA TTGTTGCATC CGCAGGGAAT
ACGACCAATT AA
 
Protein sequence
MMDGIRIIRP AAMLAVLLCL LMMTAGAVPP LPAEFYGKVT VDGTPASIGT ALIAKINDQV 
RGKIALSTAG TYGGTGIFDD KLVVAATEDD VKSGSATISF YIGDKKADQT VPFEPGVAKE
LDLTVGNFGA DFTANPTSGA APLTVQFTDT STTEWSVWTW DFGDGGSSVI KNPSHVYETP
GTYTVKMTVG SMSGTYTVTK DNYITVTQSG GIVADFTATP TSGTAPLTVQ FTDTSTGSPT
MWAWDFGDGT TEGMIANPSH TYQNAGTYTV KLTASSATGG SSTKTKEGYI TVTQSGGIVA
NFTAAPTSGT APLTVQFTDT STGGPTMWSW DFGDGTTEGM LANPSHTYQN AGTYTVKLTA
SSATGGSSTK IREGYITVSP SGSGPTAAFT VDKRSGPKPL TVQFTDQSTG GPTMWAWDFG
DGGTSMVASP SYTYQEAGVY TVSLTASNTA GSDTKTEKDY ISVTGDIPPP VAMFEATPLS
GSAPLTVQFT DLSIGPPTSY AWDFGDGGTS TEANPSHVYS AGGTYTVKLT VKNSGGSHTM
TRENYISVGG SGIIADFSGT PTSGTVPLTV QFTDLSTGGP TMWAWDFGDG GTSTVASPSY
TYQTPGTYTV KLTASSQTGG TSTKVREGYI TVSPSGGIIA DFVGTPTSGN APLTVQFSDR
SQGGPTMWSW VFGDGGTALV ANPVHVYQQP GKYTVSLTAS NQASSNTAVK TDYVTVSSGP
VGSGSIRIIY APDRSSVYLD NALKGETKFL QTFRIENLPA GSYQLKVTKP GFSDYYVNVP
VTSGRATEVV ADMRLQPSQN GILSVYTYPA GSTVYVDGVE AGTGPLWLAD VTPGMHQVRV
SSAGYLDWNQ AIDVKGGGSV NYVTAALYPS WWTPIYGYVM ISSMPGNGVA YLDGVAQGKT
PVTLSQVSPG QHTIRIELPG YQPWEQVVNV MEGRTSYVLA QMTTGGSSGT TPVIVASAGN
TTN
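
The sequences above are printed in blocks of ten characters. Before they can be compared against the length and GC content fields, the spacing has to be removed. The following is a minimal sketch of that cleanup, with hypothetical helper names. It assumes GC content is the share of G and C bases among all bases, which may be rounded differently from the value the database reports.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above, used only as a short demo input.
    first_line = (
        "ATGATGGATG GGATTCGAAT AATCAGGCCG "
        "GCGGCTATGC TGGCTGTTCT GCTCTGTCTG"
    )
    seq = clean_sequence(first_line)
    print(len(seq))
    print(round(gc_percent(seq), 1))
```

To check the whole record, paste the full gene sequence into `clean_sequence` and compare `len(...)` with the Gene Length field and `gc_percent(...)` with the GC content field.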