Gene Mhun_2060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_2060 
Symbol 
ID3923710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp2326973 
End bp2329927 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content51% 
IMG OID637897667 
Producttetratricopeptide TPR_2 
Protein accessionYP_503487 
Protein GI88603309 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGATA TACCTGAACA GGAAGGACGC AGTTGGTTTG ATTTTCTGAT CACCCGGGTT 
ACTGCTCTCC TTGATGCGGA GGAATATGAC CAGGCCCTTA CTCAGGTTGA CATGGGGCTG
TCTGTCTGGA AGGGTAATCT GTACCTCCTC TTCCTGAAAT CTATCGTTCT TTCGTACCTG
AAACGTCCGG ATGAGGCATT AAGGCAATTA TATCAGATCT ACCGGGAGAA TCCCTTTGAA
GAGAAGTTCA GGGATGAGCT GGGCAGAAAT CTCATGCTCC AGGGGTACTA TCATAGTGCT
CTCTCTGTTC TTCAGCACTA TGAATTTATG TCCAATGGTC TTGCCGGAAT TCGAAAACTC
TTCATCAGTA TCTGCCATAT ATTCACCGGA AACCCGGATC TGGCCATTGA ATATTGTGAC
GGGATACAAT CCCAGCTGGG TGACACCCCT GATGTCACCG GATTTATGGC GACAATTCAT
GCACTCAAAG CTCTCGCTCT TATTCATACG GGGGAGATGT CTGCTGCCGC GGAGATCATG
GCCGCCGTTG GAGAGGAGGG AGATAAAATG CCCCTGGTCC CCTATGTAAA AGGGATATTA
GCATGGAGAT CTGGTGATAT CAGTGCTGCA GAGACCTTTC TTCGGGCTTC ATGCGAGGAC
CAGCCCCAGG ATATGCAGTC AAAAGTTGAT CTTGCTCATC TCCTGACCGA ATGTGGAAGA
GAGGATGAGG CTCTTGTGAT CAGGCGGGAT ATCTCTGGCT TTTATGCACC GCCCCACCCA
GAGGCCGCTC CCGTGTTCAG GGCTGAAGAG CTCCTGAAAT CAGGCAGATA TGAAGAAGCT
GAAGTTTTTC TCGAATCTGC ACAAAAATCA TCTCCGGACG ACATGGATCT CTTCATTCCC
TACCTTTCAT CGTTATATTT CACCGGCCGG TATGATGATC TGATCAGACT GGCAGAGACG
GGTGAAAAGG CGAGCGATGC ATGGAAAGCG ATATATCTGA AAGCATCGGC CCTGTATTCA
TCAGGCAAGA TCAGGGAGGC CATCAGGGTT CTGATAACCG CAGCAGCGAG GGATCGGACG
AATATGATGT TTTCCCTGCT TCACATGAAA TCTGCCGGTT ATTTCAAGCC CCCGTCAGAA
GAGACACCGG AGTCTCTGGC CCTTGAATTA TTCATGACCG ATGGCATGGA TGCTGCAATA
CCACCCTTTG AGGTCCTTGC TCATGCAAGC GGGGCTCCGG AGTATCTGGT TTTTCTCTGT
TTCTGTTACA TTATGGCGAA CCGGGGGACA GATGCCGTTC TCATATCCGA ACGGTTTGCA
GATTCGGAAG AGCAGGATTT CTGTCTGATC CGTGCCATTG CATTCAGGTC GGTCGGTTGC
CTGCATGATG CTGAGATAAT CACACAGGAT GTTCTCACGA AGAGTCCTGA TTCCCGGCCC
GCACTGAATA TGTATGTCAG GACGCTTCTT GAACAGGAAC GATACCGTGA TGCATATTTT
GAACTGAGTT CACGAAAGGA TGCGTTGTCA TCTGACTGCA CTCTGCTTGA AGTGTTTATC
CGCTGCCTTA TCAGGTCTGG TGAATATCGC GATGCAGCCC TTACTGCTGT CCGGATCATG
CAGATGAATC CCGGACGTCC GATGGATTAT TACCTGTGTG CCCTGGCTTG TCAGCTTGAG
GGGAGTTTTC GTACCGGTCT TTATGCGATG CGGGAGAACT TCCGAATATG CGGCTCTTCG
AAGAAAAACC TCCTGATGTA TGGTCGGTTA TTACTCCTGG CTACGGAAGC ACAAGAAGCC
GCGATTCTCT TCTCTGAATG TAAGAATTCC CTTCCAAACA CGGCAGAGGT CAGGTATCTG
CATACAGCAG CCCGGATTCT CTCAGGGAGC GATAAGGAGT GTGAATCGGT TTCGGAGTAT
GCTGCCCGGT TCTTCCAGGC TGAGAAAGGA GAGCTTCGGG CATTTATCGG GGATTACCGG
CAGGCCGCCA GGATGCTTGC AGAGGAGATT CGCCAGGATC CTGATGATTC ATCGCTGCGG
ATTACCTTTG CCAGAGTGTT ATTTGAACTG GAACAGTATG GGGAGGCGCT TTCCCAGCTC
TCTCTTGCAG CCGGCCGGTA TGGATACGAT GGAGAGGTGA AGGACCTACT GGAGGAAGGG
CTGAAGATGC TCCGATTGCA GGGTCCTCTC GAGGATCTGA AAACGATAAG CCATACTCCG
GAGAGGTACG CAGAACTGCT CCATGAACGG GCGGTCAGTC TGAAACGATA TGGGGTGGTC
AGGCTTGCTG TAAATGTCTG TACGACGCTC ACCAATATAG AGTCAGATTC TCCGATAGTC
TCTCGGTACC AGGCAGATGC ATATCACATG CTTGGCGATC TGGAAGAGAA CGAAGAACGA
TATGTACAGG CGATTGCGGT ATATGACCAA CTTCTCCAAA ACGATCCTGA CAACCCTGTT
TTTCTTGCTG AGAAGGGCCT GGTTCTAGAT AATCTCCGTC GGTTTGATGA GGCAGAGCCG
GTTCTCCTTC GAGCCCTTGA ACTGGATCCC TCATCCGGTG TTGCCCATTC TGCACTCTGC
TGGTGTTTAT CCAACCTGGG GCGGCATGAA GAGGCTGTTG TGCATGGGAA TCAGGCTGTT
ATACTCATGC CTTATGAATG GGGAGCCTGG AATAACCGTG GCCTGTCAAG ACTTGGTCTG
CGTGATTTCG AGGGGGCGGT ATCTGATTTC AGGCAGGCAA TCAGGTGTCA GCCCGGAGAG
GTTATTGCAC GGAGAAATCT CTGCGATGCC CTCGCTGCCC TTGATGATGA CGATGCCCAT
GAAGAGTATG AACAGATCAT CGTCAGGTTT GGGAAGCGGG CATTTCCTGA AGAGGAAGAA
GAGATGCCAG CACAACCCCG GGTGCCAGGA AGACCCCTGA AAACTCCGGT CGGGTATATG
GATGGGTATT GGTGA
 
Protein sequence
MEDIPEQEGR SWFDFLITRV TALLDAEEYD QALTQVDMGL SVWKGNLYLL FLKSIVLSYL 
KRPDEALRQL YQIYRENPFE EKFRDELGRN LMLQGYYHSA LSVLQHYEFM SNGLAGIRKL
FISICHIFTG NPDLAIEYCD GIQSQLGDTP DVTGFMATIH ALKALALIHT GEMSAAAEIM
AAVGEEGDKM PLVPYVKGIL AWRSGDISAA ETFLRASCED QPQDMQSKVD LAHLLTECGR
EDEALVIRRD ISGFYAPPHP EAAPVFRAEE LLKSGRYEEA EVFLESAQKS SPDDMDLFIP
YLSSLYFTGR YDDLIRLAET GEKASDAWKA IYLKASALYS SGKIREAIRV LITAAARDRT
NMMFSLLHMK SAGYFKPPSE ETPESLALEL FMTDGMDAAI PPFEVLAHAS GAPEYLVFLC
FCYIMANRGT DAVLISERFA DSEEQDFCLI RAIAFRSVGC LHDAEIITQD VLTKSPDSRP
ALNMYVRTLL EQERYRDAYF ELSSRKDALS SDCTLLEVFI RCLIRSGEYR DAALTAVRIM
QMNPGRPMDY YLCALACQLE GSFRTGLYAM RENFRICGSS KKNLLMYGRL LLLATEAQEA
AILFSECKNS LPNTAEVRYL HTAARILSGS DKECESVSEY AARFFQAEKG ELRAFIGDYR
QAARMLAEEI RQDPDDSSLR ITFARVLFEL EQYGEALSQL SLAAGRYGYD GEVKDLLEEG
LKMLRLQGPL EDLKTISHTP ERYAELLHER AVSLKRYGVV RLAVNVCTTL TNIESDSPIV
SRYQADAYHM LGDLEENEER YVQAIAVYDQ LLQNDPDNPV FLAEKGLVLD NLRRFDEAEP
VLLRALELDP SSGVAHSALC WCLSNLGRHE EAVVHGNQAV ILMPYEWGAW NNRGLSRLGL
RDFEGAVSDF RQAIRCQPGE VIARRNLCDA LAALDDDDAH EEYEQIIVRF GKRAFPEEEE
EMPAQPRVPG RPLKTPVGYM DGYW