Gene Mboo_1000 details

Gene Information

Locus tag: Mboo_1000
Symbol:
ID: 5411677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 979638
End bp: 981098
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 57%
IMG OID: 640868226
Product: TPR repeat-containing protein
Protein accession: YP_001404161
Protein GI: 154150543
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
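
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 979638
end_bp = 981098

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1461
print(protein_length)  # 486
```

Under these assumptions the computed values match the listed Gene Length (1461 bp) and Protein Length (486 aa).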


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.326539
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.768066
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTCA TCGATTCAAT CCTGAAAAGC AGCGGAGTAA ATGCAGAGAC CGAGTTTTGC 
AAGGCCGAGA CCCTCTGCCG GCAGGGATAC TACACCGATG CGGTAAATAT CCTCGATAAA
GTACTTGCCG CGGAGCCCAA TCATCTCAGG GCCTCGCAGC TCAAGGGCTT TGCCCTTTAC
CAGATGGGAA CTTTTGAAGA AGCACTCCAG TATTTCGACA AGGCGCTGGG CATTGATGCA
AACCTTCCCG ATGCGCTGGT ATACAAGGGA CTCATCTACT CCGGTTTTGG GAAACATGCC
CACGCACTTG ATCTCTATGA CCGGGCACTT GCGATCCATC CCGGCTTTAT CCAGGCCTGG
TATGCAAAAG GACTCACCCT TGCCATCCTT GAACGGTACG ACGAGGCGAT CCAGTCGTAC
GACCGGGTGC TCGTCCTCCA GCCAAAGCAC GTGGATGCCC TGATCGGGAT AAGCGTTGCC
CGTAAGAAAA AAGGAGCCGG GCCAAAAGAG AACACAATCC TCCAGCATCC CAAAACCAAC
CTCCCGGAAA AATCCCGTCC TGCCCCTGCT GCCCCGATCG CAGCATCACC CGCACAAACC
CAGAAACCCC TTGCTCCAAA ACCTGCGCCG GTACTGGTTC AAAAGCCTCC TGAACCGGTC
GCGATCCCGG CACACCCCAA ACTATCTTCC CGGCAGGATC CTGCACCGGC GCTGGTACCG
GCGGAATCAC GAATAGTTCC AAACTCCTCA CCGGCCGCGG TGCTGAATGT GCGACAGCCC
GCTGCGGTAC CAGCCCATAA TGCATCCCGG GCCATGCCAA AAATGCCGCC GGTACCTGCC
ACGGTTCCGG CACAGCCACG CACCGCACCG GAAGCACCGG CCACACACCG TGGATTCCCG
GAAGAAAACC TTCTTGAACC GGAACCATCA TCACCGGCAT CTCCCCGATG CAGCACCTAC
GAGGAGATGA TCCGGGAGAT CGCTGCAAAC CCGGAGAAGG TACCGGGCCC GGACCGCTGG
CTTCTTCTGG GTAACCTCTC CATGAAACTC GGGAAGTTCC GGGACGCAGC CGGTATGTTT
GAGCATTACC TTGGACTTGT CCAGAATGAT GCCGATGCAT GGCGGGCATT AGGGGATGCA
CATAAAAAAT GCGGCCTCTA TGACGAGGCC CGTGAGGCCT ATGACCACGC ACTTGCACTG
AACCCGGAGA CAGCGGCCGT CTGGATCAGC CACGCAAAAG TGCTGGTGAT GCTGCGGGAT
CATGAGGGTG CTCTTGTCTC CTGCGATCAG GCCATCTCGC AGGACGGGGA ATATATCGAA
GCATGGCTGT ATAAGGGTTT CATCCTCAAA AAAATACACC GGAACGACGA TGCAATGGCT
GCGTACGATC GCGTACTCAT GCTTAACCCG GGCCATGATC ACGCAGTCCG GGAACTGCGG
CGCATGAAGG GCGGGGCGTA A
 
Protein sequence
MGLIDSILKS SGVNAETEFC KAETLCRQGY YTDAVNILDK VLAAEPNHLR ASQLKGFALY 
QMGTFEEALQ YFDKALGIDA NLPDALVYKG LIYSGFGKHA HALDLYDRAL AIHPGFIQAW
YAKGLTLAIL ERYDEAIQSY DRVLVLQPKH VDALIGISVA RKKKGAGPKE NTILQHPKTN
LPEKSRPAPA APIAASPAQT QKPLAPKPAP VLVQKPPEPV AIPAHPKLSS RQDPAPALVP
AESRIVPNSS PAAVLNVRQP AAVPAHNASR AMPKMPPVPA TVPAQPRTAP EAPATHRGFP
EENLLEPEPS SPASPRCSTY EEMIREIAAN PEKVPGPDRW LLLGNLSMKL GKFRDAAGMF
EHYLGLVQND ADAWRALGDA HKKCGLYDEA REAYDHALAL NPETAAVWIS HAKVLVMLRD
HEGALVSCDQ AISQDGEYIE AWLYKGFILK KIHRNDDAMA AYDRVLMLNP GHDHAVRELR
RMKGGA
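
As a rough illustration of how the gene and protein sequences relate, the sketch below translates the first line of the gene sequence and compares it with the start of the protein sequence. It is a minimal example, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares with the standard code for internal codons (table 11 differs mainly in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above (60 bases).
first_line = "ATGGGACTCA TCGATTCAAT CCTGAAAAGC AGCGGAGTAA ATGCAGAGAC CGAGTTTTGC"

first_20_aa = translate(first_line)
print(first_20_aa)  # MGLIDSILKSSGVNAETEFC
```

The result matches the first 20 residues of the protein sequence (MGLIDSILKS SGVNAETEFC). Passing the full gene sequence to `translate` and `gc_fraction` would allow the same kind of comparison against the complete protein and the reported GC content; the 57% figure applies to the whole gene, not to this 60-base excerpt.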