Gene Mboo_1387 details

Gene Information

Locus tag: Mboo_1387
Symbol:
ID: 5412037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1420151
End bp: 1421152
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 52%
IMG OID: 640868620
Product: TPR repeat-containing protein
Protein accession: YP_001404548
Protein GI: 154150930
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
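
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1420151
end_bp = 1421152
gene_length_bp = 1002
protein_length_aa = 333

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the terminal stop codon
# does not contribute an amino acid to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 1002 0 333
```

Under these assumptions the span matches the listed gene length, and 334 codons minus the stop codon gives the listed 333 aa.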


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.316988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.93517
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATGA GCACAAAAGC TGGTATGCTG GTCCTTGTTC TCCTTATTCT CATTCCCGCG 
GTGGCAGCTG AGGATGCTAC GGACTGGTAC ACCAAAGCCC AGAACGCTGC TTCAGCAGGC
GACTATACCG ATGCAGTGAT CTATTACACC AATGCAATCA GTCTCAATCC TACCTATGAT
GCCGCATACG CCGGGGAGGC GGCGGCACTT AATTCACTCG GGCAGTATTC CGCGGCACTT
ACCGCAGCAA ACCAGGCCCT TGCAATCCGT TCAAGTCCCA CAGCGCTTGG AGCGCAGGCG
GATGCCCTTT TTTACCTGGG CAGGTACAAT GACGCGATCG GGGCTTATAC CAATTATACT
GCAGTCGTAA CCAATCAACA ATCGGCATAC TGCAATCTTG CATATTCCTA TGTTCAGGTC
AACTATACAA CACAGGCACT TGCCACGTAT TCCCAATGTA CAAATCTTAA CCCAAATGAT
CCATTGGTAT GGAACCAGAT TGGCCTTGTG GATATGTCCT TGGGCAAGTA TACTGATGCC
CTGAATGCAT TTAACAAGGC AACTGCTCTT ACCGTCAATA ATGCTGAGAT CTGGAACAAC
AAAGGAGAGG CACTTGTTGC ACTGGGCCGG TATCAGGATG CAGTTCCCTG CTTTAACACC
GCGCTCACGT TGAATCCGAC CTACACTGCA GCACAGGAAA ACCTGAATGC GGCCATGGGC
AAGGGCCAGG TCTATACGTA TACAATGACA CCCACCCCGA CAGAGTCCCG CTGGTACCTG
GGAGGGATTA CCCCAACAAG CGCAACCCCG ACTGAAGTTG CAGTATCCCT TGAAAATATT
ACCCAGGTTG TCCAGACAGC AACGCCGGTT ACTACAATCC AGGCAACAAC ACCTGTCCCG
ACACGGACCA CGTATACGCC GCTCTCACCA GTTCCGGTGC TTGCGGGCCT TGGTTTTGCA
GTATTCCTTC TGGTCCGGAT CAGGAGAGGT AAGAACCGGT GA
 
Protein sequence
MQMSTKAGML VLVLLILIPA VAAEDATDWY TKAQNAASAG DYTDAVIYYT NAISLNPTYD 
AAYAGEAAAL NSLGQYSAAL TAANQALAIR SSPTALGAQA DALFYLGRYN DAIGAYTNYT
AVVTNQQSAY CNLAYSYVQV NYTTQALATY SQCTNLNPND PLVWNQIGLV DMSLGKYTDA
LNAFNKATAL TVNNAEIWNN KGEALVALGR YQDAVPCFNT ALTLNPTYTA AQENLNAAMG
KGQVYTYTMT PTPTESRWYL GGITPTSATP TEVAVSLENI TQVVQTATPV TTIQATTPVP
TRTTYTPLSP VPVLAGLGFA VFLLVRIRRG KNR
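
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. This is a rough sketch, not the pipeline that produced this record. It assumes the listed gene sequence is already in the coding orientation and in frame from its first base, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `translate` and `gc_content` are hypothetical. Only the first and last 10-base-grouped lines of the gene sequence are used as input here.

```python
# Build the codon table in the conventional NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGCAGATGA GCACAAAAGC TGGTATGCTG GTCCTTGTTC TCCTTATTCT CATTCCCGCG"
last_line = "GTATTCCTTC TGGTCCGGAT CAGGAGAGGT AAGAACCGGT GA"

print(translate(first_line))  # MQMSTKAGMLVLVLLILIPA
print(translate(last_line))   # VFLLVRIRRGKNR
```

The two outputs correspond to the first 20 and last 13 residues of the protein sequence, the final TGA being the stop codon. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.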