Gene Mboo_2326 details

Gene Information

Locus tag: Mboo_2326
Symbol:
ID: 5410521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2392344
End bp: 2393639
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 58%
IMG OID: 640869582
Product: S-layer-like domain-containing protein
Protein accession: YP_001405483
Protein GI: 154151865
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1361] S-layer domain
TIGRFAM ID:
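
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. Neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2392344
END_BP = 2393639
GENE_LENGTH_BP = 1296
PROTEIN_LENGTH_AA = 431


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 2393639 - 2392344 + 1 = 1296 bp, and 1296 / 3 - 1 = 431 aa.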


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.419836
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTTT TTGCCCCCGG TGCGATAATT CTCGTACTTT TGGCGGTACT CTGCGTAATG 
CCGGTGCTTG CCCAGGACAA ATATCTCGGT GGCTCTCCGC AGATAACGGC GTATATCGCC
GGGACCAACG AGTTTTCGCC CGGTGAGGAT GCGACGATCA CGGTAGTTAT CCAGAACAGC
GGGACTGCAG ATGTCCTGTT CCTCAACCAG GGAACGCTGC ATCAGGCGGA TATACCCACG
ACCGCGAAGT TATTGACGGT ATCTCTTACT TCCGGAGGGG CGCCGATCAA CATCACGACC
GGTCCGCAGG CACTCGGGGA TCTTGCGAGT CCCGGGATCA CGAGTGTACC GTTCACGGCG
AAGATCACCA CGGATGCGAA CATGGGATCC TACACGCTGC CGCTTGAGGT GCAGTACTCG
TATCTCTCGA ACAGCCTGGC AAACCAGCCG GCAAGCGACG AGGTCAACCC GGAGTATTCA
CCGGTAAACG TAACCATTCC GCTCACGGTC CGGATCCAGC CGGTGGTCCA GGTCACGGTA
CTGGACGCCG AGGCGTCGGG CCTTGCGGTC GGAACGGAAG GCTATGTCAA CCTGACCCTC
AAAAACACGG GGTACCAGGA TGGGACCAAA GCAACCGTGC AGATCCTGCA GCATGGTGAC
AGCGCCATCT ACCCGACCGA TAACAGCGTC TGGATCGGCG ACTTCCCGCG CAATGGCACG
GTCACCTGCC AGTACAAGGT CTCGGTCTCC GACAATGCCC AGCAGCAGAC CTATCCGGTC
GATGTGGAAG TGACCTATAC CAACTCTGAC GGTAACGTAG TCACATCCGC GATCGATACC
GTGGGTATCC CGGTGGCCGG CAAGATCACC TTTGCCGTGG TCTCTCCCCC GGCAGAGGTA
GTCCAGGGTG CAAATACCGT GATCACTATC ACCTACCAGA ACACCGGTGC GATCACCGCG
CGGAGCGCCG AGGCACGGCT CACCACCTAT GCCCCGCTCT CAAGCGACGA TTCCCTGGCC
TATCTCGGCG ATATCCCCCC GGGTGGAACC GTGACTGCAC GCTATGCCAT ATCCGCCGAT
ACTAATGCCG CAACCGGGAC CTATCCCCTC GATACCGAGG TCAGGTACCG CGATCAGCTC
GATAACAGCC AGGTCTCCGA TACCTTTACC GCAAACGTTA CCGTCACCCA AAAACCTCCC
ACATCCCCCC TGGTACAGGC TGCTGAAATT GTCGCTGCCA TCGCCATTAT CGCGGCGGCC
GGATATTACT TCTTTGTGAT GAGGAGGAAA CAATGA
 
Protein sequence
MRFFAPGAII LVLLAVLCVM PVLAQDKYLG GSPQITAYIA GTNEFSPGED ATITVVIQNS 
GTADVLFLNQ GTLHQADIPT TAKLLTVSLT SGGAPINITT GPQALGDLAS PGITSVPFTA
KITTDANMGS YTLPLEVQYS YLSNSLANQP ASDEVNPEYS PVNVTIPLTV RIQPVVQVTV
LDAEASGLAV GTEGYVNLTL KNTGYQDGTK ATVQILQHGD SAIYPTDNSV WIGDFPRNGT
VTCQYKVSVS DNAQQQTYPV DVEVTYTNSD GNVVTSAIDT VGIPVAGKIT FAVVSPPAEV
VQGANTVITI TYQNTGAITA RSAEARLTTY APLSSDDSLA YLGDIPPGGT VTARYAISAD
TNAATGTYPL DTEVRYRDQL DNSQVSDTFT ANVTVTQKPP TSPLVQAAEI VAAIAIIAAA
GYYFFVMRRK Q
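
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for all non-start codons, and the helper names (`clean`, `translate`, `gc_content`) are illustrative. Only the first and last lines of the gene sequence are used as examples. Pasting the full sequence into `translate` should give the full protein, and `gc_content` can be compared against the GC content field.

```python
from itertools import product

# Codon table built in NCBI order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
FIRST_LINE = "ATGAGATTTT TTGCCCCCGG TGCGATAATT CTCGTACTTT TGGCGGTACT CTGCGTAATG"
LAST_LINE = "GGATATTACT TCTTTGTGAT GAGGAGGAAA CAATGA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MRFFAPGAIILVLLAVLCVM
    print(translate(LAST_LINE))   # GYYFFVMRRKQ
```

The two printed fragments match the start and end of the protein sequence listed above. The last codon of the gene, TGA, is a stop codon and is not part of the 431-residue product.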