Gene Mboo_0313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0313 
Symbol 
ID5410848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp301688 
End bp302893 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content57% 
IMG OID640867529 
ProductPKD domain-containing protein 
Protein accessionYP_001403478 
Protein GI154149860 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCCCG CAGCCTGCTG GGGTCATTGG TCCTATGTTA TGCACCCGGA TACGATACTA 
ACCAGAAGAT GCGGAAAAAT AATCCGTGGC ATTTTTATTG TCGTTCTCAT CACTTCTCTT
CTTTTCTTCT CCCCGGCGGT TGCCGGAGAT CCAAAACTCA CGATCGTCCT GACCGCACGA
ATTGTTCCTG TCTCATCAGC CTGGTTTGCC ACAAACGTCA CGGAGGGACC TGCACTGCTT
CCGGTAGCAT TTACCGACCA GTCCACAGAC ATGCCGGTAT CCTGGAAATG GGACTTCGGG
GACGGGGCTG TGGATACCGT CCGGAACCCG GTGCACATCT TCGAGACCCC GGGCATATAC
ACGGTTCTGC TGACCGTTGG TACAATGTCC GGCAATTCGT CACGAGCAGC ACAAATTATC
GATGTGGGTG CGCCGCCGGA TGCGCTGGGA TCCAATACCC TCAGGACATC AGCGATCGGC
CCGGGAACAA CAACGACTCT TGACTTTTCA GCGCAGGCCG GGACGAGTAT CGATATAACG
ACAAACAACA GCGTACAGGC CGGTACACCG GTCACCGTGA CCGAATATTC CAGCCCGCCG
TTTCCAGAAA TGTCACTTCC GGCATTTACT GCTGCAGGCA GGTACATTGC AATCTCCGCA
CAGGGGCTGG AAGCAAATGT CAGCTCGGTA ACCATAACGA TGCAGGAACC GGCCCCCCTC
CCGGCAGGTG TCACCGAGGC AAACCTTGTA ATCGAATTCT TCGACCCTGC AACCGACACC
TGGACCGTTC TTCCCTGTAC CCTCGATACC ACCAGCCACA CGATATCGAC CACATCCACG
CATCTGAGCG CCTACGGGCT GTTTGCATCA CCAGTGACTG CAGCACCCGG GAGTGGCAGC
TCTTCTGGCG CAAGTGAAGG GGGCTTTCTC TCCGGAGGGA GTTCCTCCGG CGGCAGTGGC
AGTGTAGGTC TCCTGCAATT TTTCCAGCTG GCGCCTGTCC GGGCTCCTCT GACACCGGCA
GGAATTCATG TGAACACTCC GGCAGTAGTA CCTGTATCTG CCCAGAAGGC ACCGGTCCAG
ACTCCTGCAG CGGGAACATC GGCACAGACT TCACCCGGGC TTCCCCTCAT GGGAATAGCA
GTTGTCGTGG CAGTACTTGC CATAACCGGT GTTGCACTGC TGGCATTTGC CCGGAGGAAC
ACATAA
 
Protein sequence
MDPAACWGHW SYVMHPDTIL TRRCGKIIRG IFIVVLITSL LFFSPAVAGD PKLTIVLTAR 
IVPVSSAWFA TNVTEGPALL PVAFTDQSTD MPVSWKWDFG DGAVDTVRNP VHIFETPGIY
TVLLTVGTMS GNSSRAAQII DVGAPPDALG SNTLRTSAIG PGTTTTLDFS AQAGTSIDIT
TNNSVQAGTP VTVTEYSSPP FPEMSLPAFT AAGRYIAISA QGLEANVSSV TITMQEPAPL
PAGVTEANLV IEFFDPATDT WTVLPCTLDT TSHTISTTST HLSAYGLFAS PVTAAPGSGS
SSGASEGGFL SGGSSSGGSG SVGLLQFFQL APVRAPLTPA GIHVNTPAVV PVSAQKAPVQ
TPAAGTSAQT SPGLPLMGIA VVVAVLAITG VALLAFARRN T