Gene Mboo_0737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0737 
Symbol 
ID5410995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp702193 
End bp703458 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content56% 
IMG OID640867958 
Producthypothetical protein 
Protein accessionYP_001403898 
Protein GI154150280 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.107641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.209363 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCTG TAAAGATTTC TGCAATCGCA CTCCTGGCGC TTGCCCTTGT CCTTGTGCTG 
TTCACCGCCG GCTGCACGCA GTCTGCGGGT ACCACCTCAG GGAGCACCAC TGCCGGTTCC
TCAACCGGTA CCACGACCAC ATCGTCCAAT GGATCTGTCG GGATTCTCGA AACCGCGGGA
GTCGGCCCGA TGCCCTCGCT TCTTGCCACC GGTCAGGTTG ACGGGTACAT TGCCTGGCAG
CCGGTTGTCG AAGCCGGTGT GGAAAGCGAT ATAGGCCATG TTGCAGTCTA TTCCAAGGAC
CTCCCGCCTG CAGGCGAGTG GAACAACCAC CCGCAGAACG TCTTTGTGGT AAGAAAGGAC
TTCTACGCCC AGAACCCGGA CTTTGTCAAT GACTTCTCTG CGCTCAACCT TGCAGCAACG
CAGTACATCA ATGATCACCC GAATGAAACG GCCGCACTCG ATGCTGACTG GCTGGCCGGC
AAGCAGAACT TCACCTACGG GAATGTAAGT GTCTCATCGG TAACTGCAGT CAGCAACTCC
CTGCCCACCA TCGCATTCAC GAACAATCCG TCAGATGCCT GGAAGGCCAG TACCGGAAAT
TTCGTGCAGG CACTTATCCA GCTCGGAACA GTCAAAGGCT TTGTTGCGAA CAGTACGAAC
CAGGATGCAG TTCTCTACAA CTTCCAGCCG TACACCAGCG CAACATCTAT CCTTGCTTCA
AAGAACGTGC CCACACCTGC ACCCATCCAG AACACGGTAG GGGTCGGGTA CCTCAATGCC
GTTGATCACT CGGCACTCTT TGTTGCGGTC AAGAACTGGC AGTACTTCAA TGACACCTAT
GGTATTGCCC TTAAACCCGA AGATCTCACC AAGGCAAAGC CGGATGTAGC TGACTTCATG
GTAAACGGCC AGAAAATTGC CACCGTCAAC CTGGTACCGG CAAATGCCGG GCCAAACCTG
ATGCAGCTCG CGGCAACCAA CTCAATTCAG ATGTCTTATG TCGGAGTTCC GCCGGCAATC
AACGCGATCG ACCAGGGTAC CCCTATCACC ATCGCCTACC CGATCGATAA CCTCGGTACC
GGCCTTGTGG TGGAAAATGG CGCACCCGCA CAGGACTGGC AGAGCTTTGC CGCATGGGCA
AAGGCGCGCT CGGATGCAGG AAAGCCGCTC GTGATTGCAG CGCCCGGAAA AGGCTCCATC
CAGGATGTTA TGATCCGCGC TGCCCTGCAG AGCAGCGGTA TTACCGTAAC TGAAGTGAAG
GTTTGA
 
Protein sequence
MKAVKISAIA LLALALVLVL FTAGCTQSAG TTSGSTTAGS STGTTTTSSN GSVGILETAG 
VGPMPSLLAT GQVDGYIAWQ PVVEAGVESD IGHVAVYSKD LPPAGEWNNH PQNVFVVRKD
FYAQNPDFVN DFSALNLAAT QYINDHPNET AALDADWLAG KQNFTYGNVS VSSVTAVSNS
LPTIAFTNNP SDAWKASTGN FVQALIQLGT VKGFVANSTN QDAVLYNFQP YTSATSILAS
KNVPTPAPIQ NTVGVGYLNA VDHSALFVAV KNWQYFNDTY GIALKPEDLT KAKPDVADFM
VNGQKIATVN LVPANAGPNL MQLAATNSIQ MSYVGVPPAI NAIDQGTPIT IAYPIDNLGT
GLVVENGAPA QDWQSFAAWA KARSDAGKPL VIAAPGKGSI QDVMIRAALQ SSGITVTEVK
V