Gene Mboo_1737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1737 
Symbol 
ID5411969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1817136 
End bp1818545 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content55% 
IMG OID640868972 
Producthypothetical protein 
Protein accessionYP_001404897 
Protein GI154151279 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATTG GTGTCTGCTA CCCGGACAAG TATGCCGTTG AAGAAACCTT CCAGCTTCTC 
AAAGTACCCT GGGAATGGTA CGCGCCCGGC CGGCAGTACG ATATTGTCAT TTCCCGCAAA
GCTGATGTGC CGGAATGGAC CGGCAGTCTT GTTGATCTTA CCGGCAACGA TGTCTTCAGC
AAAGTGGCCG GGCTGCTCAA CACCGGCGCG GAACACCTGC ACGAACCAAC GTGCGATATC
CTGCTCGATG CTCTCCGGCA GGAACTCAAA AAACATACTC TGCTTGTGGA GATCCCGCCC
ACCCCGTGGG GACATCCCTA CATGGTCGCG CTGACCCACG ATGTGGATCT CACCTCAGTC
CGGGAGTGCC GGTGGACTAC CGTGGGATAT GCAGCATACC AGTGCTTTAC GCAGGGGAGC
TGGTCAGCGG GATTACATCT CCTCCTTGCG CGCTGCGGAT TTTGTTCCGA TCCATGGGTG
CTCTTTTCCC GCTGGAAAGA GTTTGAGGAC TCTCTTGGGG TCCACTCTAC CTTCTTTTTT
GTCCCAAAAC CGGGGAAGCC AGGCATCAGG GCACACCCGT ACCGGGCCAT CAGTTATAAT
GTAACCCAGA AATCAGAAGT GCTGCGTGAC CTGGAAAACG GCGGATGGGA GACCGGCGTC
CACGGCCTTG ACAACTGGGC TGATGCTGGT TCCGGCGAAC AGGAGAGAGT ATTTCTAGAG
CCGGTGACGG CACATCCTGG CAACCGGACG CACTGGCTCC TCTTTGATAC AAACAGCTGG
AAATTTCTCG ATGCTGCGGG ATATGTATAC GATACGACCT TCGGGTACAA CGACGATGCC
GGGTTCCGTG CCGGCACCCT GCAGGCATAT CGGCCGCGCG ATGTAAAAAA TCTGCTGGAA
CTCCCGCTCC ATATCCAGGA TCTCGGGCTC TTTGGGAAAT TCTGCTGGGC ACCTACAGAG
AACGGTTGGG AAAAAACACC GTGCCTGCAT CTCGATGAAC CAACGGCACG TGCGTGGTGC
GACCGCATCT TTGCGTACGC ACGTACTTAT GGAGGAGCAG TAACGGTACT CTGGCACTAC
GAGAATCTTA TGCCGCCCAA GGGCTGGTCC GGGTTTTACG CGGACATGGT CAGGCAGGCG
AAGGTGGACG GGGCGTGGGT GACGACAGCG GGGGAAGTGG CGAAGTGGTT CAGGAAACGG
AGGGAGATCT ACTTTACGGC GTTACAGTTC CAGAATGAAA TTATTATTAC TGCTGATGGA
ATTCTCCCGG AACAAGATAC ACACGCCCTA CAAGCGCGAG TACATATCCC GCCGGATCAA
ATTGTATCGA TAGACGCGGA GTATTCCCCA GCTTGTGGAT ATATTGACAT AAAGATGGAC
AAACGGTGCA TAACGGTGCG CATAGCATGA
 
Protein sequence
MSIGVCYPDK YAVEETFQLL KVPWEWYAPG RQYDIVISRK ADVPEWTGSL VDLTGNDVFS 
KVAGLLNTGA EHLHEPTCDI LLDALRQELK KHTLLVEIPP TPWGHPYMVA LTHDVDLTSV
RECRWTTVGY AAYQCFTQGS WSAGLHLLLA RCGFCSDPWV LFSRWKEFED SLGVHSTFFF
VPKPGKPGIR AHPYRAISYN VTQKSEVLRD LENGGWETGV HGLDNWADAG SGEQERVFLE
PVTAHPGNRT HWLLFDTNSW KFLDAAGYVY DTTFGYNDDA GFRAGTLQAY RPRDVKNLLE
LPLHIQDLGL FGKFCWAPTE NGWEKTPCLH LDEPTARAWC DRIFAYARTY GGAVTVLWHY
ENLMPPKGWS GFYADMVRQA KVDGAWVTTA GEVAKWFRKR REIYFTALQF QNEIIITADG
ILPEQDTHAL QARVHIPPDQ IVSIDAEYSP ACGYIDIKMD KRCITVRIA