Gene Mboo_1601 details

Gene Information

Locus tag: Mboo_1601
Symbol:
ID: 5412196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1672601
End bp: 1674076
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 55%
IMG OID: 640868835
Product: NHL repeat-containing protein
Protein accession: YP_001404761
Protein GI: 154151143
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
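
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database tool.

```python
# Values copied from the Gene Information record.
START_BP = 1672601
END_BP = 1674076
GENE_LENGTH_BP = 1476
PROTEIN_LENGTH_AA = 491


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 1674076 - 1672601 + 1 = 1476 bp, and 1476 / 3 - 1 = 491 aa.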


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0447928
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.136115
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCGT ACCGTCTGGT ATTGGCAGGA TTTCTCCTCA TTGTATTGCT GATAATAATT 
GTGCCGGTAT CGGCTGCCTC TCCTATATTC ACCCATGTTA CCCAGTGGGG ATCATCAGGA
TCAGGAAACG GGCAGTTTAG CCAGCCGGAG GAGGTCGCGA TCAACACCAC CGGGTACATC
TACGTGACGG ATAACCAGAA CAACCGGATC CAGGTATTTG ATCCGAGCGG AAACTATGTT
TCCCAGTGGG GATCAGCGGG ATCAGGAAAC GGGAAGTTTG AAGGTCCCTC CGGGATTGCC
GTCAATACGA CCGGGTATGT CTACGTGACA GACTATGGCA ACGGCCGGAT TCAGGCATTT
GATCCGAGCG GAGCCTATGT CACCCAGTGG GGAGGTTTCT ACCACCTCAT TGGTGTCGCC
GTCAACACGA CCGGGTATGT CTATGTGGCT GACTCGGGCA ACAACCAGAT CAAGGTATTT
GATCCGAGCG GGACCTCTGT TACCCTGTGG GGATCAGCAG GCTCGGGAAA CGGGCAGTTT
AACCTGCCCT GGGTTATCAC CGTCAATACT ACCGGGTATG CCTATGTGTC AGACTGGAAC
AACAACCGGA TCCAGGTCTT TGGTCCGAGC GGAAACTATG TTTCCCAGTG GGGATCAGCA
GGCTCAGGAA ACGGCCAGTT TGACCACCCC TATGGTGTCG CCATCGACTC GACCGGGTAT
GTCTATGTGG CCGACTCGGT CAACAACCGG ATCCAGGTTT TTGATCTGAG CGGAAACTAT
GTGACCCAGT GGGGATCGGG GTTTAACGAT CCCTCCGGGA TTGCCGTCAA CTCAACCGGG
TATATCTATG TGGCGGACGC GGGCAACAAC CGGATCCAGG AGTTTTTGCA GATCACTCCT
CCTGTCGCCT CATTCACGGC CACTCCCCGT ACCGGTACCA CCACCCCCCT TACCGTTCAG
TTCAATGACA CCTCGGCATA TTCACCGGAT CAGTGGAACT GGTCGTTTGG CGACGGCCAG
TGGTATAACA CCACGGATGA TTCTCTCCGT AATATCACGC ATGAGTATAC CCAGACCGGA
AGTTACACCG TCAGTTTAAG CGTCCAGTAT GCGGCGGGAT CCGATACCAG CTCTCAAGCC
GGGTACATCA CGATTACCTT ACCTACTACC GCACCCACTA CCGCACCGAA TTCCGGGGGT
CATGCTGCAC GAACCGACTA TTGGGTCAAC TCCGGCAGTG CAAACGACCA GGGTTACACC
GGCCCGGCCC CGACGCCGAT GGGGGTATCT CCTGCTGCGC CATCGGTCAC GAATTCAGGT
TCCCCTGGTG TACCAGCGCC AGTTCAACCT CTGGCTGCAT CAACCGTAGT CCCGGAACCC
CTGCCAACGA ACACTCCGGC CACGCCACCT TCCCCGCTCT CCCCTATAAC GACATTCATC
AGTGAATTGC TGCAGGATAT ATTCGGCACA AAGTGA
 
Protein sequence
MKSYRLVLAG FLLIVLLIII VPVSAASPIF THVTQWGSSG SGNGQFSQPE EVAINTTGYI 
YVTDNQNNRI QVFDPSGNYV SQWGSAGSGN GKFEGPSGIA VNTTGYVYVT DYGNGRIQAF
DPSGAYVTQW GGFYHLIGVA VNTTGYVYVA DSGNNQIKVF DPSGTSVTLW GSAGSGNGQF
NLPWVITVNT TGYAYVSDWN NNRIQVFGPS GNYVSQWGSA GSGNGQFDHP YGVAIDSTGY
VYVADSVNNR IQVFDLSGNY VTQWGSGFND PSGIAVNSTG YIYVADAGNN RIQEFLQITP
PVASFTATPR TGTTTPLTVQ FNDTSAYSPD QWNWSFGDGQ WYNTTDDSLR NITHEYTQTG
SYTVSLSVQY AAGSDTSSQA GYITITLPTT APTTAPNSGG HAARTDYWVN SGSANDQGYT
GPAPTPMGVS PAAPSVTNSG SPGVPAPVQP LAASTVVPEP LPTNTPATPP SPLSPITTFI
SELLQDIFGT K