Gene Mpal_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1474 
Symbol 
ID7270079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1523625 
End bp1525631 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content55% 
IMG OID643570097 
ProductNHL repeat containing protein 
Protein accessionYP_002466519 
Protein GI219852087 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0914572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTA AGTATTTTTT TTATTTTTTC TGCGCCCTGC TCCTCCTCTG TTGCAGCGCC 
CAGGCTGTTT CGGTTGAAGG TGGGTACGCA TATGTTACGC AATGGGGCAG TTCTGGTCAA
GAAGCCGGGC AGTTCAACCA GCCCTATGGT GTCACAATTG ACAGCATTGG CGATGTCTAC
GTCGTCGACA CATACAACAA CTGGATCCAG AAGTTCGATT CGAACGGCAC ATTCCTCAAA
AAATGGGGCA GTTTTGGCAC CGGAGACGGG CAGTTCAACA TACCCTATGA TATCGCCGTG
GACAGCGTCG GCTACGTCTA CGTCGCCGAC ATGAATAACA ACCGGATCCA GAAATTCAAT
TCGACTGGTG GTTACCTGAC CCAATGGGGC ACGAAAGGCT CGGAGGAAGG ACAACTCGAC
CAGCCAGGTA GTGTCGCGGT GGACAGCAGA GGACAGATCT ATGTCGCTGA CTGGGGCAAC
AACCGGGTTC AGGTATTCAA TTCGACCGGT GGCTACCTCA TGCAGTGGGG GAGTTCCGGC
TCGGGAGACG GACAGTTCGA CGGTCCGAAT GGAATTGCCA TAGACAGCAC CGGCAATGTC
TATGTCACTG ACGCATACAA CAACCGGATT CAGGAGTTCA ATTCGACCGG TGGCTACCTC
ATGCAATGGG GAAGTTCTGG CTCGGAGGCC GGGCAGTTCG AGATTCCCCA GGGTATCGCG
ATGGACAGTA ACGACAACGT CTACGTGGCC GACTCTGGCA ACCGGGTCCA GAAGTTCACG
TCGGCCGGCA CCTTCATCAC GCAATGGGGT ACGAAAGGCT CGGAAGCCGG GCAGTTCAGC
AATCCCTTTG GTATCGCCGT GGACAGCGCC GACAATGTCT ATATCACTGA CGTGTACAAC
AACCGGGTCC AGAAGTTCAC GTCGGCCGGC ACCTTCATCA CGCAATGGGG CAGTCAGGGT
TTGGAAGTCG GACAGTTCAA CATGCCCTAT GGTGATGCCG TGGACAGTGC AGGCAATGTC
TACGTCACCG ACCTGGGGAA CAGCAGGGTC CAGAAGTTTA CCGCGAACGG CACCTTCATC
ACAGAATGGG GCAGTTCGGG ATCGGGAGAC GGACAGTTCA ACATGCCCTA TGGTATCGCC
GTGGACAGCG CCGACAACGT CTACGTCGCT GATTTGAATA ACAACCGGGT CCAGAAGTTC
AATTCGACTG GTAGCTACCT GACACAATGG GGCATGACAG GCTCAGGGAA CGGACAGTTC
GACCAGCCAT GCGGTGTCGC GGTGGATCGC TTCGGCATCG TCTATGTCAC TGACTTTGGC
AACAACCGGG TCCAGATGTT CACGTCGGCC GGTGGCTACC TCTCCCAATG GGGCAGCCAT
GGTCCGGGAG CCGGGCAGTT CAGCGGTCCG AATGGAATTG CACTGGACAG CACCGGCAAT
GTTTATATCA CAGACTGGGG CAACAACCGG GTCCAGAAGT TCACGTCGAC TGGTAGTTAC
CTCAGGCAAT GGGGCAGTTC CGGCTCGGAA GACGGGATGT TCGGCGACTC AACGAGTGTC
GCCGTGGACC GTGACAGCAA CGTCTACGTG TCCGACAGTA GCAACCACCG GATCCAGAAG
TTCGATCAAA ACGGCACATT CATCACGAAA TGGGGGAGTT ATGGCTTGGA AGCCGGGCAG
TTCAACAGTC CTTTTGGTAT CACGGTGGAT GGTGCCGGCA ACGTCTATGT CACCGACGTG
AACAGCAATA GGGTCCTGAA GTTCGCCCCC ACTGGTACGA CCCCGGTGGT GATCGTCCCG
GGCGGGTCAG CCGTCCCGCA GGATCTCAAC CATGACGGAC TGTACGAGGA CGTCGATGGC
AACGGAGTCC TCGACTTTGG TGACGTGGTC CTCTTCTTCA ACCAGATGGA CTGGATCGCC
GAGAATGAAC CGATCAGTGC GTTTGATTTC AACAAAAACG GTCAGATCGA TTTCAATGAT
ATTATCACCC TGTTCAACGA GTTGTAG
 
Protein sequence
MKFKYFFYFF CALLLLCCSA QAVSVEGGYA YVTQWGSSGQ EAGQFNQPYG VTIDSIGDVY 
VVDTYNNWIQ KFDSNGTFLK KWGSFGTGDG QFNIPYDIAV DSVGYVYVAD MNNNRIQKFN
STGGYLTQWG TKGSEEGQLD QPGSVAVDSR GQIYVADWGN NRVQVFNSTG GYLMQWGSSG
SGDGQFDGPN GIAIDSTGNV YVTDAYNNRI QEFNSTGGYL MQWGSSGSEA GQFEIPQGIA
MDSNDNVYVA DSGNRVQKFT SAGTFITQWG TKGSEAGQFS NPFGIAVDSA DNVYITDVYN
NRVQKFTSAG TFITQWGSQG LEVGQFNMPY GDAVDSAGNV YVTDLGNSRV QKFTANGTFI
TEWGSSGSGD GQFNMPYGIA VDSADNVYVA DLNNNRVQKF NSTGSYLTQW GMTGSGNGQF
DQPCGVAVDR FGIVYVTDFG NNRVQMFTSA GGYLSQWGSH GPGAGQFSGP NGIALDSTGN
VYITDWGNNR VQKFTSTGSY LRQWGSSGSE DGMFGDSTSV AVDRDSNVYV SDSSNHRIQK
FDQNGTFITK WGSYGLEAGQ FNSPFGITVD GAGNVYVTDV NSNRVLKFAP TGTTPVVIVP
GGSAVPQDLN HDGLYEDVDG NGVLDFGDVV LFFNQMDWIA ENEPISAFDF NKNGQIDFND
IITLFNEL