Gene Mpal_1345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1345 
Symbol 
ID7269950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1390762 
End bp1392792 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content54% 
IMG OID643569979 
ProductNHL repeat containing protein 
Protein accessionYP_002466401 
Protein GI219851969 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.871064 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATCTT CGTATTTTTT GTATATATTC TGTATTCTAC TCCTGTGTTG CAGCGTCCAG 
GTCGTCTGGG CCGAAGGGGG GTATGTGTAC ACCACACAAT GGGGCAGTTC TGGTTCAGGA
GACGGGCAGT TCAACCAGCC ATCTGGAGTC GCAGTGGACA GCGACGGAAA CATATACGTT
GTCGATACGA ATAACTTCCG GATCCAGAAA TTCAATGCAA CCGGCGGATT CACCACACAA
TGGGGCGGTT CTGGCCCGGG AGACGGACAA TTCAATAATC CTGAAGGTGT TGCAGTGGAC
AACAACGGCA ACGTCTATAT CGCCGACAGG GATAACAACC GGATCCAGAA GTTCAATTCG
TCCGGGGGAT TCCTCATGAA ATGGGGCAGT ATTGGTTCGG GAGACGGGCA GTTCAACCAG
CCATCTGGTG TCGCGCTGGA TAGTGCTGGC AATGTCTACG TGACCGACAA GCAGAACAAC
CGGATCCAGA AGTTCAATTC GTCCGGGGGA TTCCTCATGA AATGGGGCAG TGAAGGCTCA
GGAGACGGGC AGGTCCACTG GCCATCTGGT GTCGCGGTGG ACAACACAGG AAGCGTCTAC
GTAGTCGACT CGTATAACCA CCGGATCCAG AAGTTCAATG CAACCGGTGG ATTCATCACA
AAATGGGGCA GTGAAGGCAC AGGAGACGGG CAGTTCAAGA GTCCAACAGG TGTCGCAGTG
GACAGCGTCA ATAACGTCTA CGTGGTCGAC ACGGGGAACG ATCGAATTCA GAAATTCAAT
TCGTCCGGTG GATTCATCAC CACAGGGGGC AGTTTTGGCC ATGGGGACGG GCAGTTCTGG
TCTCCAGAGG GGATCACGGC GGACAGTGCC AATAACGTCT ACGTGGTCGA CACCTTGAAC
GATCGGATCC AGAAGTTCAA TGCAACCGGC GGATTCATCA CAAAATGGGG GAGCGCTCTT
GGCTCGTTTG ACGGGCAGTT CAGCGGATTA TCTGATGTCG CAGTGGACAG CACCGGCAAC
GTCTACGTGG CTGAATCAGG AAACTGCCGG ATCCAGAAGT TCAATGCAAC CGGCGGATTC
ATCACAAAAT GGGGCAGTGA AGGCTCAGGA GACGGACAGT TCAACGGGCC AACTGGTATC
GCGGTGGATA GCGCCGACAA TGTCTACGTG GTCGAAATAT GGAACTGCCG AATCCAGAAG
TTCAATTCGA CTGGCGGATT CCTCATGAAA TGGGGCAGTT ATGGTTCAGG AGACGGGCAA
TTCAACAAGC CATCTGGTAT CGCAGTGGAC AGTGCCGGCA ATATCTACGT CACCGATGCG
AATAAATGCC AGGTCCAGAA GTTCGATCAA AACGGCACAT TCGTCACGCA ATGGGGCAGT
TTCGGCACGG GGGACGAGCA GTTCTTCTAT CCCAATGACA TCACAGTGGA CAACGCAGGG
AATGTCTACG TGGTCGACTC AGATAACCCC CGAATCCAGA AGTTCAATTC GACTGGCGGA
TTCATCACAA AATGGGACAG TTTTAGCACG GGGAACGGGC AGTTCAACCG GCCATGTGGT
GTCGCAGTGG ACAGCGCGGG AAACGTTTAT GTGACCTGCT TCCAGATTTT GTACACCTGT
GGTCCTCCCG AGTTGATTCA ATCCCGGGTC CAGAAGTTCA CGCCGAATGG CACGTTCATC
ACGGAATGGG GCAGTCTGGG GTCAGGGGAT GGGCAGTTCA ACCGGCCATC TGGTGTCGCG
GTGGACAGCG CAGGAAACGT CTACGTCGCT GACTCGGGGA ACAACCGGAT CCAGAAGTTT
GCCCTGGTCA TCACGCCGGT ACCCCCATCC GGAAACACTC CCCTGGACCT CAACCATGAC
GGACTCTATG AGGACATCAA TGGCGACGGG ATCGCAGACT TCAATGACGT GGTCCTCTTC
TTCAACCAGA TGGACTGGAT CGCCGAGAAC GAACCGATCA GTGCCTTCGA CTTCAACAGG
AACGGCCAAA TCGATTTCAA TGATATCGTC ATGCTCTTCA ACGAACAGTA G
 
Protein sequence
MRSSYFLYIF CILLLCCSVQ VVWAEGGYVY TTQWGSSGSG DGQFNQPSGV AVDSDGNIYV 
VDTNNFRIQK FNATGGFTTQ WGGSGPGDGQ FNNPEGVAVD NNGNVYIADR DNNRIQKFNS
SGGFLMKWGS IGSGDGQFNQ PSGVALDSAG NVYVTDKQNN RIQKFNSSGG FLMKWGSEGS
GDGQVHWPSG VAVDNTGSVY VVDSYNHRIQ KFNATGGFIT KWGSEGTGDG QFKSPTGVAV
DSVNNVYVVD TGNDRIQKFN SSGGFITTGG SFGHGDGQFW SPEGITADSA NNVYVVDTLN
DRIQKFNATG GFITKWGSAL GSFDGQFSGL SDVAVDSTGN VYVAESGNCR IQKFNATGGF
ITKWGSEGSG DGQFNGPTGI AVDSADNVYV VEIWNCRIQK FNSTGGFLMK WGSYGSGDGQ
FNKPSGIAVD SAGNIYVTDA NKCQVQKFDQ NGTFVTQWGS FGTGDEQFFY PNDITVDNAG
NVYVVDSDNP RIQKFNSTGG FITKWDSFST GNGQFNRPCG VAVDSAGNVY VTCFQILYTC
GPPELIQSRV QKFTPNGTFI TEWGSLGSGD GQFNRPSGVA VDSAGNVYVA DSGNNRIQKF
ALVITPVPPS GNTPLDLNHD GLYEDINGDG IADFNDVVLF FNQMDWIAEN EPISAFDFNR
NGQIDFNDIV MLFNEQ