Gene Mpal_1134 details

Gene Information

Locus tag: Mpal_1134
Symbol:
ID: 7270398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1163621
End bp: 1164718
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 46%
IMG OID: 643569767
Product: NMT1/THI5 like domain protein
Protein accession: YP_002466200
Protein GI: 219851768
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
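
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record, although both are consistent with its numbers. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1163621, 1164718)
print(length)                           # 1098
print(expected_protein_length(length))  # 365
```

Both results agree with the Gene Length and Protein Length fields.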


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.334038
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA AACTAAAAAT TATAATCACC GCAATTGTTG CGATTGCTGT TATAGGTATC 
GTCATCGGAC TAATCCTGAC CGGGACAGTG GGTTCCCCCC ATACCAAAAC GACCCAGATA
CAAGGAAACA GTGGTACGCA GACGACAAAC TCGTCCAATG TCAAGACCGT CGGAAATACT
TCAACAAATC AGAGTGGGGA CTTTTTTACC CTACGAACAC CGCTCTCGTC TTCACTTGCC
GTGATCGATC TGGCTGACCA GTTGGGATAT TACCGGGATA ACGGTATCAT CATCGAACGG
ACAGGCACTT CGACAGGAGG ACCCCAGAAT ATCATGACTG TTGCATCAGG AAGCAATGAT
GTCGGGGGGT CAGCGTTCTC AGCGATTGTA AATGCTATTG CAAAGGGAAC CAAAATCAAG
GTCGTTGTAC CCTCTATAGG AACCAGTTTA ACTGAACCGG ATTACAAATG GCTCGTTCTG
AATACGAGTT CCATTAAAAC AGCAAGTGAT CTCAAGGGAA AGACCATCGG TGTTAACACC
CTGGGAGCTC AGGCAGATTT CGTTACCCGG GCATATCTCT ATCAGCATAA TCTGACCCCA
TCTGATGTCC AGTTGGTAGT TCTCCCTATC GAAAATGAAG AACAGGTTTT ACGACAGGGT
CAAGTTGATG TTATTGCACC AAATGGAAAT TACCTGAAGA AAGCGGAATC AGATGGGGGG
GTTCGTGCCC TCTTTACAGA TGCAGAAGTA ACCGGCGATC AGGTAAAATC TGCAACATTC
ATGTCCACAG ATTTCATCGA AGAGCACCCG GATATTGTAC GAAAGTTTGT CAATGCTACA
ACACGGGCAA TTGAATGGGA CAAACAAAAC CGGGATCAGT CAAAGGTTCT TCTCGCAGAA
TACCTTGAAA AGAACAACGG CAATACGAAA CTGGCGGCAC TCCATAATGG CTGGGCAATT
CGAAGTCCCC CCACTATTAA TGACCAGGAT GTCCAGTTCT GGGTAGATGT CATGGTTAAA
GAAGGGCTTC TCAAGGAGGG ACAGATCAAA CCATCTGATG TCTATACAAA TGAATTCAAT
CCATATTACC AGAAGTAG
 
Protein sequence
MKKKLKIIIT AIVAIAVIGI VIGLILTGTV GSPHTKTTQI QGNSGTQTTN SSNVKTVGNT 
STNQSGDFFT LRTPLSSSLA VIDLADQLGY YRDNGIIIER TGTSTGGPQN IMTVASGSND
VGGSAFSAIV NAIAKGTKIK VVVPSIGTSL TEPDYKWLVL NTSSIKTASD LKGKTIGVNT
LGAQADFVTR AYLYQHNLTP SDVQLVVLPI ENEEQVLRQG QVDVIAPNGN YLKKAESDGG
VRALFTDAEV TGDQVKSATF MSTDFIEEHP DIVRKFVNAT TRAIEWDKQN RDQSKVLLAE
YLEKNNGNTK LAALHNGWAI RSPPTINDQD VQFWVDVMVK EGLLKEGQIK PSDVYTNEFN
PYYQK
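
The two sequences above are printed in space-separated blocks of ten, so they need to be joined before any comparison. The following is a minimal sketch, not the method used to produce this record: it joins the blocks, computes a GC fraction, and translates codon by codon. It relies on NCBI translation table 11 sharing its amino-acid assignments with the standard code (table 11 differs only in which codons may act as starts). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAAAGA AACTAAAAAT TATAATCACC GCAATTGTTG CGATTGCTGT TATAGGTATC"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MKKKLKIIITAIVAIAVIGI
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through the same functions would allow a comparison against the complete protein and the listed GC content.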