Gene Mpal_1643 details


Gene Information

Locus tag: Mpal_1643
Symbol:
ID: 7272185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1691039
End bp: 1692400
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 54%
IMG OID: 643570256
Product: hypothetical protein
Protein accession: YP_002466678
Protein GI: 219852246
COG category:
COG ID:
TIGRFAM ID:
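
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1691039, 1692400)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Both results agree with the Gene Length and Protein Length fields listed in this record.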


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.906143
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCAT ATCTGCCGTC TAACAGAATT CAGTGGGGGA TCTGTCTCGC CGTCCTCACC 
CTCCTCCTCT CATCCGCACT GATCGGCTCG GTAAGCGCCA CCCCGCCTCC TCCAGATTAT
AAGGCCTGGT GGGCCACGAC AAACACCGAA TACCTCAATG CATCGGATGC CGGCCTCTTT
GTCGAGACCG ACGGATACAT CTCCCGGCAG GGACGTGAGA AGAGACCACC ATTCCTGTCG
AACGTCTCGG ATAATTATTA TTACACCAAC TTCACCTCTC CGAAGACCGG CGAACAGTAT
CTCACCGCCG TCTGGTACTT CAGTGACTGG GACGGATTCA TAAAAGAGAA GGACGAGCTG
CACCGATACC TTCAACAACA CGGTACAGTC ACTCCGGTCG CACTCAATCT CTCTCCAGAA
CTCGCCAGTT CCAATAGTTC TGACCTGGTG AACCTCTCCG GATCTGAGCA ATGGCAGGCG
ATCGATGCGA CCCAGTACGA GAGTGACGAG ACGTCGGGGT ACCTCCTCAC CTTCGTCATG
GATTCGCACC CGGGGGTGAA CTACTACATC GCGTACTACG GGGTCGTGGG TCCAACTGAT
CTGAGAGAGG AAGCTCATCA TCTCCACCTC CTTGCGATGA CGAACCTTCC TGTAATGGTA
CTGGGTCACT TATACGTCTT CAATCTGACG ACGCCGAAGA CCAATTCACA TGATTCGTCC
AGCCACACAT CCAGTCTCGC GTCCAACCCC ATATTCAACC CCAAAACCTG GATCCCATTT
CTCGTGGTAT TCATTCCGAT TCTCCTTCCG ATCATCGGCT TCACCTTCAT CCCGATCATC
GTCCTCGCGT ATATCTCCGC CAGGATATCT CTATGGATGG AGGCCCGCTT GACTCCACGG
ACCCGTGCCA TCCTCCCACT GAGCGTCGCC GGTTGTCTGA TCGGGGTGAT TGCCCTCAGG
TCTCTTTTCA TCGAAGAGAT CTCGCTTGGT TGGACCGATC TCATCGCCGC TGCAATCCTC
GTCCCAATGG GGGTGCTGAC CGTCAGACCA TTCTTTAAAG AGCGCCTGAA GTACGTGAAG
CCGAAGTCTG CCGTGTTCCT CTGCGTGGTC GGGACATTCT ACACCATTAT CATTGGTTCC
CTGTTATACC TTCTTCTGGG TGTAAGTTTT ACCTCGAACC CAGGTTCTCT CGATCAACCG
CTCTCGTACA TCGTGGGTCG CTCTGTCGTC CTGCGATCAG TCATCCTTTA TATCATGAGC
GTGGTCATCG CCATCGTTCT TTATGCAGTG ATCCTGTTCT GGGACCTGAT CCGGAGACGC
CGACAGAGCA AGAAGCATAA AGTGGAGGAT GAAGATCAGT GA
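
A GC content figure like the one in this record can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as text in blocks of ten bases separated by spaces and line breaks, as shown above; `gc_percent` is a hypothetical helper name, and the database may round or compute its value differently.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# The first two ten-base blocks of the gene sequence: 10 of 20 bases are G or C.
print(round(gc_percent("ATGTCGTCAT ATCTGCCGTC"), 1))  # 50.0
```

Passing the full gene sequence in place of the two-block example gives the value to compare against the GC content field.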
 
Protein sequence
MSSYLPSNRI QWGICLAVLT LLLSSALIGS VSATPPPPDY KAWWATTNTE YLNASDAGLF 
VETDGYISRQ GREKRPPFLS NVSDNYYYTN FTSPKTGEQY LTAVWYFSDW DGFIKEKDEL
HRYLQQHGTV TPVALNLSPE LASSNSSDLV NLSGSEQWQA IDATQYESDE TSGYLLTFVM
DSHPGVNYYI AYYGVVGPTD LREEAHHLHL LAMTNLPVMV LGHLYVFNLT TPKTNSHDSS
SHTSSLASNP IFNPKTWIPF LVVFIPILLP IIGFTFIPII VLAYISARIS LWMEARLTPR
TRAILPLSVA GCLIGVIALR SLFIEEISLG WTDLIAAAIL VPMGVLTVRP FFKERLKYVK
PKSAVFLCVV GTFYTIIIGS LLYLLLGVSF TSNPGSLDQP LSYIVGRSVV LRSVILYIMS
VVIAIVLYAV ILFWDLIRRR RQSKKHKVED EDQ