Gene Mpal_1007 details

Gene Information

Locus tag: Mpal_1007
Symbol:
ID: 7271741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1034566
End bp: 1036659
Gene Length: 2094 bp
Protein Length: 697 aa
Translation table: 11
GC content: 56%
IMG OID: 643569644
Product: periplasmic copper-binding
Protein accession: YP_002466078
Protein GI: 219851646
COG category:
COG ID:
TIGRFAM ID:
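
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(1034566, 1036659)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa
```

Under those assumptions the computed values agree with the Gene Length (2094 bp) and Protein Length (697 aa) fields.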


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.525995
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.505832
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTCAT ATCCTTCGCT CAGGTCTCTC CTCATCCTTC TCATCTGCTG CAGCCTTCTC 
ATTTTCCCGG CAGCAGCCGA AACGGTGAAC GAATCATTGA AAATGCCGGC ATTCTTCTCG
GATATTCAGA ACAGGGGGGT AGATCTCCAC GCCACCGACG AGACACCAGT CTACACGGTG
ATGCACCCCT CATCCGAGCA GCTCAAGGAA TGGGATGCTC AGTACAATGC ATTACCCCTG
GTGTCGGTGC CGGCTACCGG TTCTTCCAGC CAGCTCCAGA ATAATACCAC ATCTGTGGGC
GGATACAAGG ATCTCCTCCC TTACCTGGAT TACATCCCGG CTGAACGAAA TCAAGGATCG
ATCGGGAATT GCTGGGTCTG GGCGGGAACC GGGGTGATGG AGATCGCCCA TGCCGTCCAG
AACGGGGTGA AGGACCGATT CTCGATCTCG TACCTGGACG CCAACTATAA TGGTGGTTCT
GGGAATAAAT GGGCAGGGAC TGGAGGAGAT TTCTTTAATC TCGCTAATTT TTATACCACA
ACCGGGATAG CAGTTCCCTG GTCCAACCTG AACGCTGAGT ACCAGGACGG GACCACATGG
TCAGGGACAG AACAGCGGTC CTACGAGCCT GCCTTTGCAA TCTCCACAGA ACCCCACTAC
CAGATCTATC AGATCAAAGC GCAACGGATC GAGACACGAC ATATTGGCAA CGAGCAGGCG
ATCAGCAACA TTAAGGCTGT GCTCGACCAG AACCGTGCCA TTGGTTTTGG GTTCAATCTC
CCCAATTCTA CGGCCTGGGG ATCCTTCATA GAGTTCTTCA TGAACAGTTC GGAGGAGACC
GCATGGAACA TGACCCCCTG GCAGAACACA CTCTATAATG AAAACGAGGG CGGGGGGCAT
GAAGTACTCT GCGTCGGATA CAATGACACC GACCCGACGA ACAGGTACTG GATCATGGTC
AACTCCTGGG GAGTCTCGGA TGGGCATCCA CGCGGCGTCT TCCGAGTTTC GATGGACATG
GACTACTCTG CGACAATGCA GTTCAGAGAC AATGATGATT GGGCTGCCCT GGTGTGGCAG
ACGCTGGACG TCAACTTTGC AGCCACCCCA TCACCGGCGC CGAAGGAGAT CAGTTCCCTC
CCGTACACCT GCTCGGTTCC CGGTGAATAT TACCTTGCAA AGGATCTGAT CAGCAGCGAC
GCCGATACGG GGATCCTGGT CACGGCACAG AATGTGACGA TCGACGGGAA GGGACACCTC
CTTCGGGGCT CCGGCCGGCA GGGATCGGTC GGGATCCTCG CGTACAACAA CGGAGACCCT
GTCGATGGAC TGAATATCAC CAACCTGGCC ATCTCAAACT GGGAGGACGG GTGTTACCTG
TATCATGCCA CCAACGGATC GGTGAATGAT ACCACCATCT CCGACTGCTC GTATGCCGGA
ATCTTCCTGG ATGGAGAAAC TACCAACCTC GCAATCGCTG ACAACACACT CACCTCCAAC
TATCGCGGCC TCCTCTCCCG TTCCACAGCC GATATCAGGG TCGAGCACAA CAGGATAACC
GAAAGTCTGA ATACCGGGCT GTACCTCCTC TCAATGAACC AGAGTTTAAT CGCAGACAAC
CAGATTGTTA ACGGACAGAA CGTAATCATC TCTGGGTGGG TCAATACGAC AAGTTGGAAC
ACCAGTAAGA CCACCGGACA GAACCTGGCA GGCGGCCCGT ACCTGGGCGG CAACTACTGG
GGGAACCCCA CACAGACCGG GTTCTCCGAC CTTGCAGTCG ATCAGAACCG GGACGGATTC
GCTGACAGCC CAAACCAGAT CGCAGCCGGC ACCATGGACC AGTTCCCCCT CGTCGCCTAT
GCGAACCCTG GTCCACAGCC GATCCCGCCG AACCAGCTTG ACCCGACCGA TCCCGATCAC
GACAGGCTCT ACGAGGATCT GAACGGGAAC GGCAAGCTCG ACTTCGGCGA TGTGACCACC
TTCTTCAACC AGATGGACTG GATCGCCGAC CATGAACCGG TGCAGCTCTT CGACTTCAAC
GGCAACCAGC AGATCGACTT CGGCGACGTC GCCGCGCTCT TCTCACGGCT GTGA
 
Protein sequence
MSSYPSLRSL LILLICCSLL IFPAAAETVN ESLKMPAFFS DIQNRGVDLH ATDETPVYTV 
MHPSSEQLKE WDAQYNALPL VSVPATGSSS QLQNNTTSVG GYKDLLPYLD YIPAERNQGS
IGNCWVWAGT GVMEIAHAVQ NGVKDRFSIS YLDANYNGGS GNKWAGTGGD FFNLANFYTT
TGIAVPWSNL NAEYQDGTTW SGTEQRSYEP AFAISTEPHY QIYQIKAQRI ETRHIGNEQA
ISNIKAVLDQ NRAIGFGFNL PNSTAWGSFI EFFMNSSEET AWNMTPWQNT LYNENEGGGH
EVLCVGYNDT DPTNRYWIMV NSWGVSDGHP RGVFRVSMDM DYSATMQFRD NDDWAALVWQ
TLDVNFAATP SPAPKEISSL PYTCSVPGEY YLAKDLISSD ADTGILVTAQ NVTIDGKGHL
LRGSGRQGSV GILAYNNGDP VDGLNITNLA ISNWEDGCYL YHATNGSVND TTISDCSYAG
IFLDGETTNL AIADNTLTSN YRGLLSRSTA DIRVEHNRIT ESLNTGLYLL SMNQSLIADN
QIVNGQNVII SGWVNTTSWN TSKTTGQNLA GGPYLGGNYW GNPTQTGFSD LAVDQNRDGF
ADSPNQIAAG TMDQFPLVAY ANPGPQPIPP NQLDPTDPDH DRLYEDLNGN GKLDFGDVTT
FFNQMDWIAD HEPVQLFDFN GNQQIDFGDV AALFSRL
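
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be joined into one string, its GC fraction computed, and its leading codons translated. It relies on the amino acid assignments of NCBI translation table 11, which are identical to those of the standard code; the helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares these
# amino acid assignments with the standard code; only the start codons differ.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = join_blocks(
    "ATGTCTTCAT ATCCTTCGCT CAGGTCTCTC CTCATCCTTC TCATCTGCTG CAGCCTTCTC"
)
print(len(first_line))             # number of bases on the line
print(translate(first_line[:30]))  # first ten residues
```

The first ten translated residues can be compared against the first block of the protein sequence (MSSYPSLRSL). Running `gc_fraction` over the full joined gene sequence would give a value to compare with the GC content field.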