Gene Mpal_1598 details

Gene Information

Locus tag: Mpal_1598
Symbol:
ID: 7272140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1641023
End bp: 1642132
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 55%
IMG OID: 643570211
Product: periplasmic binding protein
Protein accession: YP_002466633
Protein GI: 219852201
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component
TIGRFAM ID:
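
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only a consistency check, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the gene record.
start_bp = 1641023
end_bp = 1642132
gene_length_bp = 1110
protein_length_aa = 369

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the last codon
# is the stop codon, which does not appear in the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.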


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.796704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA ATTACCAGAT AATCCTGATT ATCGCAGCGG TCCTCGTCAT CGCAGGAATC 
GCTGCTGTTG TAATGAGCGG TGCCGCCTCG GCGAATACAC CGTCATCGGT ATCACATGGA
AACAGTACAA AGACCATCTC CGATATGCTG GGCCGGACGC TGGTTGTCCC CCAGACGATC
ACTCGAGTGC TCAGCACGGA ACCGCCGACC ACGATCCTCA CCTATGTGCT GGCACCGGAC
AAACTGATCG GGGTCAACTT CGATCTGAAC CAGATCAACG GTTCCGTCTA CCTGCCTGAA
AAATATCGTT CCCTCCCCAA TGTCGGCGGC TGGTACGGCA AAACGACCGG GAATTATGAG
ACGTTCATAT CAATGAACCC CGAGGTCATT CTCTATGGCG GTATGAACGA GGGAAACTTT
TCAGGCACGC TGGATGAGCG CCAGCAGAAG TTCGGCGTAA TACCGGTTGC CGGGGTGCTG
GATTCCATGA ATGCCACAGA TTACAATCCA TCGATCCGCT TCCTTGGCAC ACTGCTTGGC
GCTGATCAGC AGGCAGCATC GCTCTCCGAG TTCTATAACC GGGTGCTCTC AAATGTCACC
TCACGGGTTT CAGGTATTCC AAAGAATGAG CGTGTCGGGG TTTATTACGC AGAAGGCCCC
AAGGGACTCC AGACCGATCC CTCGGGTTCT ACGCATGCGG ATCTGATCGA GCTGGCCGGC
GGGGTCAATG TGGCTGATTG TCAGATCACA CCAGGGAATG GTATGACAGC GGTCTCGATG
GAGCAGGTGA CGAAGTGGAA CCCAGATGTG ATCATCGTGG GCGACCCGGA CTTTTACAGC
ACGGTCTATA ACGACACCCT CTGGCAGTCG ATTCCAGCGG TGAAGAACCA CCGGGTCTAT
CTCGTTCCGC AGTCACCATT CACCTGGTTC GACCGGCCAC CGGGCGTCAA CCGGATCCTC
GGTATCCCGT GGACGGCGAA GATCCTGTAC CCCGAGAAGT TCACCGATAC GAACATGCCG
GCGCTCACCC GGGAGTTCTA TTCGAAGTTC TACCACTACA ACCTGACGGA TGACGAGGTG
AACAGTCTGC TGGATCCTCT GCTCCGGTAA
 
Protein sequence
MKKNYQIILI IAAVLVIAGI AAVVMSGAAS ANTPSSVSHG NSTKTISDML GRTLVVPQTI 
TRVLSTEPPT TILTYVLAPD KLIGVNFDLN QINGSVYLPE KYRSLPNVGG WYGKTTGNYE
TFISMNPEVI LYGGMNEGNF SGTLDERQQK FGVIPVAGVL DSMNATDYNP SIRFLGTLLG
ADQQAASLSE FYNRVLSNVT SRVSGIPKNE RVGVYYAEGP KGLQTDPSGS THADLIELAG
GVNVADCQIT PGNGMTAVSM EQVTKWNPDV IIVGDPDFYS TVYNDTLWQS IPAVKNHRVY
LVPQSPFTWF DRPPGVNRIL GIPWTAKILY PEKFTDTNMP ALTREFYSKF YHYNLTDDEV
NSLLDPLLR
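
Both sequences are printed in blocks of 10 characters separated by spaces, so they need to be joined before any further processing. The sketch below shows one way to do that, using only the first and last lines of the gene sequence as sample input. The helper name `clean_sequence` is hypothetical and not taken from any library.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(text.split()).upper()


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAAAGA ATTACCAGAT AATCCTGATT ATCGCAGCGG TCCTCGTCAT CGCAGGAATC"
last_line = "AACAGTCTGC TGGATCCTCT GCTCCGGTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# Split the first line into codons (non-overlapping triplets).
codons = [head[i:i + 3] for i in range(0, len(head), 3)]

# Print the cleaned length, the first codon, and the final three bases.
print(len(head), codons[0], tail[-3:])
```

For this record the first codon is ATG and the sequence ends in TAA, which is consistent with a complete coding sequence that includes its stop codon.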