Gene Mpal_2302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2302 
Symbol 
ID7270563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2449329 
End bp2451239 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content57% 
IMG OID643570907 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002467310 
Protein GI219852878 
COG category[H] Coenzyme transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0725] ABC-type molybdate transport system, periplasmic component
[COG2998] ABC-type tungstate transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.319071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.250502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAATAT CTATGTCTGA ACGAACCATC ACATGTTGCA TCGCTCTGGT TGTGCTGCTG 
CTGATCAGTC TCTGCATTTC AGGATGCATG ACAACCCAGC AACCGTCTGC CAATGGTACG
GCAGCCAATG TGACCAGTGC ATCCAATGCC TCAACTCCGG TGAAGGAACT GCTGGTCGCG
ACGACGACAA GCCTGTACGA TACCGGGCTC CTCGATTATC TTGCACCGAT CTACGAAAAA
CAGTACAATA CCCACCTGAA GATCACATCA CAGGGGACTG GAAAGGCAAT AGATCTAGCC
CAGCGTGGGG ATGCTGATGT GCTGCTGGTC CATTCACCTT CAGATGAGGT GACCTTCATG
GAGAACGGGA ACGGTGTCAA CCGGCGCTCG TTTGCGTACA ACTACTTCGA GATCGTTGGT
CCTGCCAGCG ACCCGGCACA GATCAAGGGG CTCTCGCCTG AGGACGCATT TAAGAAGCTC
AAAGCGGACG GCGCAAACAA CACCGCCAAT GTCGCCTTCG TCTCCCGCGG TGACGGGTCC
GGAACCCAGT CTGCAGAGAA GAAGATCTGG AAGAACGCCG GCTTCGACTA TGCCACCCAG
ATCGAGAAAT CCGGGGACTG GTATGTCGAG GCAGGCAAAG GAATGGGTGA GACCCTCCAG
ATGGCCGACC AGAAGCAGGC CTACACCCTG ACCGATGAGG GGACGTTCCT CGCATACAAG
GGGAACCTGA CTCTCATACC GGTGATCATC AAAGGGGATA GCCTACTCAA CATCTACAGT
GTGATGTCAG TGGTGCCGAA GAACAATGCG ACGGCTAGCA TTACTGCCGC CAACACCCTC
GCGACGTTCC TGACCGACAA CACCACCCAG CAATTGATCG CCGACTATGG GAAGGACAAG
TATGGCAAGG GGCTCTTCTC CCCGATGAAT GCAACGATGG CAAAGACGTT CAAGGTCGAC
AACTCGGCCC CGCTGAATGC GACCACCCCA GCGATAGTCT TCCAAGCTGG CTCTCTGGCG
ACCCCGTTCA AGGCCGTCAA GTCGATCTTT GAGAAGGACA CCACTGGGGC TGAGGTAGAA
CTCTTCAGTG GCTCCTCGAT CACGATGATC GAGAAGGTGA CCAAGATGGG CGAGAAGGCT
GACATCGTCG CCAGTGCTGA CGCCGACCAG ATCTCCCGTT TGATGGTCCC TGACAACGCC
TCGTTCACCC TGAACTTCGC GCGGAACGCA ATGGTGCTCG TCTACACCAA CAACAGCACC
AGTGCGTCCA CCATCACAAA GGACAACTGG TACTCGGTGC TGGGTCAGGA TAATGTCAAA
CTCGTCACCA GTGACCCGAC CTCTGACCCA GGGGGTTACC GTGCCTACAT GACACTGGTC
CTGGCCGAGT CATATTACAA GGTCCCCGAC CTCTTCAAGA AGGTCGTCAG TGATCACAGT
GCGATCACAG TGAACAAGAC CGGAGCCAAC ACCACCATCG ACCTGACGAA CCCGAAGCAG
GACAACAAGG GTCTGTTGAT TCCGAACGCC ACCGGCCCGA CCTACCTGGA CCTCCTTACG
AGCGGTAAGG CCGACTATGC CCTGGTCTAC CGGTCGAGCG CTGTCGACGC CGGGCTCCCG
TATATCGAGC TGCCAGATGA GATCTCGCTG GCAAACCTGA GCATGAGCAA AGATTACGCT
TTGGTGCAGG CGAAGACCCC ATCAGGACTG ATCCCTGCCA CCCCGATCGT CTATGGACTG
ACGATCCCGA CCCGGGCTGC GCATCCAGAC CTGGGTGTCG CGTTCATCAA GGCGCTCGCT
TCTGACCAGG GCGCAGCAGC TCTGAACAAG AGTGGGTTGG CCCCCATGAC CCCGATGACG
GCGACCGGCA CGGTGCCGGA CTCACTGAAG TCGCTGGTCA GCACCACCTG A
 
Protein sequence
MVISMSERTI TCCIALVVLL LISLCISGCM TTQQPSANGT AANVTSASNA STPVKELLVA 
TTTSLYDTGL LDYLAPIYEK QYNTHLKITS QGTGKAIDLA QRGDADVLLV HSPSDEVTFM
ENGNGVNRRS FAYNYFEIVG PASDPAQIKG LSPEDAFKKL KADGANNTAN VAFVSRGDGS
GTQSAEKKIW KNAGFDYATQ IEKSGDWYVE AGKGMGETLQ MADQKQAYTL TDEGTFLAYK
GNLTLIPVII KGDSLLNIYS VMSVVPKNNA TASITAANTL ATFLTDNTTQ QLIADYGKDK
YGKGLFSPMN ATMAKTFKVD NSAPLNATTP AIVFQAGSLA TPFKAVKSIF EKDTTGAEVE
LFSGSSITMI EKVTKMGEKA DIVASADADQ ISRLMVPDNA SFTLNFARNA MVLVYTNNST
SASTITKDNW YSVLGQDNVK LVTSDPTSDP GGYRAYMTLV LAESYYKVPD LFKKVVSDHS
AITVNKTGAN TTIDLTNPKQ DNKGLLIPNA TGPTYLDLLT SGKADYALVY RSSAVDAGLP
YIELPDEISL ANLSMSKDYA LVQAKTPSGL IPATPIVYGL TIPTRAAHPD LGVAFIKALA
SDQGAAALNK SGLAPMTPMT ATGTVPDSLK SLVSTT