Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0734 |
Symbol | |
ID | 7270468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 730550 |
End bp | 732334 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643569377 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002465821 |
Protein GI | 219851389 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.780369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAA GAAGAGGACC CAGTGCGTTC TTCGTCTGCT GTTCGATCCT GTTTCTGATC CTCATCTGCT GCATGCCAGT CTCCGCCGGG ATCCCGTGCG ATACGAATGG CGACGGGGTT GTCTCTGAGG ACGAACTCTC GACAGGAATT CTGGATTATC TGAACGCACC CTCTGCGGCC GGTCCCGCCA CGCTCGATAA GGCAGATCTC TCCCTCGCGG CTCACAACAC CCTGCACATC CCGTACGGCC AACTCGTCGT AGCGGTCGAC AGCGAAAATG AACTCCTGCC GAGGGCATAT ATCGATGAAA AAGGGGTTCC AGAAGATTCT CTCATCTATG AGGGGTTCGT TACCAGAGCG CGTGACAAGG AAGATCTCGG CTGGCTGGCG TCTTCGTGGG AGCACTCTGC CGATGGAAAA ACCTGGACGT TTCATATCCC GTCAGGGTCA ACCTGGCAGG ACGGCGTTCC CGTCACCGCG AACGACGCTG TCTTCTCATA CCAGTATCTC AACGATAAGG GACTCAAGAA CAGGAACGTC CTGTCGAATG TCACCAATGT GACGGCTCTC AACAACACGA CCGTGGTCTA TTCCCTGGAT ATCTGCATGC CGCAGTTCCC CGACCTTCTT GCGACCGGAC CGGGCATTGC GATCTTCCCG GAACATATCT GGTCGTCCAT CTCAAATCCA AATAAGGAAT CTGATCCTGA TTACATCGGT TCCGGTCCTT TCATGTACAA TTCGAGCATC TCAGGAGATT CCTACAAACT TGAAGCATAT CGGGGGTATC ATGGATCGGT ACCGTATGTC AACACGATTG TTCGAAAACT CTACTCCTCT GAAGATACCA GGGTCCTTGC CCTGAAGAAC GGGGATGTCG ACTTCGTATC AGATCTGGCT CCGGCCACCG CCCACTCGCT CCAGGGAGTG AACGGGATAG GAGTGACGAC AATTCCCGCC GGAGGTAAGT CCTTCGAAGT CGCGTTCAAC CTCGACATCT ATCCGGCGAA CAACACGCTC TTTCGGGAGG CTCTCAGCCA TGCTGTGAAC AGGGACAGGA TGTGTCAGCT GATCGACCGG CAGGCAACCG AGGCGACCAG TACGGCATTT CTGATCCCCG CCCTCGCCGG CGACCAGGTG AACAACGCCA CGAACGACAG GTACGGCTAC GACCTCACCG CAGCAAAAGC GCTGCTCGCC CAGGCCGGCT TCACCCTAAA GACCGAGAAC GGGAAGAACA TCCTCTACGG ACCGGACAGC CAGGTGGTCA CGATCACCAT CCCGCTCGGC GGCAAAGCCT CGGCCAACGG TGTCGACGAG AAGATCGTCC AGGTGCTCAA GGAGGACTGG GAGACAGCGC TCGGCATCAC ACTCACCGTC GAGAACCTCA AGGCCGATGA CGATCAGTAT GACAAACGGA TCGACGGAGA CGCAGTCCGT ATCGACGGCA TGCCGAGTTA CTTCCACGAC GACATCGCCC GCCTGAATAA TTTCCAGTCG AGTCCGCTTG GCAAGAATTA CTACCACTTC GACAACGGAT CCTTCAACGA ACTGATCGAC ACCCTCCAGA ATACCGTGGA GGAGTCACAG AGAAAGGCGA TAGGAGACCA GCTTCAGGAG ATCCTCGTCG AGCAGATCCC CTGCATACCG GTCTGTTCGA TGGACGCCTT CGATGCCTAC CGTTCCGACC GGTTCTACGG TTTCGATTCC CTGATACATA AGGACGGGGG CGATATCAAC ATCTTCTCAC ATGTCAAGCC TGCGAGTACG GAGGTGAACA GGTAG
|
Protein sequence | MSERRGPSAF FVCCSILFLI LICCMPVSAG IPCDTNGDGV VSEDELSTGI LDYLNAPSAA GPATLDKADL SLAAHNTLHI PYGQLVVAVD SENELLPRAY IDEKGVPEDS LIYEGFVTRA RDKEDLGWLA SSWEHSADGK TWTFHIPSGS TWQDGVPVTA NDAVFSYQYL NDKGLKNRNV LSNVTNVTAL NNTTVVYSLD ICMPQFPDLL ATGPGIAIFP EHIWSSISNP NKESDPDYIG SGPFMYNSSI SGDSYKLEAY RGYHGSVPYV NTIVRKLYSS EDTRVLALKN GDVDFVSDLA PATAHSLQGV NGIGVTTIPA GGKSFEVAFN LDIYPANNTL FREALSHAVN RDRMCQLIDR QATEATSTAF LIPALAGDQV NNATNDRYGY DLTAAKALLA QAGFTLKTEN GKNILYGPDS QVVTITIPLG GKASANGVDE KIVQVLKEDW ETALGITLTV ENLKADDDQY DKRIDGDAVR IDGMPSYFHD DIARLNNFQS SPLGKNYYHF DNGSFNELID TLQNTVEESQ RKAIGDQLQE ILVEQIPCIP VCSMDAFDAY RSDRFYGFDS LIHKDGGDIN IFSHVKPAST EVNR
|
| |