Gene Mpal_0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0103 
Symbol 
ID7272273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp121905 
End bp123959 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content56% 
IMG OID643568760 
Producthypothetical protein 
Protein accessionYP_002465219 
Protein GI219850787 
COG category 
COG ID 
TIGRFAM ID[TIGR02537] archaeal flagellin N-terminal-like domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGCA GTAACCGGGG CATCCCGGGG CCGGCCGGAC GCCATATCGA GCACGACGCA 
GGGGTCTCCG AGATCATCGG AGCCGTGCTG CTGATCGCCC TCGTCGTCGC CGGCGGAACC
CTGGTCGGGG TGGCACTCTT CTCGCAGCCG CTCCCGACCC AGGTGCCGAA GGTGAATATC
GTCATCGGGG CAGACCAGAA CGGGACGGTG ACCCTGGTCC ACAATGGCGG GGAAGCACTG
AACCCCGGCC AGTACCTGGT TTACCTGGAC CAGACACCCT GGCCTCTGAA TAAGAGCTTC
GTCAACAACA ATACTAGTAC CCCGGCGGAC ACCACGGTCT GGTCGGTGGG GAACTCGCTG
ACCCTCAGGG GGAGCGATGC GAACCTGACC GACAATGTCA CGGTCGTGTA CTCGGGCAGT
TCAGGGAACG TCACGATCAC CTCGGTGAAC GCTGTCGGGA CAGGGGAAGG AGGCTACGGG
TTCTTCTCGT CACTCTTCGA GTACATCCTG GGGAAGCACC CGGAGCAGTA CCCGGAGGGG
ATCGTGCCGC TCCCAACCCA GCAGGTGACC CATTACCCGC CGGTCGGACC ATACAACTGG
TCGAACATTC AGCCGCGTAT CGATTACACC GACTGGGCCG GTCAGTCAGA ATGGATGAAC
ACCACTGCCT ATATCTATAC CTCAGGCAAC TCCGACAGTA GTTATGATAC CCCACCTGAC
GTTATCACAA GCACAGTCCC AGGTGGTATG AGTGCCCCAA ATCTGTTTAG ATTCACTGAT
GTGAACACGA TCAATGATGA GTTGCACAAT TATGGCAGGG TGGTGATGCT CGACGGTGTC
TATGTCTGCA ACGGGCCAAT CCGTTTCGAC AACAGTTACA AGATACTCAT GGCTCAGAAT
CCAGGTACGG TCTATCTCCC CATGAATGGA CCTTCTTACA ACGATGGTTA CATCAAGGTG
AGTGCCCCGA ATGTGATCAT CAGCGGGCTC AACCTTGAGG GAACTGGAGG TGTTGAGATC
GTTTCGAGTT ATGTCAGTGT TCAGGGTGTG AATGTAACCA GTCGAAACCA TGGTGAACTC
TCCGCGAATC ACGACCCGAA CGAGCCGCCA GCGAACAAAG CTCCAATCAA CGGGATGTTC
TTTGTATGGG CTGACGGCCA GCCTCTTTCA AACATTGAAT TTTTCAACTG TACAGCCCGC
GAATGCAACA CCCACGGCTG GAACATGAAT CAGAACTGGA ACAATGTGCC ATATTCCATC
AGCAATACGC TCTTTGTAAA TTGCGTGGCC TCATACTGTG GATATGGATC AGCCGGGGAC
ACGGTCAACT ATATCAATGG TTCAACCAGT ACCGTTTCAG CAGAACATCA ATCCCGGTCC
GAGTGGATCA CCGGGTTCGA CCTGCACGAA TGGCAGGACC TGATCGAATG TCACGTATAC
AACTGTTATG CAGACAACAA CTGGGAGTCG GGATTCCACT TCGAGCCCGG TGCCCGGTAT
GGTGATAACG GAGAGGATAT CGGGCCACGG ACCAGGTCAG AGAATGTCAC ACTGGACAAC
TGCACCAGTA TCGACAACGG CCGGTCGACC TATAGCGGCG CTTTCTTCCA ATCCGGTTAC
TACCTCTCGC GGAACACCAC GCTGACCAAC TGCACCTCGG TTGACAACGC CAACGCCGGG
TTCTATGTCC AGGGCGGTAT CAACTCCCAG TTCGTCAACT GCACGGACAC CGGAAGCAAG
AACACCGGCT TCCTCGTGAT CAAGGGGTCA AGCGACATCA CCATCGACAA CTGCATCTCG
AAGGACAATC CAAGGTTCGC CCTCTGGACC GCGTTCACCG AGAACCTGCA GGTGAAGAAC
TTCCAACAGC TCAATGTCAC TGGAGGGACG GGTTTTGGAA CCCAGACCCA GTCGATCCTC
GGGTGGTACA AGGATGATTC GCGGTACCAA CTGCCAGTGA CCGACTCATA TATCCAGATC
ACCGCGGATA AGAATTCCCC GCAGATCATC AACCAGGCCG GCAGTGGAAA TACCTACGAT
CTCAGGCCCT CCTGA
 
Protein sequence
MSGSNRGIPG PAGRHIEHDA GVSEIIGAVL LIALVVAGGT LVGVALFSQP LPTQVPKVNI 
VIGADQNGTV TLVHNGGEAL NPGQYLVYLD QTPWPLNKSF VNNNTSTPAD TTVWSVGNSL
TLRGSDANLT DNVTVVYSGS SGNVTITSVN AVGTGEGGYG FFSSLFEYIL GKHPEQYPEG
IVPLPTQQVT HYPPVGPYNW SNIQPRIDYT DWAGQSEWMN TTAYIYTSGN SDSSYDTPPD
VITSTVPGGM SAPNLFRFTD VNTINDELHN YGRVVMLDGV YVCNGPIRFD NSYKILMAQN
PGTVYLPMNG PSYNDGYIKV SAPNVIISGL NLEGTGGVEI VSSYVSVQGV NVTSRNHGEL
SANHDPNEPP ANKAPINGMF FVWADGQPLS NIEFFNCTAR ECNTHGWNMN QNWNNVPYSI
SNTLFVNCVA SYCGYGSAGD TVNYINGSTS TVSAEHQSRS EWITGFDLHE WQDLIECHVY
NCYADNNWES GFHFEPGARY GDNGEDIGPR TRSENVTLDN CTSIDNGRST YSGAFFQSGY
YLSRNTTLTN CTSVDNANAG FYVQGGINSQ FVNCTDTGSK NTGFLVIKGS SDITIDNCIS
KDNPRFALWT AFTENLQVKN FQQLNVTGGT GFGTQTQSIL GWYKDDSRYQ LPVTDSYIQI
TADKNSPQII NQAGSGNTYD LRPS