Gene Mpal_1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1444 
Symbol 
ID7270049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1491471 
End bp1492910 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content59% 
IMG OID643570069 
ProductPKD domain containing protein 
Protein accessionYP_002466491 
Protein GI219852059 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATC ACCAGACATT GGTCCCTGCG AGGACAGTAT ATCTGTTAAT TGCAGCTCTT 
CTGCTGCTTG TCGGTACTGC CAGCGCTGCA GCCCCGATAG CGGCGTTCAC CAGCACGCCG
GCTTTCGGCA CCGTCCCCCT GACCGTGAAC TTCACCGACA CCTCGACCGG CAGTCCGACC
GGCTGGGCAT GGTTCTTCGG CGATGAGACC TATACGCAAC CGTGGATAGA ACAGAATACG
AGTTCCGGAT GGTCGAAGCG AGGGTATATC CGAAGTGTGG CGATGCCGGA CGGAAGCATT
GTGCTGATGG GCGGTTACGG CAACAGTACC TATCTGAACG ACACCTGGCG GTCGACCGAT
AACGGCAAAA CGTGGACAGA GCAGAACCCG AGTTCCGGGT GGTCGGGACG AGCCGAACAG
AGCTGTGTCG CGATGCCAGA CGGGAGTATC GTGCTGATGG GCGGTTACGG CAACGGTACC
TTTCTGAACG ACACCTGGCG GTCGACTGAT AACGGCAGAA CGTGGACAGA GCAGAACTCG
AGTTCCGGGT GGTCGGGACG AATCGAACAG AGCAGTGTCG CGATGCTGGA CGGAAGCATC
ATACTCATGG GCGGGTATGA CGGGAACAGC AATCAGAACG ATGTGTGGCG GTCGACCGAT
TTCGGTGTAA CCTGGACACA GTTACCGGAT GCCGCGTGGT CGCCGAGATC CTGTTTCACC
GTCGTCGTGA CGCGGGACAG CAGCATCGTA CTCACGACCG GCGTCGAATT AGATGGCACC
TGGAAGAATG ATGTGTGGCG GTCGACCGAT GGGGGTCTCA CCTGGACTGA GATGACCCCG
AGTGCCAGTT GGGAGGAGAG GAGATGGCAT GGATGCGTCG CGATGCCGGA CGGGAGCATT
GTGCTGATGG GCGGTGATGG GCCCGTCGGT TGGAATGATG TGTGGCGGTC GATCGATGAC
GGTGCAACCT GGACGCAGTT GCCGGATGCC GCGTGGTCGC CACGGGCCCG TTTAGGCTGT
GTGGCGATGA CGGATGGCAG CATTGTGGTG ATGGGGGGTG GAGGCCAGAA CGATACCTGG
CGGTTCCAGC CCGCAGGATC AACACTCCAG AACCCATCGC ACACCTATAC GACTGCGGGT
ACCTATTCGG TAGCATTGCA GGCGTTCACT GCGGCCGGGT ACAGCAGTAC GCACCAGGTC
TCGGTCGCCG TCGTCCCGAT GCCGTCGATA TCCCCCTCGG TCAACACTCC TCAGGACCTC
AACCATGACG GGCTCTACGA GGATCTCGAC GGCAACGGTG TTTTCGACTT CAATGACGTG
GTCCTCTTCT TCAACCAGAT GGACTGGATC GCAGATTACG AGCCGGTCAC TGCATTCGAC
TTCGACAGAA ACGGCCGGAT TGATTTCAAT GATATCGTTG TATTGTTTAC TACCCTGTAG
 
Protein sequence
MNDHQTLVPA RTVYLLIAAL LLLVGTASAA APIAAFTSTP AFGTVPLTVN FTDTSTGSPT 
GWAWFFGDET YTQPWIEQNT SSGWSKRGYI RSVAMPDGSI VLMGGYGNST YLNDTWRSTD
NGKTWTEQNP SSGWSGRAEQ SCVAMPDGSI VLMGGYGNGT FLNDTWRSTD NGRTWTEQNS
SSGWSGRIEQ SSVAMLDGSI ILMGGYDGNS NQNDVWRSTD FGVTWTQLPD AAWSPRSCFT
VVVTRDSSIV LTTGVELDGT WKNDVWRSTD GGLTWTEMTP SASWEERRWH GCVAMPDGSI
VLMGGDGPVG WNDVWRSIDD GATWTQLPDA AWSPRARLGC VAMTDGSIVV MGGGGQNDTW
RFQPAGSTLQ NPSHTYTTAG TYSVALQAFT AAGYSSTHQV SVAVVPMPSI SPSVNTPQDL
NHDGLYEDLD GNGVFDFNDV VLFFNQMDWI ADYEPVTAFD FDRNGRIDFN DIVVLFTTL