Gene Mpal_1290 details

Gene Information

Locus tag: Mpal_1290
Symbol:
ID: 7271150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1319472
End bp: 1320980
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 61%
IMG OID: 643569924
Product: protein of unknown function DUF39
Protein accession: YP_002466347
Protein GI: 219851915
COG category: [K] Transcription
              [S] Function unknown
COG ID: [COG1900] Uncharacterized conserved protein
        [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains
TIGRFAM ID:
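
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are inferred from the numbers in this record rather than stated by it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1319472, 1320980)
print(length)                  # 1509
print(protein_length(length))  # 502
```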


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0616948
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.376218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGT CCATTGGCCT GATCAACCAG CGGATCGCTG ACGGCAATGC CACTGTTGTT 
ACAGCAGAGG AGATGCCTGC CCTGGTTGAT GAACTCGGTG AAGAAGGGGC CCTTGAAACA
GTGGATGTGG TCACGACCGG AACATTTGGA GCGATGTGCT CGACGGGAGC ATTCCTGAAC
TTCGGTCACG CCGACCCGCC GATCCGAATG GAACGGGTCT GGCTGAACGA TGTCGAGGCG
TATGCCGGCC TCGCTGCGGT GGATGCTTAT ATCGGCGCCA CCCAGCAGTC CACAACGCGT
GGGGACCAGT ACGGCGGCGC CCATGTGCTC GAGGACCTGG TATCCGGGAA GCAGATCGAC
CTCCGGGCCA TCTCCCGCGG TACCGATTGC TATCCCCGGC GGACGATCAA CACCGAGCTG
ATCCTTGAGG ATCTGAACCA GGCCACCATG TGTAACCCCC GGAATGCATA CCAGCGCTAC
AATGCCGCAA CCAACACCAC CGACCGGACC CTCCACACCT ACATGGGGAT GCTCCTCCCG
AACAGTGGAA ACATCACCTA TTCTGGGGCA GGGCTGTTGA ACCCGATCAC CAACGACCCG
AAGTTCCGGG TGATCGGAAG CGGGACGCCG ATCCTGCTCG GTGGCGGTCA GGGGATGATC
GTTGGGGAAG GGACGCAGCA CTCGTCCGGT GCCGGATTCG GGACCCTGAT GGTCACCGGG
GACCTCAAGC AGATGTCACC GGAGTACCTG CGTGCAGCCA CCATGCAGGG GTATGGGGTC
ACGATGTACA TCGGGGTAGG GGTGCCGATC CCGGTGGTTG ACCTCGAGGT TGTCCGGTCC
ACGGCAGTCA GAGACGAGGA TATCCTGGTC GATCTGGTCG ACTACGGGGT GCCGAGCAGG
TCACGTCCGG TTCTACGGCA GGTCAGTTAT GCCGACCTGA AGAGCGGAAC GATCGATCTG
AATGGGGAGG AGGTGACCGC CTCGTCGCTC TCGAGCTTCC GGATGGCCAG GCGGGTCGCG
GCGGACCTGA AAGAGCGGAT CCTTGCCGGA ACGATGTCGA TGACCCTGCC GACCCGGCAG
ATCGATCCGA ACAAGCTCGC CCACCCGATG CATGAGACGG TGCATGCCCC GCGGGTCTGT
GATATCATGG ACCGGCACCA GGTGAGCATC ACCGAGGACG AGGAGATCCG GACGGCGGCA
AAGAAACTGT TGAAGGGTGA GACCAACCAC CTGACGGTGC TGAATCTGGA GGGACGACTG
GTCGGGATGG TCACCACCTA TGACCTCTCC AAGGCCGTCG CCAACCCGGG GAAGGTCTCG
CTGGTCAGGG AGATCATGAC GAGAAAAGTG ATCACGACCA CACCCGATGA GGTGGTCGAC
GTTGCAGCCC AGAAACTGGA ACAGTATAAT ATCAGTGCCC TGCCGGTGAT CGATAAGGCC
GGCCGGGTGC TCGGCATGCT GACGGCACTT GACTTAGGAA AACTGTTCGG CAGGAGGTGG
ACACGATGA
 
Protein sequence
MEKSIGLINQ RIADGNATVV TAEEMPALVD ELGEEGALET VDVVTTGTFG AMCSTGAFLN 
FGHADPPIRM ERVWLNDVEA YAGLAAVDAY IGATQQSTTR GDQYGGAHVL EDLVSGKQID
LRAISRGTDC YPRRTINTEL ILEDLNQATM CNPRNAYQRY NAATNTTDRT LHTYMGMLLP
NSGNITYSGA GLLNPITNDP KFRVIGSGTP ILLGGGQGMI VGEGTQHSSG AGFGTLMVTG
DLKQMSPEYL RAATMQGYGV TMYIGVGVPI PVVDLEVVRS TAVRDEDILV DLVDYGVPSR
SRPVLRQVSY ADLKSGTIDL NGEEVTASSL SSFRMARRVA ADLKERILAG TMSMTLPTRQ
IDPNKLAHPM HETVHAPRVC DIMDRHQVSI TEDEEIRTAA KKLLKGETNH LTVLNLEGRL
VGMVTTYDLS KAVANPGKVS LVREIMTRKV ITTTPDEVVD VAAQKLEQYN ISALPVIDKA
GRVLGMLTAL DLGKLFGRRW TR
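
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch using only the Python standard library. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two tables differ in permitted start codons, not in the codon meanings used here). The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGAAAAGT CCATTGGCCT GATCAACCAG"))  # MEKSIGLINQ
# Last 9 bases give the final two residues followed by the TGA stop codon.
print(translate("ACACGATGA"))  # TR
```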