Gene Mpal_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1940 
Symbol 
ID7270744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2055888 
End bp2056958 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content54% 
IMG OID643570554 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_002466967 
Protein GI219852535 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.124053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.423901 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACACTCA CAGAAAAAGA TGAAACCGGA TCCGTTCTGG ATTCCCAGAA TTCGACGAAG 
AACCGTTACT GCCGCCAGAT CATCCTTCCT GAGATTGGGA CAGAGGGGCA ACGCCGATTC
AAAGAGAGCC GGGCAGTCAT TGTGGGTCTT GGAGCAACAG GAAGTTCCGT GGCCAACTCA
CTGGTGCGGG CGGGTATCGG ACAGGTAGTT TTGATCGATC GTGATCTGGT GGAGCTGCAC
AACCTGCAGC GGCAGATCCT GTACAGCGAG GAAGATCTGA ACCGCCCAAA GGCTGTTGCC
GCTGCTGAGA TCCTGCGAAA GATCAATTCT TCCATTGAGA TTGAGGCACA TGTAACAGAT
TTCAACATAT CAAATGCCGA AAAACTCCTA TCAGGAGCGA ACGTGGTTCT GGATGGAACC
GACAACCTCC AGACCCGTTT CCTCATCAAC GATATCTGCG TAAAGCATAG CATCCCCTGG
ATCTATGCCG GCGTTGTCGG GACCGGGGGC ATGGTGATGC CCATCCTGCC GGGCAAAACT
CCCTGTTTCC GGTGTCTCGT TCCTTCGCTT CCCGGACCCG GCCTGTTGCA GACGTGCGAC
ATTGCAGGTG TCTTAAATAC GATGCCTCCT CTCATCGCCT CTATTGAGTG TACCCTCGCT
TACCAGATCC TGACGGGGCA GTTCGACCCG AAGGATGAGA TCTCGTACAT GGTATATATT
GATGGATGGC GGAACACGTT CGATCGAGTG GCGGTTGGGA GACAACCAGA CTGTCCGTGT
TGCGTACAGG GTCAGAGGGA TTTCCTGGAT GCGGTCTCCC GGGAGATGGT AACATCACTC
TGTGGGAGAG ACGCGATCCA GATCATCCCC TCATCCCAGA TGGAGATCAC TCTTGAAGAC
CTTGAGATCC GTCTCTCCCG TCTTGGCGAG GTACGTCTCC ACCCATACAT GCTCACATTC
CGGACCGGAA CCGAAGAGAT CTCAATATTC CGTGACGGAA GGGCGATCAT CAAAGGAACC
AAAGACGAGG CTATGGCCCG TTCCGTTTAT GCACGGTACA TAGGGCTCTA G
 
Protein sequence
MTLTEKDETG SVLDSQNSTK NRYCRQIILP EIGTEGQRRF KESRAVIVGL GATGSSVANS 
LVRAGIGQVV LIDRDLVELH NLQRQILYSE EDLNRPKAVA AAEILRKINS SIEIEAHVTD
FNISNAEKLL SGANVVLDGT DNLQTRFLIN DICVKHSIPW IYAGVVGTGG MVMPILPGKT
PCFRCLVPSL PGPGLLQTCD IAGVLNTMPP LIASIECTLA YQILTGQFDP KDEISYMVYI
DGWRNTFDRV AVGRQPDCPC CVQGQRDFLD AVSREMVTSL CGRDAIQIIP SSQMEITLED
LEIRLSRLGE VRLHPYMLTF RTGTEEISIF RDGRAIIKGT KDEAMARSVY ARYIGL