Gene Mpal_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1371 
Symbol 
ID7269976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1410483 
End bp1411646 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content53% 
IMG OID643570003 
Productaminotransferase class V 
Protein accessionYP_002466425 
Protein GI219851993 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00447816 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.862029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATTA AACGCCCGAT ATACATGGAT TCACACGCAA CTACCCCTGT CGACCCTCGC 
GTGTTCGAAG CGATGCTCCC CTATTTCTCC GAGATCTTCG GGAATGCCGG CAGTATCGAT
CATAACTATG GGGCGGTGGC TGCTGATGCA GTGAAGAAAG CCCGGGAACA GTGTGCCCAT
ATCCTGAACG CTCAGTCCGA GGAGATCATT TTCACGAGCG GGGCGACCGA GTCCGATAAC
ATCGCTATTC TTGGTGTTGC CGAGCAGTAC GCTGCTAAGG GCGATCATAT CATCACCTGT
GTTACCGAGC ACAAGGCCGT GCTTGATACC TGCAAGCATC TCCAGATGGC CGGCAAATCT
GTGACGTATT TACCAGTTGA CCATTATGGG CTTGTCGACC CCGGCCAGGT TGAAGACGCC
ATCACCGAGA AGACTGTATT GATCTCCATC ATGGCGGCGA ACAACGAGAT CGGGACGATT
GCACCCATCA AAGAGATCGG GGAGATTGCT CACAAACACG GCGTGCTCTT CCATACCGAT
GCAGCGCAGG CCGTCGGCCA TATCCCGATG GATGCTAAGG AGATGAACAT CGATCTGCTG
TCCTTCTCCG GTCATAAGAT CTATGGGCCG AAGGGGATCG GCGGTTTGTA TGTCCGGGGG
AGCAAGCCCC GTGTGAAATT ATCGCCGATT GTCTTTGGTG GCGGGCAGGA GAAGGGGATT
CGCTCGGGGA CGCTGAATGT AACGGGTATC GTTGGGCTCG GAGAGGCGCT CGCAATTGCT
AGGAAGGAGA TGGGCAAGGA AGAAAAGCGG TTCCGTCAGT GGACAATGCA GATGTTCGAT
GCGTTCAAAG AAGCGTATCC ATCGGTGATG TTGAATGGGC ATCCAACCCA GCGGCTTGCT
CACAACCTGA ATGTCTGTTT CCCTGGCATC GAGAGCAAGG CATTGATTCA TTTATTGAGG
GACGATGTAT CGATATCTGC TGGCTCTGCC TGCACGACCA CGAGCGTTGA GCCATCGCAT
GTGTTACTTG CGATTGGACG GACAGTCGAG GAGTCCCACT CGGCGGTGCG ATTTGGATTG
GGGAGAGGGA ATTCGGAGAA GGAAGTGGGG TTGGGGATAA AAAAGGTATA CGATTATTTG
AAACAATTAG AAAAATTAAA ATGA
 
Protein sequence
MEIKRPIYMD SHATTPVDPR VFEAMLPYFS EIFGNAGSID HNYGAVAADA VKKAREQCAH 
ILNAQSEEII FTSGATESDN IAILGVAEQY AAKGDHIITC VTEHKAVLDT CKHLQMAGKS
VTYLPVDHYG LVDPGQVEDA ITEKTVLISI MAANNEIGTI APIKEIGEIA HKHGVLFHTD
AAQAVGHIPM DAKEMNIDLL SFSGHKIYGP KGIGGLYVRG SKPRVKLSPI VFGGGQEKGI
RSGTLNVTGI VGLGEALAIA RKEMGKEEKR FRQWTMQMFD AFKEAYPSVM LNGHPTQRLA
HNLNVCFPGI ESKALIHLLR DDVSISAGSA CTTTSVEPSH VLLAIGRTVE ESHSAVRFGL
GRGNSEKEVG LGIKKVYDYL KQLEKLK