Gene Mpal_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1941 
Symbol 
ID7270745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2056992 
End bp2058176 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content55% 
IMG OID643570555 
Productaminotransferase class V 
Protein accessionYP_002466968 
Protein GI219852536 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0312599 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.253942 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTTG ATAATATCCG CAACGATTTC CCCCTTCTTT CTGAGGTATG CTACCTGGAC 
AGTGCAGCCA CGAGCCTCTC GCCGGAGCCG GTACTTGAGG CGATGCTTGA GTACGAGCAC
AAATACCGGG CAAATGCCGG CCGGGGGGTC CATCGGATCG CCCAGCAGGC CTCTCAGAAG
TACAGGGATG CCCACCAGAA GGTTCGGAAA TTCATTCATG CTCAGGAAGG TGAACTGGTC
TTTACCCGTA ACTCCACCGA GGCAATCAAT ACGGTTGCGT CAGGACTCGC GTGGCAGAAG
GGAGATCAGG TAATTACCAC ACTCCTTGAA CATCATAGTA ATCTCCTTCC CTGGATGCGT
CTTCGCAACC GTTATGGGAT TGATCTCCAG CTCCTGACTC CTGCACGGGA CGGCACCCTG
GATCCGGCCG CCCTTGAGGC AATCATCACA AAGCAGACCC GGCTCGTTGC TATCAGTCAG
GCTTCGAATG TACTGGGAAA TGTCGTGCCC ATCAGCGAGT TTGCAAAAAT CTGCCAGAAT
TACGGGGCGC TTCTTCTCGT TGACGGGTCA CAATCGGTCC CTCACATTCC GGTGGATGTG
GAACGCTTAG GCTGTGATTT CCTCTGTTTC TCAGGGCACA AGATGCTCGG CCCCACCGGT
ACTGGAGTAC TTTACATGAA GACTCCCTGC CTTGAACCCC TGCTTGTGGG TGGCGGGAGC
GTGGAGCGGG TCACTGCCGA GGATTACACC CTCACCGACG GATATGAACG TTACGAGGCG
GGAACCCCGA ATATAGCGGG GGCTATCGGT CTCGCCCGTG CAGTCGATTA CCTGAACGCG
CTTGGTATGG AGAATATCCA GAACCACGAG CAGCAGATCA CCCGGTATAT CATCAAAAAT
CTTACCGGGA TAGAGAACGT GGAGGTTTTT GGACCCGGGC CGGCAGGGAA CCGGATCGGG
GTCATCTCGT TTGCCGTCAA GGGGCTCAAT CCCCATGACG TTGCTGTTAT GCTTGACGGG
GAGGCAAATG TGATGGTACG ATCTGGTCAT CACTGTTGCA TGCCCCTTAT GCAACTCCTG
AACCTGACCG ACGGCACGGT TCGGGCAAGT CTGCACTGCT ATAACACGAT CGAAGACGCG
GAGCTGCTCG TGGACACCGT CAGGAAAATT GCTGGGGATT TTTAA
 
Protein sequence
MTFDNIRNDF PLLSEVCYLD SAATSLSPEP VLEAMLEYEH KYRANAGRGV HRIAQQASQK 
YRDAHQKVRK FIHAQEGELV FTRNSTEAIN TVASGLAWQK GDQVITTLLE HHSNLLPWMR
LRNRYGIDLQ LLTPARDGTL DPAALEAIIT KQTRLVAISQ ASNVLGNVVP ISEFAKICQN
YGALLLVDGS QSVPHIPVDV ERLGCDFLCF SGHKMLGPTG TGVLYMKTPC LEPLLVGGGS
VERVTAEDYT LTDGYERYEA GTPNIAGAIG LARAVDYLNA LGMENIQNHE QQITRYIIKN
LTGIENVEVF GPGPAGNRIG VISFAVKGLN PHDVAVMLDG EANVMVRSGH HCCMPLMQLL
NLTDGTVRAS LHCYNTIEDA ELLVDTVRKI AGDF