Gene Mpal_2074 details


Gene Information

Locus tag: Mpal_2074
Symbol:
ID: 7271551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2200346
End bp: 2201914
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 50%
IMG OID: 643570685
Product: NHL repeat containing protein
Protein accession: YP_002467095
Protein GI: 219852663
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
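
The start, end, and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes 1-based inclusive coordinates and that the gene length counts the stop codon while the protein length does not. Both assumptions are inferred from the numbers in this record and are not stated by it. The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that is part of the gene but not of the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(expected_lengths(2200346, 2201914))  # (1569, 522)
```

Under these assumptions the result agrees with the listed Gene Length (1569 bp) and Protein Length (522 aa).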


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.614793
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAGGTTT TCTCATTACT CTGCATCCTG CTCCTCCTCT GCGGTAGTGT CCAGGCCGTG 
ACGGCAACAG AGACATATCT ATCTTCCAGA CTGTGGGGAA CCCCTGGATT TGGGATCAAT
CAGTTCAACT CCCCTGAAGG GATTGCGGTG GATGGTACTG GCAATGTTTA TGTGGCCGAC
ATGAACAACG ATCGTATATC GTTCTTCACG AAGGCTAGCT TACCACAGAT GCCTTCATCA
ATTGGGAGGA TCGGTTCTGG GCATGGACAG TTCTTCTATC CCCACGGGGT TGCAGTGGAT
AGCACTGGCA ATGTTTATGT GGCTGATACG GGTAACCACC AGATTCAGAA GTTCACGGTA
AATGGTAACT TCAACACGCA ATGGGGAATT AAGGGCTCGG GGACCAATCA GTTCAACTCC
CCTGAGGGGA TTGCGGTGGA CGGTGCTGGC AATGTTTATG TGGCCGATAC GGGTAATAAC
CGCATTGAAA AATTCACATC CTCGGGGGAT ATTGTCACCT CCTGGGGTTC CTATGGTTCG
GAAGTTGGGC AGTTCAACAG ACCAACCAGT GTTGCTGTGG ACAACACAGG AATAGGATAT
ATCTACGTCG CAGATACCGG TAACAACCGC ATTCAGAAAT TCACATTGAC CGGTGACCTC
GTTGCGACAA GGAGCATATC CAACTCTGGG GCCAGCCAGT TCAACAGACC GACCAGTGTC
GCTGTTGACA CCGGTGGGAG TGTTTATGTT GCGGACACTG GCAATAATCG GATCCAGAAG
TTCACGTCTT CAGGTGACCT CATCACCTCC TGGGGCTCTT ATGGTTCGGA ATCAGGCCAA
TTTGTTTCTC CATGCGGAAT AACGGTTGAT GGTGAAGGTA CCGTCTATGT GGCCGATACT
GGTAACAATC GCATTCAGCG GTTCACGCCT GTGCAGACCT ATGCCACCCT TGACTTTGTC
CCAGGTACAA AAACGCTGGT CCTTGGTGAA CACCAATCGT TTGATCTCAC CCTCTCTGGA
ATAGATACCG GCCTTTCGGG GTCTGAGGTC ATTGTATCCG TTGCTAATCC CTCAGTCCTT
GATATTGTTG GAGCCAGCCC ACCCGTTTGG TCTTCGACAC CACAATACTA TGATCTTCCT
TCATCTGCGG TCACAATCGG GGGTGCGGAC CTTGGGAATA GGGTCCAGGG ACGGATGTCT
AATATCCCCC TCGGCAATCT CACGGTTCAG GGGAAATTGC CTGGGACAAC CAGTCTGGAT
GTGACCCGGT ATCAACTGGA CGATGATTCA GGTAATCTGG TACCGGTCAT CACCATGTCT
GTTGTCATCA CCGTCAGCGG CACACTGATA CGGTCACTTC CTTCGTCTGA TACTCCGCCC
CACGATCTGG ACCAGGATGG TCTATATGAG GATGTGAATG GCGATGGGGT TTTTAACTTC
AACGATGTGA TTCAATACTT CAACCAGATC GACTGGATCT CTGATAATGA ACCGACAGTG
GCCTTCGACT TCAATCGAAA CGGGCGTGTC GACTTCGGGG ATATTGTGAC ATTGTTTAAT
ATATTGTGA
 
Protein sequence
MKVFSLLCIL LLLCGSVQAV TATETYLSSR LWGTPGFGIN QFNSPEGIAV DGTGNVYVAD 
MNNDRISFFT KASLPQMPSS IGRIGSGHGQ FFYPHGVAVD STGNVYVADT GNHQIQKFTV
NGNFNTQWGI KGSGTNQFNS PEGIAVDGAG NVYVADTGNN RIEKFTSSGD IVTSWGSYGS
EVGQFNRPTS VAVDNTGIGY IYVADTGNNR IQKFTLTGDL VATRSISNSG ASQFNRPTSV
AVDTGGSVYV ADTGNNRIQK FTSSGDLITS WGSYGSESGQ FVSPCGITVD GEGTVYVADT
GNNRIQRFTP VQTYATLDFV PGTKTLVLGE HQSFDLTLSG IDTGLSGSEV IVSVANPSVL
DIVGASPPVW SSTPQYYDLP SSAVTIGGAD LGNRVQGRMS NIPLGNLTVQ GKLPGTTSLD
VTRYQLDDDS GNLVPVITMS VVITVSGTLI RSLPSSDTPP HDLDQDGLYE DVNGDGVFNF
NDVIQYFNQI DWISDNEPTV AFDFNRNGRV DFGDIVTLFN IL
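
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11 (listed above), where TTG is one of several alternative start codons. The following is a minimal Python sketch of such a translation, not the method used to produce this record. It assumes the sequence is given 5' to 3' in the coding orientation, that the first codon is the start codon, and that only the bases A, C, G, and T occur. The function name `translate_table11` is hypothetical.

```python
# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Start codons of translation table 11; as the first codon they encode M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Raises ValueError on any base other than A, C, G, or T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # assumption: first codon is the start codon
            continue
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "TTGAAGGTTT TCTCATTACT CTGCATCCTG CTCCTCCTCT GCGGTAGTGT CCAGGCCGTG"
print(translate_table11(first_line))  # MKVFSLLCILLLLCGSVQAV
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence should reproduce the whole protein under the same assumptions.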