Gene Mlab_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0041 
Symbol 
ID4795141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp43744 
End bp45333 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content52% 
IMG OID640098686 
Producthypothetical protein 
Protein accessionYP_001029486 
Protein GI124484870 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.561625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.893455 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACGA AAAAGTACTT TGCCGTTGCA GGAGTTCTTC TCCTCACGCT TTTCATTGTT 
TTTTCCAGCG GGTGTATCGC ATCTTCACAG ACCAATCAGG ATGCAGTGCT GAAAATCGCT
ACGCCCAACG AGATCACCGA AGCCTCGTAC TTCGGGAACT ACAACTTTGC CCAGATGACG
CGTGTCGCCA CGCCCCCCCT GATTCAGGTC GATGAAAACG GAGACTATGT CGGGGCAGTG
GCAAAGAGCT GGACCGTTTC TTCCGATGGA AAAACCTGGA CCTTTACCCT GGATCCGAAT
TATCAGTGGA GCGACGGAAC GGCGCTTACC CCCGAGGATG TCAAATTCTC TCTTGAATAT
ACATCTGAAA AGGTCACCAC GGCCCCGTCG TGGATAAAGA ATGCCACGAT CACGACAAGC
GGGAATGTCG TTACCATCAG TCTTGCCGAT CCGGTGTACC GTCTGTGCGG CGATCTTGTT
GCGATCAACA TCGTTCCAAA GCATATCTGG GAATCGATCG ATAATCCCGA AGAGTACGTC
AATGCAGGAC CCTATGTCGG CTCTGCCGCA TACTACGTAA AGTCGGTCGA TATCAACTCG
GCGACCCTTG TCCTTGCGGT GAACCCGAAG TGGAAAGGAG ATGCTCCTTA CTATGGAACC
GTCGAGCTTC ACTGGTTCGC GACCGAAGAT GCCGCGATTT TAGCCATGCT TGGGAATGAA
TGCGACACCT ACTGGAAGTA CGGCGCCTCG TTCCCCACAT CGTCCGTTGC GCTTCTTGAC
TCCTCGAAAA AATTCACCAC GGTCGAAACT CCGTCCTTAA AGATGACGTA TGTTGGATTC
AATACGATCT CTTCCGATGT CGGAAGCGAT CTTGAGTTCC GCAAAGCGAT CGCCTATGCG
CTGAATTATG ACGAACTCGT CGAGATCGGT GCTTCGGGAT ATGCTCTCTC GGCAAACTCC
GGCTTCATTC CAAACGGGGT TCCGTATTAT AAGAATACCA CGGCCAACAT CTACAATCTC
GCAACCGCCA AACTGATGCT TGATGCGGCA GGTTACAAAG ATGTAAACGG GGATGGAATT
CGCGAGGACA AAAACGGCAA TGCTTTAACC GTCACCCTCC TGACTCGTGA CAAAGTCGCG
GGAGAACTCA TCAAAGAGTA CCTCGAAAAG GCCGGGCTGA AAGTTGAGTA TAAATTCGCC
TCCGATCTGA ACGCCTGGAT CGAATTAAAG GATGCCCATG CATACGACAT CGCCTACAAT
GGAATCACCG CCAAAGGCAT GATGATGGAT TCAGGCTGGG CAACCGGATA CTTCGCCTCG
AACACCGTAG GTGCCGGCAA ACTCCAGAAC GTTGACGACC CGGCATTCCT TGAACTCTGT
GCGAAAATCG CAACAACTCC GGATGGAGAC GAACTGAAAG CCCTGATCTC CACTCTTCAG
GACTATTATG CAGACAACAT GCCAGGCATC GCTCTCTACT GGTGCACGGA TGTGACGCCG
ATCAATACGG AGATCGACGG CTGGTACATC TCCAAGGCAA CGGGAATCCT GAACGAGATC
AATCTCCTCT CGATTCACCC GGCCAATTAA
 
Protein sequence
MHTKKYFAVA GVLLLTLFIV FSSGCIASSQ TNQDAVLKIA TPNEITEASY FGNYNFAQMT 
RVATPPLIQV DENGDYVGAV AKSWTVSSDG KTWTFTLDPN YQWSDGTALT PEDVKFSLEY
TSEKVTTAPS WIKNATITTS GNVVTISLAD PVYRLCGDLV AINIVPKHIW ESIDNPEEYV
NAGPYVGSAA YYVKSVDINS ATLVLAVNPK WKGDAPYYGT VELHWFATED AAILAMLGNE
CDTYWKYGAS FPTSSVALLD SSKKFTTVET PSLKMTYVGF NTISSDVGSD LEFRKAIAYA
LNYDELVEIG ASGYALSANS GFIPNGVPYY KNTTANIYNL ATAKLMLDAA GYKDVNGDGI
REDKNGNALT VTLLTRDKVA GELIKEYLEK AGLKVEYKFA SDLNAWIELK DAHAYDIAYN
GITAKGMMMD SGWATGYFAS NTVGAGKLQN VDDPAFLELC AKIATTPDGD ELKALISTLQ
DYYADNMPGI ALYWCTDVTP INTEIDGWYI SKATGILNEI NLLSIHPAN