Gene Mlab_1060 details


Gene Information

Locus tag: Mlab_1060
Symbol:
ID: 4795465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 1072448
End bp: 1073689
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 51%
IMG OID: 640099731
Product: hypothetical protein
Protein accession: YP_001030496
Protein GI: 124485880
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
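
The coordinate and length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the gene length counts the stop codon while the protein length does not. Both conventions are assumptions here, not something the record states. A minimal sketch of that check, with hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(gene_length(1072448, 1073689))  # 1242
print(protein_length(1242))           # 413
```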


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.767819
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGAGGGTT TCGTTATAGT TGAAATAGTT TGCAGCGGAG ATCCGCCTGA GTCAGAGCCG 
ATTATTCGTG TACATGGTCT GACCAAAGTC TTTGGTCATG ATCTGGAAAA AGCCCTGGAA
CTTCACCGTT CGGGCCGCTC AAAAAAGGAA GTTCACGAAG AGACCAATGC GACGATCGCT
CTGCACAATG TCTCTTTCGA GGTTATGCGT GGAGAGACTT TCGTCCTTAT GGGTCTTTCC
GGCAGTGGAA AATCAACTCT TCTTCGTTGT ATTAATCGTC TTATTGATCC GACCGAAGGT
GAGATTGAGA TCGACGGAGA GGACATTATC GGCATGGATC ATGAGGAACT TCGGCTGATC
CGCCGACGCA AACTGGGCAT GATCTTCCAG AATTTTGCAC TTCTTCCGCA GAGAAATATT
CTGGACAACG TAACGTTCGG TCTTGAAATC ATGGGCGTTC CCAAAGCAGA ACGTAATCAG
CGGGCCGAAA AAGTCCTGGA AATGGTCGGT CTTGGCGGCT ACGGAAGCAG TATGCCCTCA
GAACTTTCGG GCGGGATGAA ACAGCGTGTC GGTCTTGCCC GTGCACTCAC CAGCGATCCG
GACATTTTGT TGATGGACGA AGCGTTCAGT GCTCTTGATC CGCTAATCCG CCGCGATATG
CAGGACGAAC TGCTGGAACT TCAGGAGAGG CTCGGCAAAA CGATTATTTT CGTAACTCAT
GATCTTGACG AGGCACTGAA GCTCGGTACC CGTATCGCTC TGATGAAAGA CGGTTCGATC
GTGCAAATCG GTACGCCGGA AGAGATTCTG ACAAATCCAG AGAATGCCTA TGTCGAGAAG
TTCGTTGCTG ATGTGGACCT GACCCGAGTT CTCTCGGCAA AAGATGTTAT GCGCCGTCCA
GAGCCGGTTG CCCAGTGCAC TGCCGGTCCG CGGGTAGCGC TGCATCTGAT GGAAGAACAT
GATATCCCGA TGATTTTTGT TGTCACCCGC CACAGAAATC TTCGTGGTCT CGTGACTCTA
GAAGATACCA TCGGTGCTGT GAAAACAGGA AAGACCATGT CGGACATTCT CAAAACCGAT
ATTCCCATTG TTGCACCCGA CGCACCGCTT TCTGACATTC TCTCTTTGAT TGTCGACAGC
CAGTATCCAA TGGCAGTTGT AGATGAAAAC GGCAGACTGC ATGGCGTGAT CTCACGTGCA
TCAATCCTGG CTGCACTTGC CCGCAAGGGA GGTGATTCCT GA
 
Protein sequence
MEGFVIVEIV CSGDPPESEP IIRVHGLTKV FGHDLEKALE LHRSGRSKKE VHEETNATIA 
LHNVSFEVMR GETFVLMGLS GSGKSTLLRC INRLIDPTEG EIEIDGEDII GMDHEELRLI
RRRKLGMIFQ NFALLPQRNI LDNVTFGLEI MGVPKAERNQ RAEKVLEMVG LGGYGSSMPS
ELSGGMKQRV GLARALTSDP DILLMDEAFS ALDPLIRRDM QDELLELQER LGKTIIFVTH
DLDEALKLGT RIALMKDGSI VQIGTPEEIL TNPENAYVEK FVADVDLTRV LSAKDVMRRP
EPVAQCTAGP RVALHLMEEH DIPMIFVVTR HRNLRGLVTL EDTIGAVKTG KTMSDILKTD
IPIVAPDAPL SDILSLIVDS QYPMAVVDEN GRLHGVISRA SILAALARKG GDS
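
The gene sequence begins with TTG, which is read as methionine when it serves as the initiator codon under translation table 11, and it ends with the stop codon TGA. The sketch below shows one way to relate the two sequences and to compute a GC fraction from the spaced 10-base blocks. The function names are hypothetical, the first codon is assumed to be a valid start codon, and only the bases A, C, G and T are handled.

```python
# Codon table for the standard genetic code, built in TCAG order.
# Translation table 11 assigns the same amino acids; it differs in which
# codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence up to the first stop codon.

    Assumption: the first codon is an initiator, so it is reported as M
    whatever amino acid it would encode internally.
    """
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    if residues:
        residues[0] = "M"
    return "".join(residues)


# The first three 10-base blocks of the gene sequence listed above.
print(translate_cds("TTGGAGGGTT TCGTTATAGT TGAAATAGTT"))  # MEGFVIVEIV
```

Passing the full gene sequence to `translate_cds` and comparing the result with the listed protein sequence, or passing it to `gc_fraction` and comparing with the GC content field, is a quick consistency check on the record.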