Gene Mlab_0163 details

Gene Information

Locus tag: Mlab_0163
Symbol:
ID: 4795895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 148281
End bp: 149414
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 52%
IMG OID: 640098809
Product: hypothetical protein
Protein accession: YP_001029606
Protein GI: 124484990
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID:
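
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, and the sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 148281
end_bp = 149414
gene_length_bp = 1134
protein_length_aa = 377


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1134
print(expected_protein_length(gene_length_bp))  # 377
```

Both results match the listed Gene Length (1134 bp) and Protein Length (377 aa), which is consistent with the inclusive-coordinate assumption.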


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGATGAAG GACTGTGCGT AAATCCGGTC TGGCCGTGCG CGATGACCGG GGCCTGTTCA 
GTGCTTGCCG GCTTCTCCGG ACTGAACGTG CTGATCCACG GTTCATCCGG CTGTTATTAT
TATCCCCGGT CGCTGCTGAA GGTTCCTTTG TTCAGCACGT ATCTTCTGGA GTCAGAGATC
GTGTTCGGTA CGGTCGAGCG CCTGAAAGAG GTCGTGAACA CTCTTTCAAC GTCGAAGCGA
CCGATCGCCG TTTTAAACAC CTGTATCCCG GCACTCACCG GCGAAGATCT TTCGGGGGCA
TTTTCCGAAG AGGAGACAAT TTTTGTCGAT GCGCCCGGAT TCATCGGGAG TGTCGAAGAT
GGAGCAAAGA TCGCCTTTGA ACGGCTGGGA ATCGAAACGG ATGCATCAAG GGAAGGCGTG
AACATCGACG GGGTTTCCCT TCTGGATCTA TTCTGGCGGG GAAATCTGCA CGAAGCAAAA
CGGATTCTCA CAGAGATGGG GATCCCGACC GCTCTGTGTC TTGCCAAAGA CAGCTATGAA
AATCTGCGAA AAGGAGCTGC TTTGCATACC GTCTCGGTAA ACCCGTCGTA TCCATCCGGC
GTTGGAACGA TGCTCGGCTC GTTTTTGTTC CCCGACCTGA AAGATACCTG TGCAAAACTC
GCAGATATAT TTCCCAACGC CGATATCGAC CCGCTCCTCG AAGAGTGGAA TCTTGCCGAC
GAGCAGCTGT TCTACTCAAG CGACAAATAT CTGCGAAAAT ATGAACCACC GGTCGTTGCC
GTATGTGCTC AGGAAAGTTA CGCACTGTTT GCAAAATCGA TGATGGAGCG CTACTTCGGG
GCGGACGTTC CGGTGATGCT TGCACGAAAC CATGATGCGG TAAGCATTCC CTCTGAAACC
GATCTGACGA AGATTGCAGG GCATATTGCC GGCTGCCGGC CGGATCTTAT CCTTGGATCA
ACGTTCGAAG CAAATGGTTA TCCAAACGCT GCATTCCTCG GGATAACGCC GCCTGACAGA
AGCCGGGTCT CCATAGCGGC ACGACCGATT GCAGGAATAG AGGGCGGAAT CATGTTTTTA
GAGAACGTGC TCAATACCTT GATTGATGCA ACTTCGTCAA AGCAGAAAAA GTGA
 
Protein sequence
MDEGLCVNPV WPCAMTGACS VLAGFSGLNV LIHGSSGCYY YPRSLLKVPL FSTYLLESEI 
VFGTVERLKE VVNTLSTSKR PIAVLNTCIP ALTGEDLSGA FSEEETIFVD APGFIGSVED
GAKIAFERLG IETDASREGV NIDGVSLLDL FWRGNLHEAK RILTEMGIPT ALCLAKDSYE
NLRKGAALHT VSVNPSYPSG VGTMLGSFLF PDLKDTCAKL ADIFPNADID PLLEEWNLAD
EQLFYSSDKY LRKYEPPVVA VCAQESYALF AKSMMERYFG ADVPVMLARN HDAVSIPSET
DLTKIAGHIA GCRPDLILGS TFEANGYPNA AFLGITPPDR SRVSIAARPI AGIEGGIMFL
ENVLNTLIDA TSSKQKK
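
As a rough illustration of how the gene sequence maps to the protein sequence, the sketch below translates the first and last blocks of the gene sequence shown above and compares them with the corresponding ends of the protein. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which start codons are permitted). `translate`, `gc_fraction` and `clean` are hypothetical helper names written for this example. `gc_fraction` shows how a GC content figure could be computed from the full sequence; its result for this gene is not asserted here.

```python
# Standard genetic code in the conventional TCAG ordering; translation
# table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last three blocks of the gene sequence in this record.
first_blocks = "ATGGATGAAG GACTGTGCGT AAATCCGGTC"
last_blocks = "ACTTCGTCAA AGCAGAAAAA GTGA"

print(translate(first_blocks))  # MDEGLCVNPV
print(translate(last_blocks))   # TSSKQKK
```

The two outputs match the first ten and last seven residues of the protein sequence. Passing the whole gene sequence to `translate` and `gc_fraction` would extend the same check to the full record.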