Gene Mlab_0363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0363 
Symbol 
ID4794914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp341803 
End bp343068 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content52% 
IMG OID640099013 
Producthypothetical protein 
Protein accessionYP_001029806 
Protein GI124485190 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAC TGGTTCTTTC AAATGCCCGG CTGCCTGACG GAAGGATCGC AGACATCTCC 
ATAGATCAGG GAATCATCAC CCACATCGGG AGTTCCGGAC ATGGAGAAAG AGTAATTAAC
TGCAGAAACC GGCTCTGCAT ACCGGCTGCT ACCGACATGC ATGTCCATAT GCGGGATGGT
AGTCAGGCAG CAAAAGAGAC TTGGAAAACC GGGACTCAGG CGGCGGCGGC AGGAGGAGTG
GCTACGGTCG TCGACCAGCC AAACACCATT CCTCCAATGG ATACCGTTGA AAACTTTCTG
GAGAGAGCCG CTCTCGCCTC GAAGGAATCC TTCTGTCACT TCGGCATCAA CGGATCGGTT
ACCGAACATG CGGATATTGC AGGCCTCGCA AAAGCTGGAG TTCTTGCATT TGGCGAGATG
TTTGCAGCTC CATCGAGTTA CGGCAGCGCC CTCCCTGCAG AGGTGATCAG GGATTCTCTA
AAAACCATCG CAAATCAAAA CATGCTGGTC ACCGTACATG CGGAAGAAGT TATTCTCGGG
GAGATTCATT CCCTTGCCGA GCATTCCCGT TCACGTCCGA TATCCGGAGA GATAGAAACC
ATCCGGCTTG TGCAGAATCT CGCACCGACG CATGCACAAC TGCACATCTG TCATGTCAGC
GGCGCCGAAG CATTCGAAAC GATCAAAGGA AGTTTCGAAG TCGCCCCCCA TCATCTTTTT
TTGTCCTATG AAGATACTGA TCCGGAAAAT ACTTTTTGGA AAATGAATCC CCCGCTCCGT
TCAAAAAAGG AGCGGCTGCA TCTCATTCAA AACTTCGCAA AAATCCCCGT GATTGCCTCG
GACCATGCCC CCCATACAAT TCAGGAAAAG TCACAGCCGT TCTCCGCTTC TGCACCGTCC
GGAGTTCCCG GCGTGGAAAC GATGCTCCCT CTCCTGATGA ATGCCGTGAC ACAGCGAACG
ATCACCCTGA ACGATGTAAT TGAAAAAACG GTAACAAATC CATGCAGAAT ACTTGGCATA
TCTGCCCCAT CGCTTAGTCC GGGCAGCCGG GCCGATCTTG CCGTATATGT CGACATCCCG
ACAAAGATAA CCGGCGAAGC TCTGCACAGT AAATGCGGGT GGACCCCCTA TGAAGGAATG
TCCGGGCTTT TTCCCGCAAC AACGGTGATA GGCGGTATCC CTGCATGGCA TGACGGGGAA
TTTACCCACG GCGGCGGACA GATGTGGAAA AATACCCAAA AGGCACAACT TCGCCGAAAA
GAGTAA
 
Protein sequence
MSELVLSNAR LPDGRIADIS IDQGIITHIG SSGHGERVIN CRNRLCIPAA TDMHVHMRDG 
SQAAKETWKT GTQAAAAGGV ATVVDQPNTI PPMDTVENFL ERAALASKES FCHFGINGSV
TEHADIAGLA KAGVLAFGEM FAAPSSYGSA LPAEVIRDSL KTIANQNMLV TVHAEEVILG
EIHSLAEHSR SRPISGEIET IRLVQNLAPT HAQLHICHVS GAEAFETIKG SFEVAPHHLF
LSYEDTDPEN TFWKMNPPLR SKKERLHLIQ NFAKIPVIAS DHAPHTIQEK SQPFSASAPS
GVPGVETMLP LLMNAVTQRT ITLNDVIEKT VTNPCRILGI SAPSLSPGSR ADLAVYVDIP
TKITGEALHS KCGWTPYEGM SGLFPATTVI GGIPAWHDGE FTHGGGQMWK NTQKAQLRRK
E