Gene Mlab_1652 details

Gene Information

Locus tag: Mlab_1652
Symbol:
ID: 4795905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 1680104
End bp: 1681525
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 52%
IMG OID: 640100337
Product: hypothetical protein
Protein accession: YP_001031080
Protein GI: 124486464
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0169] Shikimate 5-dehydrogenase; [COG0710] 3-dehydroquinate dehydratase
TIGRFAM ID: [TIGR00507] shikimate 5-dehydrogenase; [TIGR01093] 3-dehydroquinate dehydratase, type I
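
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1680104
end_bp = 1681525
gene_length_bp = 1422
protein_length_aa = 473


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under those assumptions the three fields agree: 1681525 - 1680104 + 1 = 1422 bp, and 1422 / 3 - 1 = 473 aa.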


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00124545
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTGCAA ATTGTGCCGT TATCGTGGCG AAAGATGAGA GAGAAGTTCG AACTCTTTCA 
GAAGAAGCGA TCTCGAAAGG AGCAGAGGCT CTTGAATTCC GGCTGGATTC CTTCCCGGTT
ATTCCTGCAG ACCTCTCCTT TCTTTCATGC GGGGTTCCAT CAATAGCCAC ACTCAGATCT
CCGTTGGACG AAGAGCGGAA GGAGATTTTC ATCCGTGCTC TTTCGTCGGG CGCGACATAC
ATCGACATAG AATCAGATTC GGTTCTGAGA GGTCAGTTTC CAAAAGAACA GGTCATCTGT
TCGTATCATG ACTTTGAAAA AACTCCTGCC GATGATGAAA TTCTCACGAT CTTCAAAGAC
CTCCAATCTT CGGGAATCCC CAAAGCCGCA TTCATGGTGA GGGGGCCGGC GGATCTGCTT
GCCGTCTGGA ATGCAGCATC AGTTCTGAGA GAAAGAGGAT ATCCGTTTAT CTTTATAGGC
ATGGGGGCCG CCGGTGAGGT TACACGTATC CGTGCCGCCG AGCTTGGATC AATGCTCAAC
TACTGTGCCT TAAGGCCGGA ACTTGCATCC GCTCCGGGTC AGATAACACT GGAAGAGGCT
GAGAGACTCG GTACCGACCC GCAGATTACT GCCGTAACCG GATATCCTTT GACCCATACC
TTCTCGCCGG AGATCCACAA CGCAGCGTTC AAAGCCGCGA AGATCCGGGG AAGATATGTG
AAGATCCCCG CAGCTTCTCA GGAAGTGCCC CTGATTCCTG ATGTTATCAA AAAATATCGG
ATCATCGGGA TGAACGTAAC CATTCCTCAT AAAGAGTCGG TCATCCCTCT CCTGAAACGG
ATAGATCCTC TCGCGGAGTC TGCCGGCGCA GTGAACACTA TCCGAAATTC TCCCGCCGGC
CTTGAGGGGT ACAATACCGA TATTCTGGGA ATAGCGGCGT CGCTCGCTTC GGTGGATATG
GATCCAAAGG ATGCCGATGT GCTTATCATC GGTGCCGGAG GGGCAGCAAA GGCCGCCGCC
GCATATCTGA AATCTGCCGG TGCTCGTCTT TCGATAACCA ACAGGACCCA TTCCCGGGCA
GAGACACTTG CCGAAATATT TGGTGCCCGG GCAGTGAAGC TGGAGGACCT TGCTCCGGAG
TATCAGCTGA TTCTGAATGC GACCCCGGCT GGAATGAGCG GGTTTGAACA CTGCTCACCG
GTTCCAAATA CGCTATTCAC CAAAGACACG GTTGTTATGG ATATGATCTA TGATCCGGAA
GTGACTCCCC TTCTTGCCGC TGCAAAAGCA GCCGGCGTTC GTGCCTGCAT CAACGGAAAG
ACCATGCTTA TCGAACAGGC CGCCGCGTCA TTCACTCTCT GGACAGGAAT TACTCCGAAT
CGTGATGTGA TGAGAAAAGC ATTCGAAGCG AGGGCGGCAT GA
 
Protein sequence
MTANCAVIVA KDEREVRTLS EEAISKGAEA LEFRLDSFPV IPADLSFLSC GVPSIATLRS 
PLDEERKEIF IRALSSGATY IDIESDSVLR GQFPKEQVIC SYHDFEKTPA DDEILTIFKD
LQSSGIPKAA FMVRGPADLL AVWNAASVLR ERGYPFIFIG MGAAGEVTRI RAAELGSMLN
YCALRPELAS APGQITLEEA ERLGTDPQIT AVTGYPLTHT FSPEIHNAAF KAAKIRGRYV
KIPAASQEVP LIPDVIKKYR IIGMNVTIPH KESVIPLLKR IDPLAESAGA VNTIRNSPAG
LEGYNTDILG IAASLASVDM DPKDADVLII GAGGAAKAAA AYLKSAGARL SITNRTHSRA
ETLAEIFGAR AVKLEDLAPE YQLILNATPA GMSGFEHCSP VPNTLFTKDT VVMDMIYDPE
VTPLLAAAKA AGVRACINGK TMLIEQAAAS FTLWTGITPN RDVMRKAFEA RAA
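
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own tooling: `clean` and `translate` are hypothetical helper names, and the codon table is written out by hand in the usual TCAG ordering. Translation table 11 uses the same amino acid assignments as the standard code (it differs in permitted start codons), which is sufficient here because the gene starts with ATG.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G
# at each of the three codon positions ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown in this record (60 bases).
first_line_dna = "ATGACTGCAA ATTGTGCCGT TATCGTGGCG AAAGATGAGA GAGAAGTTCG AACTCTTTCA"
print(translate(first_line_dna))  # MTANCAVIVAKDEREVRTLS
```

The printed 20 residues match the first two blocks of the protein sequence above (MTANCAVIVA KDEREVRTLS). Passing the full gene sequence to `translate` in the same way would allow the remaining residues to be compared.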