Gene Mlab_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1650 
Symbol 
ID4795407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1677995 
End bp1679263 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content54% 
IMG OID640100335 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001031078 
Protein GI124486462 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0113989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTA TCGTATCTCG TTCTCAAATC TCCGGATGTG TCCATGCACC TCCCTCAAAA 
AGTCACACAC ACCGGGCATT TCTCCTTGCG TCTCTTGCAA AAGGAGAATC GGTCGTTCTC
TCTCCGCTTC TGGGCGAGGA CACTCTGGCT ACTCTCTCCG CGGTCAAAGC GCTTGGAGCG
AACGTATGCG AAGGGGATGA TCGAATCACG ATCCAGGGCG GCAATTTACA CGCCCCGCTC
CCGAAAGGAA CGGTGATAAA CTGTAAAAAC TCGGGCACCT CCATTCGGAT GCTTGCAGGC
ATAGCTTCCC GTCTGGATGG AACGACTGAG TTCACGGGTG ACGCTTCGCT CTGTTCCCGC
CCTATGAAGC CTCTGCTTGA CGCCCTGTCA GAACTTGGAG CCGGGGTAAC ATCCGACAAC
GGATGTGCTC CGTTCACCAT AACGGGTCCG GTATCGGGCG GCGATGTCCA TATTCGCGGT
GACGTGAGTT CTCAATTCAT CTCCGGCCTG CTGATCTCTG CTCCGCTTGG CAAAGCTGAC
ACGAGGATCC ACCTGACAAC TCCCCTCACG TCAAAACCAT ACGTGGACAT GACGATTTCT
GCTATGAAAA AGCACGGCGT TTCGGTCGAG ACGATCGAAG ATGGATATCT TGTCCGTTCA
GGTCAGGTCT ATTCTTCCGA GGATGTTCAG GTTGGCGGCG ACTACTCGTC GGCCGCATTT
CTGTTTGCGG CGGCGGCACT CGCCGGGGAG ATCGCCGTTT CCGGACTCGA CCCGGCTGAC
CCTCAGGGCG ATCAGGTTGT GATCTCCATC CTTGAAACAT TCGGGGCAGG AGTAGTTCGT
GATGGCGAAA ACGTTACGAT TCGAAAAGCA GCTTTGAAGG CTGCAGACAT CGATCTTGCG
AACGCTCCGG ATCTGTTTCC CATTATCGCG GTCCTTGCGT CGCAGGCGAA AGGCACCAGC
AGATTATACG GCGCCGCTCA TCTCAGATTC AAGGAAAGCG ACCGTATCAT GTCCACGGTC
CTTTTCCTCA GATCGATGGG TGCAGATATC AGCGAGACTG AGGATGGATG CATTGTTACG
GGACCTGCCA ATCTTTCCGG GGCAAATGTT ACTACATTTG GCGACCACCG TATAATGATG
GCATCAGCGG TTGCCGGGCT TATCGCAGAT AGTACTACGA CCGTAGATGA TGCCGGCTGC
TGCGCAGTTT CCTATCCGGG TTTTGTGAAA GATATGCAGA AACTCGGTGC GGATATGAGG
GAAGAATGA
 
Protein sequence
MKLIVSRSQI SGCVHAPPSK SHTHRAFLLA SLAKGESVVL SPLLGEDTLA TLSAVKALGA 
NVCEGDDRIT IQGGNLHAPL PKGTVINCKN SGTSIRMLAG IASRLDGTTE FTGDASLCSR
PMKPLLDALS ELGAGVTSDN GCAPFTITGP VSGGDVHIRG DVSSQFISGL LISAPLGKAD
TRIHLTTPLT SKPYVDMTIS AMKKHGVSVE TIEDGYLVRS GQVYSSEDVQ VGGDYSSAAF
LFAAAALAGE IAVSGLDPAD PQGDQVVISI LETFGAGVVR DGENVTIRKA ALKAADIDLA
NAPDLFPIIA VLASQAKGTS RLYGAAHLRF KESDRIMSTV LFLRSMGADI SETEDGCIVT
GPANLSGANV TTFGDHRIMM ASAVAGLIAD STTTVDDAGC CAVSYPGFVK DMQKLGADMR
EE