Gene Mboo_1562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1562 
Symbol 
ID5410087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1629659 
End bp1630924 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content61% 
IMG OID640868796 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001404722 
Protein GI154151104 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTCA CCCTCCCGGC CCGGAGCGGG ATCGAACTTT CCGTCACCAC CCCGCCCTCC 
AAGAGTTACA CTCACCGCGC CCTGATAGCA GCAGCGCTTG CACAAGGGAG AAGCACGATT
GTCCGCCCGC TCATGGCCGA TGACACAAAA CTCACCATTG CATCGCTCAT GAAACTCGGT
GTTGCGATCC ACGCCGACCA GCACAATATC ATGCTGGAAG GCTGCGATGG ATCCTTCCCA
AATACACCGG GAACAGTGCT TGACCTGGAT AACTCCGGCA CCTCGCTGCG CCTGCTTGCC
TCGGCAGCCC TGCTCGCCAC TTACCCGGTC ACGCTCACCG GCAGCGCCCG GATGCAGGAG
CGCCCGCTCG GGCCCCTTGC CCACACGCTT AACGACCTCG GGGGAATGGT CATCTTCACC
AAAAAAGAGG GCTATCCCCC GGTCACCATC GGGGGCCGGC TCCTTGGCGG GACTGCCACA
ATTGACGGCT CGCAGAGCAG TCAGTTTGCC TCCTCGGTCA TGATGGCGGC ACCGTATTCA
AAAGGCCCGG TGGACCTGAC CGTTACAGGG ACTCCTGCGT CGCAGTCGTA CCTCGACATC
ACGGCCGGGG TTATGACGGA CTTTGGCGCC GTGATCCGGC GCGAGGGCTA CAGGCGGTTT
TTGGTCAGCA ACTGCAACCA TTACACCGGG CGCACGTTTG TTGTTGAAGG GGACTACTCC
TCTGCCTCGT ACTTCTTTGC GCTTGCTGCG ATCTGCGGGG GCAAAGTGAC CGTAGCCGGC
CTTGCCCCGG ACTCTGTGCA GGGTGACCGG CTTTTCCTTG ATGCACTCCA GCGGATGGGC
TGCGAGGTGA CCTATGCCCA CGATGGGGTT ACCGTGGAAA ACCAGGGCCC GCTTACCGGG
ATCACGATCA ATATGTCCTC GGCTCCCGAC ACGGTCCAGA CACTCTGCAT GGTGGCGGCA
GTGGCAAGGA CACCAACGAT CATCACCGGC ATCGGCCACC TGAAGTTCAA AGAGAGCGAC
CGGATCGCAG TCACCGCGGA CCGGCTCAGG ATGCTGGGCG GTATTGTAAC CGCAGAAAGA
GACCGTATCG TAATCCAGCC GGCTACCCTG CACGGCGGGA GGATCGACCC GGTAAACGAC
CACCGGACCG CGATGAGCTT TGCCGTGCTC GGCCTTGGGA TCGGCGGGAT CACGATCACC
GGTGCAGAAT GCGTAAACAA GTCCTTCCCC GGTTTTTGGG AAATACTCTC GAAGGTGATG
GAATGA
 
Protein sequence
MIVTLPARSG IELSVTTPPS KSYTHRALIA AALAQGRSTI VRPLMADDTK LTIASLMKLG 
VAIHADQHNI MLEGCDGSFP NTPGTVLDLD NSGTSLRLLA SAALLATYPV TLTGSARMQE
RPLGPLAHTL NDLGGMVIFT KKEGYPPVTI GGRLLGGTAT IDGSQSSQFA SSVMMAAPYS
KGPVDLTVTG TPASQSYLDI TAGVMTDFGA VIRREGYRRF LVSNCNHYTG RTFVVEGDYS
SASYFFALAA ICGGKVTVAG LAPDSVQGDR LFLDALQRMG CEVTYAHDGV TVENQGPLTG
ITINMSSAPD TVQTLCMVAA VARTPTIITG IGHLKFKESD RIAVTADRLR MLGGIVTAER
DRIVIQPATL HGGRIDPVND HRTAMSFAVL GLGIGGITIT GAECVNKSFP GFWEILSKVM
E