Gene Information |
Locus tag | Msed_1871 |
Symbol | |
ID | 5104139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1814238 |
End bp | 1815413 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640507757 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001191935 |
Protein GI | 146304619 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
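The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the record for Msed_1871.
length = gene_length_bp(1814238, 1815413)
residues = protein_length_aa(length)
print(length, residues)  # 1176 391
```

Both results agree with the Gene Length (1176 bp) and Protein Length (391 aa) fields in the record.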
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.481027 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTCGTGG AAATTGAGCC ATCAAGGATC AAGGGAAGGG TTAAGGCTCC GCCCTCAAAG AGCCTTGGAA TCAGGTACGT TTTCCTGTCC CTTCTGACGG AGGTTTCGCT CGAGAACCTC CCAGAGTCGG ATGACGTAAG GGTTGCAATT AACGCGGTCA AGTCCCTGAA GGAGGGGAAG GATGAGCTCT ACCTGGGTGG ATCTGCCACC ACCCTAAGGA TGATAATACC AGTGATCCTA GCCATGGGGC GAAGGGTGAA ACTTGACGGA GACGACACCC TGAGGAGGAG ACCTCTGAAC GCGTTGAGAT GGCTTCCAGG CAAGTTCTCG TCAAATTCCC TTCCCATGAC AGTTGAGGGA AGCCTGGGAC CAGAGACGCA GATTGAGGGG TGGGAAAGTA GTCAGTACAT CTCTGGGTTA ATCTACGCCT ACTGCCTGAG GGGGGAGGGA AGGATCAGGG TAATTCCCCC AATCTCGTCA AGGGGCTACA TCTTCATGAC CGCTGACGTC ATCTCCTCAA TAGGGGGAAA GGTGACGATT CAGGGGGAGG AGATAACCGT GGAGTGCAGA AACCTTCGCA AGTTTGGGGG ATCTGTTCCA GGCGATTACG CCCTGGCTTC ATTTTACGCA GTGGGGGCAG TACTAACGGG TGGGGAAGTG GAGATCACCA ACCTTTACGC TCCCCCAAGT TACGTTGGGG ACCACGACGT GGTTAGGATG GTGAAGGAGG CGGGAGCTGA GAGCTACGTG AGCGAGAACA GGTGGATAGT GAGGGACACC GGGGTCAGGG TCCCCATATC CGTGTCAATC AACGACGTGC CAGACCTGGC GCCCTCCCTG GCAGCCCTCA TGGCCGTGAT ACCGGGTGAG TCCAGGATAA TGGATAGCGA AAGGCTGAGG ATCAAGGAGA GTGACAGGAT ATCCACTATC CTGAACACGT TAGCCAGCTT TGGGATATCA GGATCTTACT CAGCCGGCAC TATCACAGTG AAGGGAGGAG AGCCGAGAAG AGGGGAGGTG GAGTGCCCCA AGGATCACAG GATAGCCATG ATGGCTGGCG ATCTGGCCTT GAGGGTAGGT GGTAAAATCA CGAGTGCAGA GTGCGTGAAC AAGAGCAACC CTGGATACTG GAGCGATCTC TCGGCTCTGG GAGGGAAGAT AAGGATTCAT GAGTGA
|
Protein sequence | MFVEIEPSRI KGRVKAPPSK SLGIRYVFLS LLTEVSLENL PESDDVRVAI NAVKSLKEGK DELYLGGSAT TLRMIIPVIL AMGRRVKLDG DDTLRRRPLN ALRWLPGKFS SNSLPMTVEG SLGPETQIEG WESSQYISGL IYAYCLRGEG RIRVIPPISS RGYIFMTADV ISSIGGKVTI QGEEITVECR NLRKFGGSVP GDYALASFYA VGAVLTGGEV EITNLYAPPS YVGDHDVVRM VKEAGAESYV SENRWIVRDT GVRVPISVSI NDVPDLAPSL AALMAVIPGE SRIMDSERLR IKESDRISTI LNTLASFGIS GSYSAGTITV KGGEPRRGEV ECPKDHRIAM MAGDLALRVG GKITSAECVN KSNPGYWSDL SALGGKIRIH E
|
| |
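The sequence fields can be related to each other, and to the GC content and translation table fields, with a short script. This is a sketch under stated assumptions: the gene sequence is taken to be the coding strand as displayed (it begins with TTG and ends with TGA), the codon assignments are those of the standard code, which translation table 11 shares, and the set of alternative start codons listed for table 11 is given from memory of the NCBI definition and should be checked against it. All function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed start codons for translation table 11; an initiator codon
# is translated as methionine regardless of its usual meaning.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(s), 3):
        codon = s[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "TTGTTCGTGG AAATTGAGCC ATCAAGGATC"
print(translate_cds(prefix))  # MFVEIEPSRI
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate_cds` would allow the same comparison against the GC content and Protein sequence fields.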