Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0418 |
Symbol | |
ID | 6166163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 374956 |
End bp | 376341 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641667576 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001793812 |
Protein GI | 171184893 |
COG category | [S] Function unknown |
COG ID | [COG1892] Uncharacterized protein conserved in archaea |
TIGRFAM ID | [TIGR02751] phosphoenolpyruvate carboxylase, archaeal type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0376744 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCCC GCCTAATGTG TACACAGCAC CCAGACGCCA CGGTTAAGGT AACGGCGAGC GAGGAGGTAG ACGAGGCGAT CGTGGCGTAT ACAGCATACG GATGCGACGA GGTCATGATC GACTACGAGG GCAAGACCAC GCCCTACGCC CAGCCCAAGG ACATCACGGT GAAGGCGCAC GACTCCGGCC TCCCACTGGG CGAGAAGTTC TACCTCACGC CCAGGATGCC CAACCCCAGG CTGGAGGAGT TCGAGAGGTC TATGCTGACG CTGGAGGCTG CCATACTCGC CAACTATTTC TCGGCCAAGC TCACTGGTAG GCAGGCCATC AGGTGGGTGG TTTTGCCGAT GGTGGCGGAC GTGGAGACCC TCGGCCTCGT CTACAGGATG CTTATCCACA AGACCGAGGC CTACACCAGG GAGACCGGCG TCAAGCTGGA GCCGCCCGAG CTGATACCGC TTATAGAGGA CGCTGTGGCC CAGCTCAAGG CCGACGAGCT CATCGGAGGC CTGCTCAAGC AGATGTCCCA ACCGCCCCGC TACCTGAGGC TCTTCCTGGG GAAGTCCGAC TCCGCCGTTA AACACGGCCA CATCGCCTCC GCCCTAGCCA TAGTCGCAAC GCTTTCGAAG GTGAAGGCGG TGGAGAGGGA GCTGGGCGTC AAGATATACC CGATCCTCGG AATGGGCTCG CCCCCGTTTA GAGGCGCCAT AAACAACCCC ACCCTCTCCC ACCTGGAGGT GATCCAGTAC GCGGGCTACT ACACTGCCAC CATACAGTCC GCGGTGAGGT ACGACACGTC GTACGACGAA TACACACGGG TGAGGGAGTC CATACTAAAC GCCTGTTGCC TACCCAGCAG AGATATAGAT ACGCCCGAGG TGGAGGGGCT GATAACGAAG GCCTCCTCCA CCTACAGATC CCTCATTGTG AAATACGCAG AGAGGGTGGT GCAGGTGGCG AGGCTCGTCC CCGGCACAAG AGACAGGGTG AGCTGGGCCG TCTACGGCAG AAACATCACA GCCGAGGACA GGGTGGTGAA CATGCCAAGG GCCATCGTCT ACACCTCCAC CTGGTACGCC ATGGGGCTCC CGCCCATATT CCTCGACGCT CCCACCGTCG TGGAGCTGGC CAAGTCCGAC CAACTCGACG TGGTCCTCAG GGCGTTGCCC ACCCTAAAGA GGGAGTGGGA GTACGACGCC CAGTTCTTCG ACCCCCAGAC GGCGGCTAAG TACACCTCCG AGGAGCTTGT CAAGACCGTC CAGGAGGCCA TGGACTACCT AGGCATAAAC GCCAGAGCCA GCGGCACCTA CCTATCCCTC CTCAGGATGA ACCGCAACGA GTCTAACATA CTCGCCATGG GCAAATTTAG AAAGTTCCTG GGGTAG
|
Protein sequence | MIPRLMCTQH PDATVKVTAS EEVDEAIVAY TAYGCDEVMI DYEGKTTPYA QPKDITVKAH DSGLPLGEKF YLTPRMPNPR LEEFERSMLT LEAAILANYF SAKLTGRQAI RWVVLPMVAD VETLGLVYRM LIHKTEAYTR ETGVKLEPPE LIPLIEDAVA QLKADELIGG LLKQMSQPPR YLRLFLGKSD SAVKHGHIAS ALAIVATLSK VKAVERELGV KIYPILGMGS PPFRGAINNP TLSHLEVIQY AGYYTATIQS AVRYDTSYDE YTRVRESILN ACCLPSRDID TPEVEGLITK ASSTYRSLIV KYAERVVQVA RLVPGTRDRV SWAVYGRNIT AEDRVVNMPR AIVYTSTWYA MGLPPIFLDA PTVVELAKSD QLDVVLRALP TLKREWEYDA QFFDPQTAAK YTSEELVKTV QEAMDYLGIN ARASGTYLSL LRMNRNESNI LAMGKFRKFL G
|
| |