Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1050 |
Symbol | |
ID | 6165510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 935208 |
End bp | 936263 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641668202 |
Product | amidohydrolase |
Protein accession | YP_001794427 |
Protein GI | 171185508 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.794357 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTCC GGGCCCGCTA CGTCCTGGCC GGGGGGCTTG AGCTGGTGGC AGACGGCGTT GTGGAGGTGG ACGACGGCGG GGTGGTGGTG GGGGTGGGGA GGTACACGGG GGGCGTGGCG GCGGACTTGG GCAACGTGGT GCTCATGCCC CAGCTGGTGA ACGGCCACGT GCATGTGCTC GACGCAGCCA TCTTGGACCG CGACGACATG TACATAGACG ACCTGGTGGG GTGGCCCCAC GGCGTTAAGT ACCACGTCGT GAGGAGGCTG GTGGAGAGGG GGAGGCACAC AGCCCTGCTG GAGAAGGTGG CGAGGAGGAT GAGGAGATAC GGCGTGGGGT GCGCCCTGGT GTACGCCGAG TACGCGGCGG GGGACGTGGA GGCGGCCCTC CGGCGCTGGG GGATCGAGGC GGTGGTGTTC CAGGAGGCGC ACGGGGGGTT CCCCAGCTAC CCCAACGTCC AGGTGGCCAC CCCCGTCGAC CACCCCCCGG AGTACCTCCG CCAGCTCAGG GCCAGGTACA GGCTGGTGTC TACCCACGTC TCCGAGACCG AGGACTGCCA CGAGGCGGGC GACCTGGAGC TGGCGCTGAA GGTGCTGGAC GCAGACGTCC TCGTCCACCT GGTGCACCTC ACTCCCGAGG AGGTCGGCCA GATCCCCCCC TCCAAGACGG TCGTGGTGAA CCCCAGGGCC AACGCCTACT TCGTGGGGCG GGTGGCGCCG GTGCCCCAGC TACTGCACCT CAAGCCCCTC CTCGGCACAG ACAACGTCTT CATGAACGAG CCCGACCCCT GGGCGGAGAT GAGGTTCCTC CACGCCTACG CCGCCGCCGC GGGCTGGAGG CTGGAGGAGA GGGAGATACT CTCCATGGCC ACCGTCTGGG GCTGGGAGAA GATGAGGTGC GTCCCCCCCA TCGAGCCCGG CCACAGGCTT AGGGCGATCG CCGTGGCGGC GCCGTACGCG GGGGAAAAGG TGCTGAAGTT CCTCGTGAAG AGGGCCGCCC ACACAGACCT CGTCGCCTTC GTGGAGGGCT CCTCTATAGA GCCGCCCCCC TCCTGA
|
Protein sequence | MRLRARYVLA GGLELVADGV VEVDDGGVVV GVGRYTGGVA ADLGNVVLMP QLVNGHVHVL DAAILDRDDM YIDDLVGWPH GVKYHVVRRL VERGRHTALL EKVARRMRRY GVGCALVYAE YAAGDVEAAL RRWGIEAVVF QEAHGGFPSY PNVQVATPVD HPPEYLRQLR ARYRLVSTHV SETEDCHEAG DLELALKVLD ADVLVHLVHL TPEEVGQIPP SKTVVVNPRA NAYFVGRVAP VPQLLHLKPL LGTDNVFMNE PDPWAEMRFL HAYAAAAGWR LEEREILSMA TVWGWEKMRC VPPIEPGHRL RAIAVAAPYA GEKVLKFLVK RAAHTDLVAF VEGSSIEPPP S
|
| |