Gene Information |
Locus tag | Tneu_0099 |
Symbol | |
ID | 6164334 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 84911 |
End bp | 86218 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641667266 |
Product | hypothetical protein |
Protein accession | YP_001793503 |
Protein GI | 171184584 |
COG category | [R] General function prediction only |
COG ID | [COG4882] Predicted aminopeptidase, Iap family |
TIGRFAM ID | |
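
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon which is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Tneu_0099.
length = gene_length_bp(84911, 86218)
print(length)                     # 1308
print(protein_length_aa(length))  # 435
```

Both results agree with the Gene Length (1308 bp) and Protein Length (435 aa) fields listed in the record.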
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.428389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00133655 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGCCGAGG TGTACGCAAA ATGCACCTCC TACAGGGATC TCGTGGCGGG GTCCCCCGCC GAGAGGGAGT TTCTACAGTG GCTGATGGCC TTCCTCGACT CGCCGAGCCT CTGGTTCCAC CTATCGCCTG TGGAGGTCCT ACACTGGGAG GACCTCGGCA CGAGGCTCGT GGTGGGGGGC GAGGAGGAGG TGGGCCTGGC GATGCCCTAC TCCCGCCCAA CCTCGGTGGA GGGGAGGCTG GCGCCTTTAG ACGGCGACGT GGAGGGGAAC ATAGCCGTTG CCGACCTCCC GGCGGATTTA GACGACGCAA AGTACATAGT TCTAGAGGCC GCGCGCCGGG GGGCCTCGGC CGTCGTCTTC ACGGGGGAGC GGCCGCGTCG GATAGTGGTC ACCGGCGAGC CCGGGTTCAA GAGCGACGCC GCCCCGCCGC CTATACCGGC GGCGAGCTTC GGAGACCTCA AACGGCACTT GGGCAGGCGG GCGAGGCTGG AGGTGGCGAC CAGGTCTAGG GTTACGTACA GCTACAGCTT GATCGCCTTT AACAGCTTCG AGAACACCCC CATGATATCG GCCCACTGGG ACCACTGGCT TGTAGGCGCT ACGGACAACT GCGCGGGGGT TGAGGCAGCT GTTTTAGCGT TCGTCGAGCT AGCCGCGGGG GAGTTCCCCA TCGCCCTCGG GCTTTTCACG GCGGAGGAGG GGGTGGCGCC GCATGTGCCG TCCTTCTACT GGGCCTGGGG GTCTTTCAAC TACCTGAGGC GCTGGAGGCC TACCCTGTTG GTGAACGTGG ACGTGGTGGG GGTTGGGACC CCCCGCATCT ACGCCATGCC CTACCTCCAC GAGGAGCTTA AGTCGCTTGG CCCCGTGGAG TGGCCCGTGG CGTATTTCGA CAGCGTACAC TACGAGAGGT GGGGGCTCCC CGCCTTGACC ATCTCGTCGC TGAGAGACGC CTGGGATAGA TACCACAGCC CCGCCGACAG CCACGTCGAT GTTGAAAACG TCCTCTACGT GGCCGAGTTG GCGAAGCGTG CCGTGAAGGT GAAGCCCAGG GCTCCCCAGG TGGGGCTTGA GGAATACGGC TTGGCGAACC CCGAGGCGGA TCCCTACGCC GCGTGGTCCG CCGTGTATAA CTACCTCGTC CTCTTCAGAG ACCTCAGACA TTCGGAGATC GTCTACGCCA ACGTGCCCAG GTTCTTGAGG GGCGAGGCGG GGAGCTACAG CAGGATAGAT CTGCTGGGGG GCCCCACTCT CTGTGTTGGG GACTGCGCCG GCGCCTTCGA GACCTACCGG GCGCTACTTG GCCTCTAG
|
Protein sequence | MAEVYAKCTS YRDLVAGSPA EREFLQWLMA FLDSPSLWFH LSPVEVLHWE DLGTRLVVGG EEEVGLAMPY SRPTSVEGRL APLDGDVEGN IAVADLPADL DDAKYIVLEA ARRGASAVVF TGERPRRIVV TGEPGFKSDA APPPIPAASF GDLKRHLGRR ARLEVATRSR VTYSYSLIAF NSFENTPMIS AHWDHWLVGA TDNCAGVEAA VLAFVELAAG EFPIALGLFT AEEGVAPHVP SFYWAWGSFN YLRRWRPTLL VNVDVVGVGT PRIYAMPYLH EELKSLGPVE WPVAYFDSVH YERWGLPALT ISSLRDAWDR YHSPADSHVD VENVLYVAEL AKRAVKVKPR APQVGLEEYG LANPEADPYA AWSAVYNYLV LFRDLRHSEI VYANVPRFLR GEAGSYSRID LLGGPTLCVG DCAGAFETYR ALLGL
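
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it starts with GTG and ends with TAG, which is consistent with that), that translation table 11 uses the standard amino acid assignments with ATG, GTG, TTG, CTG, ATT, ATC and ATA as possible initiators, and that an initiator codon is always read as methionine. The helper names are hypothetical.

```python
from itertools import product

# Standard amino acid assignments (shared by NCBI tables 1 and 11),
# listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; the first codon is read as M, a trailing stop is dropped."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not an initiator codon in table 11")
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
print(translate_cds("GTGGCCGAGG TGTACGCAAA ATGCACCTCC"))  # MAEVYAKCTS
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.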