Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1950 |
Symbol | |
ID | 6164474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1716536 |
End bp | 1718146 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641669113 |
Product | hypothetical protein |
Protein accession | YP_001795311 |
Protein GI | 171186392 |
COG category | [S] Function unknown |
COG ID | [COG3356] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCCT TCGAAAGGGG CTACTACCTC CTCTTTGGGA GATCGGGCTT GAGAGTTGCC CTCTACGCCA CCTTGTTGCT GGCGGCGTTG GCGGTTGTGG AGGGGCTTGT CCACCACCTG GCCCCCCTCC TCTACGCCTT GTATGCCGCG GGTCTCTTCT CCATCCTCCT GGCGGTGGAT CGGGCCGTGG TTAACCCCAG GAGGTCTTAC TACGTCGCCG CCGTGTCGAC CGCAGCCACA GCCGCGCTAG ATGTGGCCTT TGGGAAGCCG CCTCTCGCCT TCGCCCTCGT GGGGGCCATA GTCAGCGCCT TGGTGATCCA GTCGCTACGG TGCGAGCTAC CGGCCTTCGT CCTGCCGCTT GCCTTCGCGG CTACTGTGTA CTACGCGATG GGGCGCCCCG TGCTTGCCGT CTTGGCCTTA TCCTATGCCT TAGTCATATA CGGCTTGAAG CCGGTTATTC GGAGGTGGGC GGGGGGGATA GACGCCGTCT GTATGTTTTC AAGCTTCATA TATTCGGTTT TCGCCGGAGA CGACGGGATC GAGGACGTCT TTAGGGAGCT GGGGAGGGTG GAGAGGGTGC CTCTCCACGT CTATCTCGTG GGCGGCCGCC ACGTGGTGGT TGTGTCGGAC TTCCACCCAG GCCCCTTTAG GCATATCGGC GGCGGATCGC TGGTCGACGT CCTTAACAGG GAGGTGGAGG GCACCGGCTT CCGGTTTACG TTTCTACACG GCGTTGGTAG CCACGAGCGG GACCCCGTCA CTGGGGAGGG CGTGAGGAAG ATAGCCGCGG CTGTGAAATC GGCCGTTTTG GAGATGACGG ACGGACGGCC TCCCCAGGGG GTGCGGCCCG CGGAGGTGGT CTCGGGCGAC GTGAAGATCG TCGGCTTTAG CCTGGGGACC GCCCCCCACC TCGCCGTGGT GAGCAGGTTG AGGTCGGCCT CGGACGACAT CCCTCTGTGG GTCGCCAAGA GGGTTAACCC CGGGAGCTAT CTGCTCGTGG ACGCTCAGAA CAAGTTCGAC GGCGTTGTCC AGTGGCTTGA GGAAGACGTG AAGGCGCTTT CCGAGGGCCT TAGGGTGTTG CAGGGATCTC CACAGTGTCG TAGCTTCTCG GTCGGGGTTG GGAAGGTGGG GGGAGAGGCT CTGGATCCTC TCGGCCACGA GATCGGCCCC GGCGGCGTTT CGGCGATTGT GAACGAATGC GATGGGGAGA GGGCTCTGCT GGTGGTGTTC GACGGGAACA ACCTAGACTG GGGGCTCTAC GGGAAGATCG TGGATAGATA TCGGAGGCGG GGGTACGCCG TGGTTGAGGT GGCCACCACC GACACACATA GGGCCACTGG GGTTGGGTTC GGCAGGGGGT ATAGGATCGT CGGCGAGCAC ATCGACCACG GGAAGATCTT GGAGGCCGTG GACCTAGCTG TGGCGGAGGC CGAGCGCCTC CTCGGCCCAC ACGCCGTCTC CTACAGAAGG GTGGAGGTGG AGGCTGAGGT GATGGGGGAG GAGGGGTTTA GGAAAATACA GCGCGCCGTG AGGATCTACA AGAGGGTTGG GGGGCTGGTG CTGGGCGCGG TCTTTCTAGC GCCTACGGCG TTGGTCACGC TGTTTGCATA A
|
Protein sequence | MRAFERGYYL LFGRSGLRVA LYATLLLAAL AVVEGLVHHL APLLYALYAA GLFSILLAVD RAVVNPRRSY YVAAVSTAAT AALDVAFGKP PLAFALVGAI VSALVIQSLR CELPAFVLPL AFAATVYYAM GRPVLAVLAL SYALVIYGLK PVIRRWAGGI DAVCMFSSFI YSVFAGDDGI EDVFRELGRV ERVPLHVYLV GGRHVVVVSD FHPGPFRHIG GGSLVDVLNR EVEGTGFRFT FLHGVGSHER DPVTGEGVRK IAAAVKSAVL EMTDGRPPQG VRPAEVVSGD VKIVGFSLGT APHLAVVSRL RSASDDIPLW VAKRVNPGSY LLVDAQNKFD GVVQWLEEDV KALSEGLRVL QGSPQCRSFS VGVGKVGGEA LDPLGHEIGP GGVSAIVNEC DGERALLVVF DGNNLDWGLY GKIVDRYRRR GYAVVEVATT DTHRATGVGF GRGYRIVGEH IDHGKILEAV DLAVAEAERL LGPHAVSYRR VEVEAEVMGE EGFRKIQRAV RIYKRVGGLV LGAVFLAPTA LVTLFA
|
| |