Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1002 |
Symbol | |
ID | 6164842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 894066 |
End bp | 895220 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641668155 |
Product | hypothetical protein |
Protein accession | YP_001794380 |
Protein GI | 171185461 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000137488 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00202904 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGGTTCCG CCGAGTGGCG GCGTAGAGAT AGGCGTTTGC TGGAGCTGGC GGCGCGTTGG TATGTAAAAC ACTACGTCGG GGGGAGAAAT ATAATGAAGC TCCTGGGCTT GTCGGAGGAT GAGAGATATG GGGTGGCCTC TGCGTTGCTG GCCGAGTTTG TAAATGCGAT GACGGTAGTG GTGAAGGCTG CGCTTAGGGA CCTCTTGGTT TATAGGGAAT TTGCTGTCGT ATACGGCGAC GAGATCTTCG GCGCCTTAGA CGTGGGGAGG ACCGTGGCTG TTCTGCCTGC GGGGGTGTAC GCCTCGTTGA CGTACCTCCC CTCGCTTCAA GCGCCTGAGT ATGGGATACT GCGGCGCATG GCCTTGAAAG TATCGAAGTG GGGGAGGGAG GCCGCCGTCG ACGAGGAGAT GAGGAAGGAC GCCGAGAGGC TGAGGCGGAT CGCCAAGAGG TTGCCTAAGG CATATGGCAG AGCGGCAGTG GACGTAAGAG AGGGGCCGCC TTGGCTGAGA CAGGGCTGGG CTATCTACAG AGCCGCCAAG GCGCTTGTGG AGGGGGAGGT ATACGTGGGG GAGAGGAGGG GCGTTGGAAA GGCGTTGAAG TTCGTCAACT GGCGCCTATA CGAGATGTAC ATAGCTATGC TCGTGTTGGA GGCCCTACGG CGCTTGGGTT GGAGGACGGT GGGGGTAGAC GCGGAGAAAC GCGCCGTCTT AGTCGAAAGG GACGGTAAAA CGCTGGCGGT GTATCTTAAC AGAGCGTTGC CGCACCACTC CATAATAGAG GAGGTCGCCG GGGACGAGGT GAGGGGGAGG CCGGATTTAA CTGTCGCAAA CGCCGATGTG AAAGCCGTGG TGGAGTGCAA ATACTCAGAC AGGCCGGGCT ACATCGCAAG AGGCCGCTTC CAAGTGATGA CATATATGTG TGAATATAGT GCGAAAATTG GGATATTGGT ATATCCAGCC GCGTCGGAGG AAGAGGCCGA GGATGAAGAA GAGGAGGCGG CAGTTAGATG GGCAAACAGC GGCAAGCCGA TCCGTATGAA GGACGGCCGC GCCATATACC CCCTGAGGAT AGACCCGGCT TATGGAGCTA CCGCGGATGA GGCCAGGGAA AAACACATCG GCGAGGTGAT GAGGTTGCTG GAGAGGTCAT TATAA
|
Protein sequence | MGSAEWRRRD RRLLELAARW YVKHYVGGRN IMKLLGLSED ERYGVASALL AEFVNAMTVV VKAALRDLLV YREFAVVYGD EIFGALDVGR TVAVLPAGVY ASLTYLPSLQ APEYGILRRM ALKVSKWGRE AAVDEEMRKD AERLRRIAKR LPKAYGRAAV DVREGPPWLR QGWAIYRAAK ALVEGEVYVG ERRGVGKALK FVNWRLYEMY IAMLVLEALR RLGWRTVGVD AEKRAVLVER DGKTLAVYLN RALPHHSIIE EVAGDEVRGR PDLTVANADV KAVVECKYSD RPGYIARGRF QVMTYMCEYS AKIGILVYPA ASEEEAEDEE EEAAVRWANS GKPIRMKDGR AIYPLRIDPA YGATADEARE KHIGEVMRLL ERSL
|
| |