Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1007 |
Symbol | |
ID | 6164914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 899278 |
End bp | 900735 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641668160 |
Product | extracellular solute-binding protein |
Protein accession | YP_001794385 |
Protein GI | 171185466 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.256846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000168852 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGTGGCGG TGGCCATAAT AGCCGCCCTA TTCGCGGCGC AGCAACAACA GGCGCCTGCG GCGTCTCCCC CTCCTCCCAC GGCCTCGACG CCTGCGACCG GCACGGTGAC CTCCACGCCG ACGACCACGG CGGCGTCTCC CACGCCTACG GCGACTTCTC CGGAGGTCTG CACCACGTTG GTGGTCATAA CTAGGCACCC CACCGACATC CTAGACGCGG CGAGGGCCCT CTTTCTACAG AGCGACGTGG CGAAGAGATA CGGCGTTAGA GACGTGGTGT TTAAGCCGGT CCCCGCCGCG CAGTGGAGGG CCTTGATCCA GGCAGGCCAG GCGGACGTGG CGTGGGGAGG GGGGCCGACG CTCTTCGACT CGCTGTATAA AGACGGCCTG CTCCTGCCGC TGGAGGGGGA CGAGGTGAAG AGCGCGCTGG CCCAGATACC CAAGACCATC GCGGGGATGC CCATGATGCG CGTTGGGCCA GACGGCAAGG TCTACTGGGT GGCCTGGGCG GTGTCCAGCT TCGGCATCAC CATAAACACG CAGGTTCTCA AGGACCTCGG CCTCCCCGAG CCGAGGAGCT GGGCAGACCT CGCCTCGTTT GAGTACGGGA AGGCCGTGCT GAGGGGTACC CCGGCCGCCG GCCTTTCGTC GCTGACTAAG TCCACCAGCA ACACGAGGAT CGCCGAGATC ATCCTCCAGG CCTACGGCTG GGACGAGGGG TGGAGGGTGA TCACTCTCAC GGCAGCCAAC GGCAGGATAT ACGGCGGTAG CGAGGCCGTA AGAGACGCCG TGATCGCCGG CGAGATAGGC GCCGGGTGGA CCATCGACTT CTTCGGCTAC ACGGCCCAGC TCCAGAACCC GGCGACGAGG TACGTCGTGC CCAACGACAC CTCCGTCAAC GGGGACCCCA TCGCCGTGGT GAAGGGCACA AGGTGCCGCC AGGCGGCCGA GGCCTTCGTG GCCTGGGTCA TCACCCAGGG TCAGGTGGTG GTGTTTGACC CCAAGGTGAA CAGGATGCCG GTCAACCCCA GCGCCTTCGA CACGCCCGAG GGCAAGAAGA GGGCCGACCT AAAGGCCGTC TACGACTACA TGCTCCACCT CAGGACTATA AACTTCAACG ACACCCTGGC CCTCGCCACC GAGAACGTGG TGATGTACTA CTTCGACGCG GCGGTGACGG ACAACGCCCA GCTCCTCCAG GACGCCTGGG CCAAGCTGGT CAAGGCCTAC CTCGGCGGGA GGATAGACAG AGCCAAGGCC GTGGAGCTGG CCCGGCGCCT GGGCGAGCCC GTCACCTTCA AGGACCCCGA CACCGGCCAG ATGGTCAAGC TGACGCTGGA GTACGCCATG AAGATCAACG ACAAGCTCAA GGACCCGGCC TACCGCGACA AGATCTACGC CGCGTGGAGG GAGGCCGCCA GGGAGAAGTA CAGGGAGGTG GCATCCCAAA TCCCCTAG
|
Protein sequence | MVAVAIIAAL FAAQQQQAPA ASPPPPTAST PATGTVTSTP TTTAASPTPT ATSPEVCTTL VVITRHPTDI LDAARALFLQ SDVAKRYGVR DVVFKPVPAA QWRALIQAGQ ADVAWGGGPT LFDSLYKDGL LLPLEGDEVK SALAQIPKTI AGMPMMRVGP DGKVYWVAWA VSSFGITINT QVLKDLGLPE PRSWADLASF EYGKAVLRGT PAAGLSSLTK STSNTRIAEI ILQAYGWDEG WRVITLTAAN GRIYGGSEAV RDAVIAGEIG AGWTIDFFGY TAQLQNPATR YVVPNDTSVN GDPIAVVKGT RCRQAAEAFV AWVITQGQVV VFDPKVNRMP VNPSAFDTPE GKKRADLKAV YDYMLHLRTI NFNDTLALAT ENVVMYYFDA AVTDNAQLLQ DAWAKLVKAY LGGRIDRAKA VELARRLGEP VTFKDPDTGQ MVKLTLEYAM KINDKLKDPA YRDKIYAAWR EAAREKYREV ASQIP
|
| |