Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1418 |
Symbol | |
ID | 6164389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1261177 |
End bp | 1263768 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641668574 |
Product | cell wall anchor domain-containing protein |
Protein accession | YP_001794789 |
Protein GI | 171185870 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.939763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACACA AAATGTTCCT ATTGGTGCTC ATGCTAGGCG TGTTCCTAGC GGCCCAGTAC ACCGCGCCGC ACACGAACCC GGGACCTGCC GCGGATAGAA TAGTCGGCAA GTCTGTCCCC ATAGCACAGG CGGGGGCCGC GGTTAAGGCT GGAGACATAG ACGTCTACAT ATTCGGCATG AGGGCAGCTC AAGCGGCTCC CCTCAAGGGA GACCCCTCTG TGGTTCTCTA CACGGCTGCG GCTGGCTTCA ACGACATTGT TCTGAACCCA GCGCCCTCGA ACCCGCCTTG TGCCAACCCC TTCTCAAGCC GGGCGATTAG ATACGCGATG CAGTTTGTAG TAGACAGAGA CTACGTGGCA AACGAAATTT TCAAAGGCTT CGCAGTCCCC ATGTACATAT GGCTCTCCCA ATACGACCCC ACCTACTCGA TAGTGGCGGA CATAATCTCG CAGCTCGGCA TTAGATACGA CCTAGACTAC GCAAAGGCGA TAGTGGATAG CGAGATGCCT AAGCTCGGCG CCACCAAGGG GCAAGACGGC AAGTGGTACT GCAAGGGCAA GCCGGTCACG CTGATCGGCC TAATCCGTGT AGAAGACGAG CGTAAGGACA TCGGCGACGC CTTTGCCTCC GCGCTTGAGC AGTTAGGCTT TACTGTAGAC AGGAAATATG TGACATTCGA CGTAGCTATT CAGACGGTCT ACGGCACAGA CCCAGCCCAG TTCCAGTGGC ATTTCTACAC AGAGGGTTGG GGCAAGGGCG GTATTGACAG ATGGGATACG AGCTCCATAT CTCAGTACTG CGCCTCGTGG TTCGGCTATA TGCCGGGCTG GGGTACCACG GGTTGGTATA ACTATGCCAA TGCGACAATC GACGAGATAA CCGACAAGCT GTACAAAGGA AAATACACCT CCTTCAAAGA ATATGTAGAG CTATATAGAA AGGCCACCTT GATGTGTATA CAGGAGTCTG TGAGGGTCTT CGTCAATACA AACCTCAACG CCTTTGTGGC ATCTCCGCAA CTTAAGGGAG TCACAGTTGA CCTAGGCGCC GGTCTAAGGG CTTCTGTCTA CAACGCCCGC AACTGGTACG TCCCAGGTAG AGACGTAGTA AACGTGGGGC ACCTATGGAT ATGGACCGCC TCCAGCGCAT GGAACCCTGT GCCGCAGGGC GGTTTTACAG ATGTCTACTC CGTAGACTGG TTCAGGATGA TGTACGACCC CGCCGTGTGG AACCACCCAT TCACCGGCGA GCCGATGCCC TTTAGAGCTA CGTATGTCGT AGAAACCAAA GGCCCGGCCG GCTACTTCGA CGTCCCTGCT GACGCGTATA GGTGGGACGC CAAGCAGAAG GCTTGGGTGA GCGCCGGGGG GGCCAAGGCT AAGTCAAAAG TCGTGTTTAA TTACGCCAAG TACATAGGCG CTAAGTGGCA CGATGGACAG CCGATTAAGC TTGCAGACGT ATTGTTCATC TACGCATTCA TGTGGGATAT TTCAAACGAC CCGCAGAAGG TCGCCCGCGA GTCCGGCGTT GCCTCGTATG TAAATGCCAC GATGAGCTTA ATCAAGGGTA TACGGGTGGT CAACGATACA GCTATAGAGG TGTATATAGA CTATTGGCAT TTCGACCCCA ACTACATAGC GTCTATGGCT GTATATACGC CGGATATGCC GTGGGAGATC TACTACGCCA CGGATCAGCT CGTCTATGTA AAACAGACGT ACGCCGCCTC CAGGGCGTCT GCGACTAAGT ACAACGTCCC CTGGCTTTCC CTCATATTGA AAGACCACGC CAAGGCGGCA GCCGACGTGC TACAAGACGC CCTGTCTAAG GGCGTGTACC CAGAGAGCTG GTTTAAGATA GGCAACAAGA CCTTGTTAAC AAAGGACGAG GCTTTGGCGA GGTACAAAGC CGCTATCGAC TGGTTTAACA AATACGGCCA CTTGATAATA TCGCAGGGGC CCTTCTATCT ATATGCGATA GACACAGCTA AGCAGTACAT AGAGCTACGC GCCTACAGAG ATCCCACCTA CCCCTACAAA CCAGGGGCCT TCTACTTCGG CGTAGCTACA CCGGTTTCCA TAAAGGCCAT AAACGTGCCG GCCGTCACGG TGGGCCAGCC TGCGTCTGTA TCGGTCTCTC TGGAGGTGCC CACCGGCGCT GGAAAGATAT ACTACAAGTG GGGCGTGGTG GACCCCACCA CTGGAAGATT CATTTACATA TCTGAGGAGG CCTCCGCGGC GGGCGCCCCG CTAACCATAA ACATGCCCGC GGACGTCTCG TCTAAGCTTA CGGCCAACAA GCCGTACAAG TTCTGGGTTC TTACATACGC CGAAAATGTG CCTATAGTAT CCGAGGCCAC GCAGGTCTTT GTTCCAAAAG CTGCCGCCCC CGCAACTACC ACGCCCGCCG CCACCACGCC TCCGGCGACC GCCACGCCGA CTGCGACACC GCCTCAGCCG ACTGTTGCGA CGACTACGGC GCCTGCGGCC GGGTCCACCA CGGCCCTAGC GGCAGCTGTG GTGGGTATAT TGGTGATCAT AGCAGCGCTG GCCCTCGCCT TGAGGAAGAA AAGCGGCGGA GCTCAGCAAT AG
|
Protein sequence | MRHKMFLLVL MLGVFLAAQY TAPHTNPGPA ADRIVGKSVP IAQAGAAVKA GDIDVYIFGM RAAQAAPLKG DPSVVLYTAA AGFNDIVLNP APSNPPCANP FSSRAIRYAM QFVVDRDYVA NEIFKGFAVP MYIWLSQYDP TYSIVADIIS QLGIRYDLDY AKAIVDSEMP KLGATKGQDG KWYCKGKPVT LIGLIRVEDE RKDIGDAFAS ALEQLGFTVD RKYVTFDVAI QTVYGTDPAQ FQWHFYTEGW GKGGIDRWDT SSISQYCASW FGYMPGWGTT GWYNYANATI DEITDKLYKG KYTSFKEYVE LYRKATLMCI QESVRVFVNT NLNAFVASPQ LKGVTVDLGA GLRASVYNAR NWYVPGRDVV NVGHLWIWTA SSAWNPVPQG GFTDVYSVDW FRMMYDPAVW NHPFTGEPMP FRATYVVETK GPAGYFDVPA DAYRWDAKQK AWVSAGGAKA KSKVVFNYAK YIGAKWHDGQ PIKLADVLFI YAFMWDISND PQKVARESGV ASYVNATMSL IKGIRVVNDT AIEVYIDYWH FDPNYIASMA VYTPDMPWEI YYATDQLVYV KQTYAASRAS ATKYNVPWLS LILKDHAKAA ADVLQDALSK GVYPESWFKI GNKTLLTKDE ALARYKAAID WFNKYGHLII SQGPFYLYAI DTAKQYIELR AYRDPTYPYK PGAFYFGVAT PVSIKAINVP AVTVGQPASV SVSLEVPTGA GKIYYKWGVV DPTTGRFIYI SEEASAAGAP LTINMPADVS SKLTANKPYK FWVLTYAENV PIVSEATQVF VPKAAAPATT TPAATTPPAT ATPTATPPQP TVATTTAPAA GSTTALAAAV VGILVIIAAL ALALRKKSGG AQQ
|
| |