Gene Tneu_1089 details


Gene Information

Locus tag: Tneu_1089
Symbol:
ID: 6165555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 972979
End bp: 973995
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 64%
IMG OID: 641668241
Product: ABC transporter periplasmic-binding protein
Protein accession: YP_001794466
Protein GI: 171185547
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4143] ABC-type thiamine transport system, periplasmic component
TIGRFAM ID: [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
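
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the stop codon counts toward the gene length but not the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(972979, 973995)
print(length_bp)                  # 1017
print(protein_length(length_bp))  # 338
```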


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.167186
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0159055
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATTA GGGGTTTATT GGCGGCTGTG GTGGTCGTCG CCGCCGTGTT GGCGGCCCTC 
ATGGCGTGGC AGTCCCTCCA GCAGCAGATG GAGAGGAAGC TCGTGATCGT GGGGCCGGCC
GGCATCGGCG ACTTGGGCAG GGAGCTGGCC AGGAGGTTCA GCGAGAGGTA TGGGGTAAAC
GCCACCTTTG TGGCGCTTGG GGGGGCGGTG GAGATGGTGA ACGAGCTGGT GAGAAACAGG
GACAACCCGC CGTGGGACGT GGCCATCGGG GTGCCCGAGT TCTACTACAC CGTTCTGGTG
GAGAAGGGCG TGCTCTACTG CCCCGGCTTC AAGGTGGAGG GGGTGCCGGC CGAGGAGTAC
TGGGATCCCC ACGGCTGCGT CTACCCGCTT GACAAGTCCT ACATCGGGAT CGTCTACAAC
GAGTCGGCCC TCGCCGCGCG GGGCCTCAAG CCGCCTCAGA CCCTCGACGA TCTTCTGAAG
CCGGAGTACA GGGGGCTTAT CACATATCCC AACCCGGTCC AGTCGGGCAC CGGCCTCGCC
GTGCTCTCCT GGGTCATGTC TGTGAAGGGG GAGGAGGAGG GCTGGCGCTA CCTCAAGCAG
CTGTCCAGCC AGATCTCCAA GATCGGCTAC CCGTCCGGCT TCACGCCGTT GAGAAGCGCG
TTGAAGAGGG GGGATGTTTT GATCGCCCTC TCGTGGTACA GCCACGCCAT CGACCCCGGG
ACCCCCAGCA TGAAGGCCGC GACGTACAGC GCCTTCCTAT ACAAGGAGGG GGTGGCCGTG
TTGAAAAACG CCAGGAACAG GGACCTGGCC CTGGAGTTCG TCAAATTCGC GCTGAGTAAG
GAGGGGCAGG ACCTGGTCGA CCCCTACAAC TACATGCTCC CGGTTAGGCC AGACGCCGTG
GTTAAAAACA ACCTGGGCCT CCCGAGGCCG CAGTCCGTCG TCGTCTACAA CCCGGCGCTG
GGGTCCAAAG CCGACGAGTG GAGGCTGAGG TGGCAGAGGG AGATCGCGTC TGGGTGA
 
Protein sequence
MSIRGLLAAV VVVAAVLAAL MAWQSLQQQM ERKLVIVGPA GIGDLGRELA RRFSERYGVN 
ATFVALGGAV EMVNELVRNR DNPPWDVAIG VPEFYYTVLV EKGVLYCPGF KVEGVPAEEY
WDPHGCVYPL DKSYIGIVYN ESALAARGLK PPQTLDDLLK PEYRGLITYP NPVQSGTGLA
VLSWVMSVKG EEEGWRYLKQ LSSQISKIGY PSGFTPLRSA LKRGDVLIAL SWYSHAIDPG
TPSMKAATYS AFLYKEGVAV LKNARNRDLA LEFVKFALSK EGQDLVDPYN YMLPVRPDAV
VKNNLGLPRP QSVVVYNPAL GSKADEWRLR WQREIASG
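
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how one might strip that layout, compute GC content, and translate the coding sequence with the amino acid assignments of translation table 11. The helper names are hypothetical, and the codon table is built from the standard NCBI `TCAG` ordering rather than taken from this page.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence, copied from the listing above.
print(translate("ATGTCCATTA GGGGTTTATT G"))  # MSIRGLL
print(gc_content("ATGTCCATTA"))              # 0.3
```

Passing the full gene sequence to `gc_content` and `translate` should allow a comparison against the GC content and protein sequence listed in this record.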