Gene Tneu_0523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0523 
Symbol 
ID6165816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp474273 
End bp475343 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content67% 
IMG OID641667676 
Productglycosyl transferase family protein 
Protein accessionYP_001793912 
Protein GI171184993 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.343071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTGTGC TTGCGGCTGC TCTGGCGGCT GTACACTTCG GCGCGCCGGC CCTCTACCTG 
CTATACCTAC GCGCCGCCCC GAAGAAGCCC CTCCAGACGG CGGCTATATA CCCCAAGGTG
GCGGTGGTCG TGCCCACCTA CAACGAGGCG CGGAACATAG AGGCTAAGCT CGAGGACGTA
TACAGCCAGA GCTACCCCAG AGATAGGATG TCTATATACG TCGTCGACTC GGCCTCCACC
GACGGCACAG CCGAGGCGGC GGAGCGGTGG GCCGCCGGCA GGAGAGACGT CAGAGTCGTG
GTGCTGAGGG AGCCCAAGAG GCGGGGGAAG GCCCACGCCT TAAACACGGC CCTTGCCCAC
CTCGCCGACG AGGAGGTGGT GGTGATCACC GACGCCGACT CCCGCTGGCT AGACCGAGAC
ACGCTGAGGA GGGCCGTGGC CTACCTCGCC GCCGCCGACG CGGTCTCCTG CCTAAAAAGG
CCGGCGGGGG GAGGCCCCAC GGAGGAGGCC TACCGCACGT GGTACAACAG GCTGAGACTC
GCCGAGAGCC TAGTCCACTC CACCCCGGTC TTCCACGGCG AACTCGCCGC CTTTAGACGG
GAGGCCATCG CCGGGGGGTT CCCGGAAGAC GTCGGCGCAG ACGACAGCTA CGCCGCCATT
AGGATAGCCG CAGCGGGGGG ACGCGCCGTC ACGCCGCCGG ACGTGTGGTG CATAGAGGCG
GTGCCCCAGA GGGGCTACCC CACGTGGCGC CTAAGGCGGG CGCAACACCT GATACAAGCC
TTCGCGCGGG CGCTTCCAAA CGTCGCCAAG GCCCCGCCGC CCTACAGAGT AATCCTCGCC
GCCGAGGCCT ACCTACACCT GTTTAACCCA TGGCTCCTCC CAGCCGCCGC CGCCCTAGCC
GCCGCCTCCG GACCCCCCGG CCTGGCCCTC CTCGCCGCAG GCGCCGCCGC GTTGCTGTAC
AAGCCCTACA GAGCCTGGGC GGCGGGCCAG ATATACCTAA TGGCAGCCGC CCTGAGAAAC
ATATGGAACA AGGAACTCAT ATGGCAAAAA CAAGAAAAGC CGCCGCCGTA A
 
Protein sequence
MLVLAAALAA VHFGAPALYL LYLRAAPKKP LQTAAIYPKV AVVVPTYNEA RNIEAKLEDV 
YSQSYPRDRM SIYVVDSAST DGTAEAAERW AAGRRDVRVV VLREPKRRGK AHALNTALAH
LADEEVVVIT DADSRWLDRD TLRRAVAYLA AADAVSCLKR PAGGGPTEEA YRTWYNRLRL
AESLVHSTPV FHGELAAFRR EAIAGGFPED VGADDSYAAI RIAAAGGRAV TPPDVWCIEA
VPQRGYPTWR LRRAQHLIQA FARALPNVAK APPPYRVILA AEAYLHLFNP WLLPAAAALA
AASGPPGLAL LAAGAAALLY KPYRAWAAGQ IYLMAAALRN IWNKELIWQK QEKPPP