Gene Tneu_0516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0516 
Symbol 
ID6165896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp467680 
End bp468945 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID641667669 
Productglycosyl transferase group 1 
Protein accessionYP_001793905 
Protein GI171184986 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.628757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.125688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCGGG GCGCCCCCCT ATTAAAAGAT ATATGCCAGC CGGTGAGCTA CGCCATGGAG 
CGCTACGCCG TCGTCGCCCA CCACTACTGG GGCACCCCAG GAGGGGGCCA GCTCGTCTGC
GCCGCCGCCG CCAAGGCGCT GGAGGAGGCG GGCTACAGGC CGGCCCTAGC CGGCACCTTC
AAATTCGACC CCCGGAGATA CGTCGAGTGG TACGGCATAG ACATATCTAG GTACCCCGTC
GAAACCCTCC CCATAGCCCC CAGGGCCTTC GGCCTCTGGA GCAGGCTCTA CGTCTGGCTC
CCCGCCAAGA AAGCCGCCGA GAGGTACAAG CCGGAGCTCC TCTTCATCGA CGAAGTCGCC
TACAAGCCCC TCGCCAGGGG GAGGAGGTTC AGGCTAGTGG AGTACATCCA CTTCCCCTTC
GAGGTTGTGG TGGACCCAAG GTACAGGGGG ACCGGCCTCG CCTACGGGGA GGACCCCTAC
ATCATGGAGC GGTACGGCAG ATTCCCCATG AGCCTCTACT GGAGGGCCTT CGTCTGGGGG
CTCAAGAGAT ACGCCAGAGA AAACCCCTTC CACTACGCCG ACGCCGTGTT GGTCAACTCC
CGCTGGACCG CCCAGGTGGC CAAGATGGTG TACGGCCAGG AGCCCCAGGT CCTCAACCCG
CCCCTGCCCC CAAACGTAGA GGTGGTCGAG AAGCCGAGGC CCTTCGAGGA GAGGGAGCCC
ACCGTCGTGA TGCTAGGCCG CTTCTCCCAG GAGAAGCGCT ACCACTGGGT CGTCACAGAG
GTCGCGCCCA GGCTGTTGAA GGAGGTCCCC GGCGCCAAGA TCATAATCTT CGGCGGCGCC
GCCACCCCCA CCCTACAGGC CTACAGAGAC AGGGTGAGGA AGATGGCGGA GGACGCCGGC
CTAAAGACGG CAGAGACGCT AGACGCCCAC GCCCACATCT ACCTAATAGC CAACGCCCCC
CGCCGCGTCA TAAACGACGC CATGGACAAG GCCAGGGCCT TCCTCCACGC CACCATAAAC
GAACACTGGG GCATAGCGGT GGCGGAGGCC ATGGCCCGAG GCCTCACGCC GGCGGTCCAC
AGGTCCGGAG GCGCCTGGAC AGACCTCGTC ATGGAGGGCA GATACGGCCT AGGCTACACA
ACCGCCGAAG AGGCCGTAGA GGCGCTGGCG AAGCTCCTCA CCCAGAAGGC CAGCTACGCC
CCCCAGGAGA GGGCCCGGGA GCTGGTCTTC CAGAACTTCG CCAGCGCCCT CCGGAGGTAC
ATATGA
 
Protein sequence
MRRGAPLLKD ICQPVSYAME RYAVVAHHYW GTPGGGQLVC AAAAKALEEA GYRPALAGTF 
KFDPRRYVEW YGIDISRYPV ETLPIAPRAF GLWSRLYVWL PAKKAAERYK PELLFIDEVA
YKPLARGRRF RLVEYIHFPF EVVVDPRYRG TGLAYGEDPY IMERYGRFPM SLYWRAFVWG
LKRYARENPF HYADAVLVNS RWTAQVAKMV YGQEPQVLNP PLPPNVEVVE KPRPFEEREP
TVVMLGRFSQ EKRYHWVVTE VAPRLLKEVP GAKIIIFGGA ATPTLQAYRD RVRKMAEDAG
LKTAETLDAH AHIYLIANAP RRVINDAMDK ARAFLHATIN EHWGIAVAEA MARGLTPAVH
RSGGAWTDLV MEGRYGLGYT TAEEAVEALA KLLTQKASYA PQERARELVF QNFASALRRY
I