Gene Tneu_0484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0484 
Symbol 
ID6165954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp440974 
End bp442068 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content47% 
IMG OID641667641 
Productglycosyl transferase family protein 
Protein accessionYP_001793877 
Protein GI171184958 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0461685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000461867 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAT CTATGGCCAT ATTTGTTGTA AATTATAATT CAGTGTCAAC TATGGGCCAG 
AGGGTATTCA ATTTTCTTGA TACATTTGTA GATGTCGTGG ACCAGGACGT AGATGTGTGG
CTAGTTGATA ATGGGTCAAC TGACGGATCT TACAGCGTAC TTTCGCAACG CTACGGCGGG
ATGCTGAAGT TCTTGGCCCT CCCTAAGAAT CTCGGCTACG GGGCCGCGTG TAGCCTCGCC
TACAGATATA CAAGATTAAT GGGGCTGGAG TACGACTATT ACGTCTGTAG CAATAACGAT
ATTGAGCTCT TTCCCCAAAA GATCGGCGAA CTGCTCTCCT ATCTCAGAGC GCTGGAGAAG
GCCTATCCCA AGGGGTTCAT TGCGGCGCCG TTGCTATTAA ACGGAAACGA TGGGCTAATA
GATTACGGTG GATACTTCAT CGACGACAGT GGAGGAACTT GGGGCCTGCG CCTGGCGGCT
TTAACTCCAC GGGAAATACG CCGCCTCACT CCGATCAGTT ATGCAGATGG GGCGTTTCAG
ATAGTACATA GAAATGTTGT TGAGACTATA GGTTGGTTTG ACCCCAAGTA TTTCTTGTAT
TACGAAGATG TGGAGTTTTC TCTGAGGGCG TGGAGGGCGA GATTTCCTTC TCTTCTTATA
CCTCTAGTGA TCGGCAGACA CTACCGAAGT GCTAGCACGG GTAGATCAGT CTATAAGGTG
ACATTTCTGT CCTATAGGAA TAGACTACTT ATCATAAGAG AGTACTTGGG CGCATTACCG
CTTCTAAAAT TTCTGCTCTG GATAGTCTCC TATCCTCTTC GGGTTTTCGA CCAGAAGATC
TCGGCTTTAG ATAGATATAT AGAAATTACC GCTCCAGGTG TACCTACGCC GAGGTGGGGC
ATTAGGGAGT ACCTTGAGGT TCTGAGACAT CTAGTCAGAG CTCTATACGA GGGATTTACC
GCCTCCGTTG GTAGCAGAAG AGCAGGGGGG GTCCCTGTAG TAAAAACTTC ATGGATGAAA
TATCTGTCCA TGAAAAGCTT GCTTAATGAC GTTAGATCAG AAATTACAAG AGTGACTAAG
GATGTTGAGG GCTGA
 
Protein sequence
MKKSMAIFVV NYNSVSTMGQ RVFNFLDTFV DVVDQDVDVW LVDNGSTDGS YSVLSQRYGG 
MLKFLALPKN LGYGAACSLA YRYTRLMGLE YDYYVCSNND IELFPQKIGE LLSYLRALEK
AYPKGFIAAP LLLNGNDGLI DYGGYFIDDS GGTWGLRLAA LTPREIRRLT PISYADGAFQ
IVHRNVVETI GWFDPKYFLY YEDVEFSLRA WRARFPSLLI PLVIGRHYRS ASTGRSVYKV
TFLSYRNRLL IIREYLGALP LLKFLLWIVS YPLRVFDQKI SALDRYIEIT APGVPTPRWG
IREYLEVLRH LVRALYEGFT ASVGSRRAGG VPVVKTSWMK YLSMKSLLND VRSEITRVTK
DVEG