Gene Tneu_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1844 
Symbol 
ID6164782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1623238 
End bp1625304 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content55% 
IMG OID641669007 
ProductCBS domain-containing protein 
Protein accessionYP_001795207 
Protein GI171186288 
COG category[R] General function prediction only 
COG ID[COG0517] FOG: CBS domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGG TATATACCTT CCGCCAACTG GTATCAGTCG CCGGCATATC GTGGCTCGGG 
GCCTTTTTGG AGTGGCTAGA CTTCTACACC TTTGCAACAC TGGCGCCTCT CATATCCGGA
AAGTTTTTCC CATCTAAGGA CCCAATTGCC GCTTTGCTCT CTACATTTGC AGCGCTCGCC
ATAGGCTTTC TGTTTAGGCC GCTGGGCGCC ATCTTGTTTG GCAAAATAGG CGACCAATAC
GGCCGTAAAA TAGCATTTAC TCTAGCTATG ACTCTGATGT TGGCGGGAAC GCTGGGCATA
GGCCTACTGC CCACCTACGA CCAGATCGGT ATACTGGCAT CAATCGGCGT CTTCGTTCTT
AGAATAATCC AGGGTCTTGC GTTGGGCGGA GGCTTCGGCG CTGCCCTGGT CTATCTAGGC
GAGTTTGCCC CTGAGCACAG GAGGGGCTTC ATAACAGGTT TCCTCTTCAC AACAGCGCCG
GCTGGCATGG GCACAGCTGC GTTGCTTCAA GTCATCATAG CCTCTATGGT GGGCAAAGAG
ACCTTTGGCC AGTGGGGCTG GCGTATAAAC TTCATCGTGG CAGGCGTGAT AGTGTTTGTG
GTTGCCCTCG TGATTCACTT CTTCTACAAG GAAACCCCCA TCTTCTCTAT GCTCAAGGCT
GTGAGAAGGG TGACTTCGGC GCCTGTGAGA GAGGTGTTCT CCGGTAAATA CTTGCCGCTT
GTGTTGCTCG CATGGATAGG CGTGGTAGGG GCACATGGCC CAGTTTGGTA TACAAATCAG
CTGTTCAATA GCTACTACGT ATCGACCTTC CAAAAATACG TCGATGGCTC AACCGCGAAC
GCGTTACTAT CGACAGCTAC ATACGCCGCC TTGTGGATGT ACCCGCTCTT TGGCTACCTA
TCTGACAAGA TTGGGAGAAA GCCCATACTG TTGCTAGGCA TCTTCGGCAA CGCCCTGTGG
TTCCCCATCG CGTTTTGGCT AATAGACAAG GTGGGGCCGC AGAAGGATCT AACCGCAATG
TGGCTCCTCT TCTGGAGCAT GACCCTCTTC AACGGCATTG GATACAGCGG CGCCATGTCA
GCATACCTCC TCGAGCTATT CCCCGCCAGA ATTAGGCTCT CCGCTGTCTC GCTGTCCTAT
AACCTGGGCT ACGGCGTAAC CGGCGGGCTG ACCCCAACGA TAATAACCGC CCTATATCAA
GCTACACACA ACATATACCT GTCCACAATA CTCTGGTCTA CCTTGGTCCC CGTGCTCATG
GGCCTTGTGT TTCTGTTCAA GGGCTGGGAG ACATTAGGCA CGCGCATTTG GGCTGAGCTT
GCCGCTGGCA AATTCGCCAA GAAGGCCGTG GTGCTTCCGC CCACCGCGCC GATTAGAGCG
GCTGCACAGA AGATGGCCGA AGGCGTGAGA GCCGTAGTAA TAGCCGCCTC AAAGCCTGTG
GGGGTCTTCG GCAGGAGACA GCTAATAAGA GCTCTGGCGT CTGGCGCAAC GCCGGAGGCC
GAAGTAGGCA GATTCGCAAC CCGCGTAGAC TGCGTCGGCG AGGACGCCCC GCTGACTGAG
GTATTCGCCG CAATGGAGAA ATACGGCGTT AGAGACGTCC CCATATGTAA AGGAGACGAG
GTGGTGGGCA TAATAGAGGC CCGCGAGCTA CTCAACGAGG CTCTAGCCCT CAGGGGTATA
GTCAACAAGA AGAAGGCCTT GAGCGTAAGC GCAGGCGATG CCGTGGCTAG AGACCCCATC
ACCGTCCCGC CGAGCGCGAC GCTACGCGAC GTATTGAAAA TCATGGCCGA GAAAAACATT
GGCTTCGTCC CCGTGGTTGA AGACGGGAGA CTCGTAGGCG GTATCTCAGA GAGCGACTTT
GTACAAATAC TGCTGAACAA CACCCCGCTG GACACGCCGG TGGAGAAGGT AATGAGGTGC
CAGCTTATCA CAATTGAAAG GACAAGGCCG GTGAAAGAGG CCGCGGAGCT CATGGTGAAA
CACAACATTA GACATCTGCC GGTGGTAGAA GACGGCAAAG TCGTCGGAGT CCTCTCGGTG
CGCGACCTCC TAAAGGCGGT CGCCTAA
 
Protein sequence
MTTVYTFRQL VSVAGISWLG AFLEWLDFYT FATLAPLISG KFFPSKDPIA ALLSTFAALA 
IGFLFRPLGA ILFGKIGDQY GRKIAFTLAM TLMLAGTLGI GLLPTYDQIG ILASIGVFVL
RIIQGLALGG GFGAALVYLG EFAPEHRRGF ITGFLFTTAP AGMGTAALLQ VIIASMVGKE
TFGQWGWRIN FIVAGVIVFV VALVIHFFYK ETPIFSMLKA VRRVTSAPVR EVFSGKYLPL
VLLAWIGVVG AHGPVWYTNQ LFNSYYVSTF QKYVDGSTAN ALLSTATYAA LWMYPLFGYL
SDKIGRKPIL LLGIFGNALW FPIAFWLIDK VGPQKDLTAM WLLFWSMTLF NGIGYSGAMS
AYLLELFPAR IRLSAVSLSY NLGYGVTGGL TPTIITALYQ ATHNIYLSTI LWSTLVPVLM
GLVFLFKGWE TLGTRIWAEL AAGKFAKKAV VLPPTAPIRA AAQKMAEGVR AVVIAASKPV
GVFGRRQLIR ALASGATPEA EVGRFATRVD CVGEDAPLTE VFAAMEKYGV RDVPICKGDE
VVGIIEAREL LNEALALRGI VNKKKALSVS AGDAVARDPI TVPPSATLRD VLKIMAEKNI
GFVPVVEDGR LVGGISESDF VQILLNNTPL DTPVEKVMRC QLITIERTRP VKEAAELMVK
HNIRHLPVVE DGKVVGVLSV RDLLKAVA