Gene Tneu_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1042 
Symbol 
ID6164973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp928346 
End bp930646 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content61% 
IMG OID641668194 
Producthypothetical protein 
Protein accessionYP_001794419 
Protein GI171185500 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.119019 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.445057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCAT TGCTGATAGC GGTGCTACTG GCGGCTCTGG CGGTGTATGC ACAACAGATC 
TATGCCCTAC ACTATACGCC TGCCGAATGC GACATCAACC CATATAGATG TATAGTTGGG
ACGACGGCTT ACTCGGAAAG CGCATCGGAA TGGGCAAAAG CGTTTATTGA AGCAAAAGGT
TGGTATGGCG AAATAATGCC AGGCTATATC GTGATTACTG CTCTGCCCAA TGGGAGCGTC
TCCGAGTACA GCTTTACAAA AGCTGTCGCT CTGTTCGACA TAGTGCAGGA TCCATGGAGT
AAGGTGGAGC CGCTGTATGT TCGAGATGTG ATGTACATCG ACTACGTGCT GGGGATCCCG
ATCAAGAGCG ACTCCAGGCC GGTAATAGAC CTGAGCACCT TGAAGCACTT TACCGGGTCT
GTTTACAACG TGACTGGGCC TGGCAACACC GCCACGGTGC ACAACGTGAT TATCCGCTAC
AGGATATATT CGGATGGGAG GAGGGAGTTC TGCCTTGAGG ATCACAGAAC TTCTCAGTCC
GAGTGGTATA GGTGGGCTGT GAGCTTGGGC TACGATCTTG CCCCTATCTG GGACGACCTT
GTCTTCTACA GCTGCCGGCC TCCCGACAGA GCGCCTAGAA ACATAACCCT CGTAGTGGGG
CAGGACGGCC TTACGGTCTA CGTAGACGGC AACCGCACGT GGTGGCTCTC CAACATCAAA
GGCTACATCT ACATCGGGCC GTACAGGATA GAGCGCATCG AGATGTATGT GTTATACGCC
ATACTGTATT CAAATGTATT CCTTAAGGTG AAGGAGCCTT TAGGCAGAAT CCAGCTCTAC
CTCGGCAACG AGACCCTTGT AGCTCCGCCA AAAGCCGTCA TACCTTATAC AGGCATAGCT
CTCTACAACC TTGTTAGTTC CCATTACATT GCTCACGTGG CGTTTAGTTG GGAAAACGGT
AGCGCGGTTC TTCCGGGTAG AGATGTGGGT GGGCGCGTCT ACGTCCGGCC GGACGTCAGC
GCGTCGGTCT ACGGCAGGCT GGCGTCTCTC CCATACATGG CGGTGATCGA CCTAGGCTGG
CTGAGGGGGA GGTACTGCCC CTACGGCGAC GTTGTTCTGT CCGGAGCGTA TAGGCGGATC
GGCGGCTCCG TCCAGGTGCT GGGGCCGGTC TCCATAGCCT GCACCAGATA CCCGGTGCGC
TTCATACTGC CAAACGGCTC CGCCCTCCTC GCCGTCGCGG AGGCCAACTC CACAGCCCAC
CTCCCCCCAC TCACGGTAGA CCTCGGCAAC GGCACGCGGG CGGTCACCGC CGAGGTAAAA
GTAGATGTGA GAGAGCCCGT GGACGTCGTA GTGCCCGTAT CACAGCGCCT CTACCGCGTC
GTGGTTAAGA CGCCCGCCGG CGTAAACGCC ACCTGGGCGC CCCCCGGCGT GCTTAGGCTA
CCCACCATCG AGCTGTCCAA CGGCACCCGG TACGTCCCCA CGGCGCCTGC GGAGGTCGTA
GTCGCGAGGC CGACTACCGT CGAGCCAAGG TACAGGAGGC AGTATCTAGT GCGCCTGGCG
GCGCCGGCTA ACTCCACCGC GGCGTGGGTT GACGAGGGGG CCCTCTTCAA CGTGACGCTG
GCGGATCCGT GGGTGGTGGG CAACGGCACC ATGTTCTCCG GCCTCCGCGT AAACGGGACT
GGGGAGAGGA GCTGGCGCGT CGTAAAGCCG GCGACGCTTG AGGCAAGCTA CGCCGAAGTC
CACTACTGGG TCTCCGTCGA AACGCCCGTC AACAAAACCG CGGGCTGGAC GCCGAGGGGC
GCCGTGCTGA CTTTCTCCGA CGTCCTCGAC CTCGGCAACG GCACGCGCCT AGTAGGCCCA
TCCCCCGCGT CTGTCGTGGT GGACGGGCCG AAGAACGTGT CTGTGGCGTA CGGGAGGAGG
CAGTACTACG TCCGCGTGGA GGGCGTCGAG GAGTGGAGCG GGTGGGCCGA CGCGGGCTCC
GTTGTGAAGC TGAACTCGAC CGCGGTGGGT GGGGTGGTCT ACAGACCGCT GGAGGACGTC
GCCGTGTTGC ACCCCGGCGT CTACAGGCCG AGGTTCCTCG CCTCCTACAC GTACGAGTTT
AGAGATGTGC TGGGCGTCCC CAACCCGGCG GCCTCTGTGG AGCTCTGCGG CGTGAGAGCC
GGCGCCGACC TGGCGGGGAG GGCACACGTG GAGGCCGAGA CAGACCGCCT CTGCACTCCG
GCGCTCTCCG CCTGGCCCGT CAGCCCATAC ACGCTGGTGG GTTTGGCTGC TGTGGTGTTG
TTGCTTCGGC GGCTACTCTA G
 
Protein sequence
MRALLIAVLL AALAVYAQQI YALHYTPAEC DINPYRCIVG TTAYSESASE WAKAFIEAKG 
WYGEIMPGYI VITALPNGSV SEYSFTKAVA LFDIVQDPWS KVEPLYVRDV MYIDYVLGIP
IKSDSRPVID LSTLKHFTGS VYNVTGPGNT ATVHNVIIRY RIYSDGRREF CLEDHRTSQS
EWYRWAVSLG YDLAPIWDDL VFYSCRPPDR APRNITLVVG QDGLTVYVDG NRTWWLSNIK
GYIYIGPYRI ERIEMYVLYA ILYSNVFLKV KEPLGRIQLY LGNETLVAPP KAVIPYTGIA
LYNLVSSHYI AHVAFSWENG SAVLPGRDVG GRVYVRPDVS ASVYGRLASL PYMAVIDLGW
LRGRYCPYGD VVLSGAYRRI GGSVQVLGPV SIACTRYPVR FILPNGSALL AVAEANSTAH
LPPLTVDLGN GTRAVTAEVK VDVREPVDVV VPVSQRLYRV VVKTPAGVNA TWAPPGVLRL
PTIELSNGTR YVPTAPAEVV VARPTTVEPR YRRQYLVRLA APANSTAAWV DEGALFNVTL
ADPWVVGNGT MFSGLRVNGT GERSWRVVKP ATLEASYAEV HYWVSVETPV NKTAGWTPRG
AVLTFSDVLD LGNGTRLVGP SPASVVVDGP KNVSVAYGRR QYYVRVEGVE EWSGWADAGS
VVKLNSTAVG GVVYRPLEDV AVLHPGVYRP RFLASYTYEF RDVLGVPNPA ASVELCGVRA
GADLAGRAHV EAETDRLCTP ALSAWPVSPY TLVGLAAVVL LLRRLL