Gene Tneu_0471 details

Gene Information

Locus tag: Tneu_0471
Symbol:
ID: 6166097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 426882
End bp: 428483
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 63%
IMG OID: 641667628
Product: hypothetical protein
Protein accession: YP_001793864
Protein GI: 171184945
COG category: [S] Function unknown
COG ID: [COG4938] Uncharacterized conserved protein
TIGRFAM ID:
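
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 426882
END_BP = 428483


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1602 533
```

Under those assumptions the result agrees with the Gene Length (1602 bp) and Protein Length (533 aa) fields.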


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0692558
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000612882
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
TTGCAGACGG CGGAGCCCAG GCCCATTGTA AACGCTCTGC GGGCAATCTA TGCGAAGCTG 
TCGCCTCTTG ACAGGTACGC CGCTGGGCGT GTGGAGAGGA GGCTGTCGGC TAGGATTGCG
GAGTATCTAG AGCGCGAGTT CCGGGCGGAG GCGTCGTCGA GCGTCCGGAG GTTGGTTTCT
GGGGTGGTTG AGCTGGTTGC GGCTTTGAGG TCGGGTGGGG TTGAAGAGGC TCGTACCGTT
TTGGGGCGGA TGGGCGAGGT GGGTCTTAAG GTGGCTGAGG CCGGGGGCTC TGTTGTCGTT
AGAGGTCCGC CGATGGAGTG GGCGGTGGAC GTGGGTGTCT TGAGGCAGAT GGCTGTGGAC
GCGTTTTACG GCTTTATGGC CGAGCTTGTG CCTGTGAGGG GGGTGGACGC GGTGCGGCTT
GAGCCGCTCG AGCCGCTTGA TGTTGTGGGC GCGCAGAGAC TCCGTGTTGA GGCGGAGCGC
CGGTATCCGT TGGGGTGGTT TGGGCAGGCA GGTGAAGTAC TTAGGAGGTT GGGTTGTGGG
GGTGATGAGT TGGAGCGTCT GCTGTTCTCG TCCTCGGTGA GTCTTCACTT TGATGTTGGG
GTGGGTGGGA GGTCGAGTGT GTGGCTTGAG TTCCGCATAG ATACGCGGGC TTTCTCGCCG
GGGGGCGCCG GCCGCGTGGA GAAAACCGAT GGGGGTTTGT CCAAGGCGTT GGACGAGTAT
GTGGAGGAGT TGGTGTCTAA GGTTGTGCGC AGGTCGGCTG CCTATCTTTC TAGTGCTGTG
GTGGGGGGCG TCCGCGGCGC TTTGAGGAGC CGCCTTGGTT TTGAGGGGCT GCGTTTTGTT
CCTTTTGGCA GAAGTGTGCT CGTCTTGGCG CTGGAGAGCG CCTCTAGGGA GCCGTATGCC
AGGCCTGTGT ACCTGCGTAG GTTGGTTGAG GAGTTCTATC CAGGCGTGTT GGCGAGCTAT
GTCTACTGGG CGAGCGAGGG GCGGAGGCGT CTGTTGGAGT GGCTCGACGA GGCGGGGGCG
AGGGTTCTCG ACGCCGCCGC GCCTCTGCTG GAGGGCAGGC TGGTGCCGGG CGCCGCAGGG
AGGGTGATGT ACAGGGACTG GCGTGGGTCG CTTGTCGAGA TTCGGCTGTC TTCCGCTCTT
GTGGGCGAGG TCGCCGGCCT TCTCTTCTCC CTGCTGAGCG TGGGCGGTAG ATCTCTGGTG
CTTGTCGAGG AGCCCGAGGC CCAGTTGCAC CCCGGGGCGC AGATAGCGAT GGCGCTCTTC
TTAGTCTCGC TACCGGCTCT GTGCGGGTGT AGGGTCGTGG CTACCACACA CAGCGATCTG
TTGGCCATCA CCATGAGCCA GCTGGCGGTG CAGAGGCCGG ACAGGCAATG GGTTGTGGAG
CTGCTGGCGA GGGTTCTGCC CCATGTGAAG GAGGGCGTTG ACGTGTTGGC TGGGGCCGTG
GCGGAGGCCG CCGTAGACCT GAGGATCTAC GAATTCACAA GAGAGGGCAG GGTGGCGGCT
GTGAGGCCGG AGGACGTGCT CGGCAAGGAG GTACCTGGGA TAAGCAGGGT AATCGACGAG
CTCACCGATT GGGCCTTTCG CCTTGCGAGC CGCCGGAGGT GA
 
Protein sequence
MQTAEPRPIV NALRAIYAKL SPLDRYAAGR VERRLSARIA EYLEREFRAE ASSSVRRLVS 
GVVELVAALR SGGVEEARTV LGRMGEVGLK VAEAGGSVVV RGPPMEWAVD VGVLRQMAVD
AFYGFMAELV PVRGVDAVRL EPLEPLDVVG AQRLRVEAER RYPLGWFGQA GEVLRRLGCG
GDELERLLFS SSVSLHFDVG VGGRSSVWLE FRIDTRAFSP GGAGRVEKTD GGLSKALDEY
VEELVSKVVR RSAAYLSSAV VGGVRGALRS RLGFEGLRFV PFGRSVLVLA LESASREPYA
RPVYLRRLVE EFYPGVLASY VYWASEGRRR LLEWLDEAGA RVLDAAAPLL EGRLVPGAAG
RVMYRDWRGS LVEIRLSSAL VGEVAGLLFS LLSVGGRSLV LVEEPEAQLH PGAQIAMALF
LVSLPALCGC RVVATTHSDL LAITMSQLAV QRPDRQWVVE LLARVLPHVK EGVDVLAGAV
AEAAVDLRIY EFTREGRVAA VRPEDVLGKE VPGISRVIDE LTDWAFRLAS RRR
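
The gene sequence begins with TTG, which normally encodes leucine, while the protein sequence begins with M. This is consistent with translation table 11 (named in the Gene Information fields), where TTG is one of the alternative start codons and is read as methionine at the initiation position. The sketch below illustrates that rule in plain Python; `translate_cds` is a hypothetical helper written for this example, and the codon table is the standard amino acid assignment that table 11 shares with table 1.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    A recognized start codon in the first position is read as Met.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCAGACGG CGGAGCCCAG GCCCATTGTA"))  # prints: MQTAEPRPIV
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence (with the spaces and line breaks left in) should yield the full protein in the same way.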