Gene Tneu_0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0115 
Symbol 
ID6165905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp100230 
End bp103532 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content59% 
IMG OID641667282 
Producthypothetical protein 
Protein accessionYP_001793519 
Protein GI171184600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0316436 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGGCC AGGCGCTGGT TCTGGTTGGT CTTATTCTCG TCGCGGCTTT GGCCGTGGCC 
ATGATGGCGT TTTACGCGTT GCAGAGCTCC GCCTCCCTGG CGCCTTCCAA GCCGTCCTAC
GGCTATATAT CGAGGTCTTG GCCCGACCTA GTGAAGCTGG CCGGCGGCTA TCTGACCTAT
GTGGCGTCCC AGAGCGTCTT CGCCTTGGCT AGGGGCTCCC TAGACGTGGG CTTATACGGG
AGGCCGTACG ACGGTAGGTG GCTTCAGTAC AACGAAACGG CTCGGCGGGC TCGGCTGGGC
TTGTTGACCA TGTCCGCGGC CCTCGCCTCT TTGCAGGTGA ACGCCACGGG CGGTCTCTGG
TACTACATAC GGGGCTTCAA CGGGTCGCTG GGGCCCTACC CGCTCGCCGC CACCTACAGC
AACTGGAACC TCGATGTGGT GAGAACGGGC GTTGCGATAC CTTGTGGCTC GGACACGCTG
TACATCCGGC TGGTTCCGCT GAACGCCAAG GCGGCGCGGT TCGTTGTCCA GTCGCAGGTG
GACGTAGGCG TGCCCTACGT CTTGACGTAT CTGGGACACA CCAGCGACAT AGTGACTGTG
GGCGCCGGCT ATAACGAGAT CCTCTCGGAC ATGTTGCGCA TGGGGAACCC AAACGGCAAG
CTGGCCCTAT ACGCCTTCGA CTCCTCCGTC CTAGGGCCGA ACCTCTGTTA CAAGACCGAC
GACCAGATAG CCCAGGTCGT CCGAAGCGGC ATAAAGCCGT TGCCGTGGTA CGCCGAGGTG
TTCACGTGCC CCTTCTGCAT GTTGCACTTC CAGGTTCCGC CCGGTTTTCT GCAGAAGGGG
AGGACCTCGG AGGTTCTGAT CACATATACG ACGAGCGGCT CCATATCGGC GCCGCAGTTT
AAGGTGGTTA TTTCCAGGCG GGCCGGCTAC AGCGAAATCC CGATATATGT AAACGCGTAC
ACCACAAGCG ACCCGCGCGC CGTCTTTCCG GCCTACTTCG GCGCACGAGA CATCGCTACG
TGGCAAAGTG CCGTTGTGGC CACAGACGGG GACTGCTATC CGGCGCCTCA AGGCGGCGTC
GTGGACTACG GCGTGGCTGT GACGACGAAC TATCTGCCCC CCGGCACAAC GATTAGCCGG
AGCGTAAGGA TATACGCCAC CTACGCGCCT AACACCCAGA GGTACGGCTT CAACGTGTTT
ACATCATACA GCTGGCCCTA CTCCGACGTC GGCTACAAGG TCGAGGCGCT GGTTCTGCCC
GAGGACGTCA GCTATATCAG GCCGGCTAGG CTGGACATCT ACGCGTCTGA CCCCAGCTCC
ACACACCCGA TATGTGCCTC GATGTATGCC GTTCAAGTGG TCAACCCAGT GCTTGTGTGG
TCGTGGTGGA ACATCGGCGG TGGGTATGTG TGGAACGGGA GAACTTGGGA CGTGGGGTAT
GTGGTCTACA GCGGACGCCA GTGGTACCTC ATGTCCTTTG CCCTATCTCC CTCCGGCATC
GCACAGTGGG CTGTGTACCA CTACAACTCG ACGGGTAGGC CTATGCGGTT GCTGGGGGTC
ACCACGAGAT CTGGCGTCAC GTGGCTACAG AACTTCTACA TAGTGCTGGG GAGCGCCATA
GTGGATAACC CGGGGAGCAC GTCGTCCTAC TGGACCGAGG CGGCGTACTA CGCCTACGTT
AGAGTCCGCC CTTGGGTGTA TCCGGAGCCC ACGGTCTCGC TCTCGGGGTT GGACACGCCG
CCTCTCGTGC AGCCGCAGCG CAACGACATA GTTGCCCTAA ACAGGAGCGA GGTTAGGCTT
AGGCTTGACT CGACCCTAGA CATGGCGGGG GCGTTCGTGC GCAGGATGAA CGCCACCTTG
TGGGTCAACG CCTCGGCGGC TGTTCAGAAA AGCGGCGGCG TCACCTTGAC CCACGAGCGG
ACTTTGTGGT ACCTCGTAAA CGTGAGCCAC AGCGAGCCCA GAGCTGCGCT CTCGTCGCAG
TTCTACCTGT ACTACGTGTT GGGAAACGCC GTGAGGAACT ACACAATCCA AAACGTCGCT
CTGTACGACT ACGGCGATAG GTGGGCTAGG TTCAACGTGA CTTTTACGGT GCCCCGCCGC
GCCCCCTACG CCGTCTTGAT CTCCGTGGCC GGGTCGGTGG TGGCGAAGTT GGCTGTTAGC
AACGCGGCGC CCCGTGTCTA CTACCTCAAC ATCCCGAACA ACGACGGCAC ATACACATAC
TACGTGATGA ACTACGGCAA CCTCACGGCC GTCTTCTACC TCCCCTGGGG CACCGTGACC
TCCTTTGACG ACAGCTGGAA CCCACAGGCG CTGGGCTACT CGGGCTACTT TGGGGCGCTG
TACGACGTGA CAAACGGCAG AGACAGGTGG CAGGTCCTAG CGATACCGCC AGGGGGACTC
GTGAAATTCA AGACGTCTAG CAGTGTCGAC CTCAAGGCCG CATCCCCAAG CTGGCAACAG
CCATACGTCC AAGACTTGTA TCCACAGTTG CTTCGTTGGT GTGAGCTGGA GAAGATATAT
AAGATATATA GGTTTTTGGT CCCAACCAAC ATAACAACCA GCTACTACAT ATTCACAATC
GACGGATATC TGCCGCGGAG CCTCAGCGGC ATCTACATAT TTGGTCCATT CACGCCCGGC
TGGGTCTCTG TACCCTACTA CATAGAGAGG GATCCTCGGG GCAACAAACT GCCGAGGATT
TGGGTTAGGG TAGACGCGCC TCCTGGGAAG CTTGACTACA GCGGCATGTT GGCCGTGCTT
TGTAGCCAGG GACCAGGCGA GAGTAGCAAA GATGTAGTGT TTGGAACCGG CTACTGGGGG
ACCGGCACGT CCTACGCCGA TATAAGCCAG CTGGCCAACC TTCTGACTTT CCCCGATGGC
TATACGGTAG ATATTAAGCC GCTTGTGCAG TCGAGCATAG TGGGTAGCTG GGGCTTAAGC
AACTCGACGT ATCTTCCGAG CGTGTGGCCT TGTAGCGCAG ATCAGACGCA TGTGGCTATT
TACTACGTCG GGGGGTCGCC GCCGTATTGG TGGCATTTCG ACGGGTTCTG TTTCGACAGG
CATATCTGGG AGAAGCAGGG CTACACGCCC TACCTTGCGA GCATTTCCAT TTCTAGGCAG
TTCGTGCTGT ATAGGATGTG GGATATGTCG TTTAGCTACG ACGGCATGTG GCGCCGGTCG
TACCCCAACA AGCCTGTTAA TTCAAACAGT CCCTACGTGG CGTACAGCTA TGTGGTTAGC
GATTTTGGGT TTGAGTGGCG TATTCGGCCT TTTGCGTGGC CCGAGCCCTA TGTGAGGGAG
TAG
 
Protein sequence
MRGQALVLVG LILVAALAVA MMAFYALQSS ASLAPSKPSY GYISRSWPDL VKLAGGYLTY 
VASQSVFALA RGSLDVGLYG RPYDGRWLQY NETARRARLG LLTMSAALAS LQVNATGGLW
YYIRGFNGSL GPYPLAATYS NWNLDVVRTG VAIPCGSDTL YIRLVPLNAK AARFVVQSQV
DVGVPYVLTY LGHTSDIVTV GAGYNEILSD MLRMGNPNGK LALYAFDSSV LGPNLCYKTD
DQIAQVVRSG IKPLPWYAEV FTCPFCMLHF QVPPGFLQKG RTSEVLITYT TSGSISAPQF
KVVISRRAGY SEIPIYVNAY TTSDPRAVFP AYFGARDIAT WQSAVVATDG DCYPAPQGGV
VDYGVAVTTN YLPPGTTISR SVRIYATYAP NTQRYGFNVF TSYSWPYSDV GYKVEALVLP
EDVSYIRPAR LDIYASDPSS THPICASMYA VQVVNPVLVW SWWNIGGGYV WNGRTWDVGY
VVYSGRQWYL MSFALSPSGI AQWAVYHYNS TGRPMRLLGV TTRSGVTWLQ NFYIVLGSAI
VDNPGSTSSY WTEAAYYAYV RVRPWVYPEP TVSLSGLDTP PLVQPQRNDI VALNRSEVRL
RLDSTLDMAG AFVRRMNATL WVNASAAVQK SGGVTLTHER TLWYLVNVSH SEPRAALSSQ
FYLYYVLGNA VRNYTIQNVA LYDYGDRWAR FNVTFTVPRR APYAVLISVA GSVVAKLAVS
NAAPRVYYLN IPNNDGTYTY YVMNYGNLTA VFYLPWGTVT SFDDSWNPQA LGYSGYFGAL
YDVTNGRDRW QVLAIPPGGL VKFKTSSSVD LKAASPSWQQ PYVQDLYPQL LRWCELEKIY
KIYRFLVPTN ITTSYYIFTI DGYLPRSLSG IYIFGPFTPG WVSVPYYIER DPRGNKLPRI
WVRVDAPPGK LDYSGMLAVL CSQGPGESSK DVVFGTGYWG TGTSYADISQ LANLLTFPDG
YTVDIKPLVQ SSIVGSWGLS NSTYLPSVWP CSADQTHVAI YYVGGSPPYW WHFDGFCFDR
HIWEKQGYTP YLASISISRQ FVLYRMWDMS FSYDGMWRRS YPNKPVNSNS PYVAYSYVVS
DFGFEWRIRP FAWPEPYVRE