Gene Information |
Locus tag | Tneu_0115 |
Symbol | |
ID | 6165905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 100230 |
End bp | 103532 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641667282 |
Product | hypothetical protein |
Protein accession | YP_001793519 |
Protein GI | 171184600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
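The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for Tneu_0115.
length_bp = gene_length(100230, 103532)
print(length_bp)                  # 3303
print(protein_length(length_bp))  # 1100
```

Under these assumptions the computed values agree with the Gene Length (3303 bp) and Protein Length (1100 aa) fields listed in the table.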
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0316436 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGGCC AGGCGCTGGT TCTGGTTGGT CTTATTCTCG TCGCGGCTTT GGCCGTGGCC ATGATGGCGT TTTACGCGTT GCAGAGCTCC GCCTCCCTGG CGCCTTCCAA GCCGTCCTAC GGCTATATAT CGAGGTCTTG GCCCGACCTA GTGAAGCTGG CCGGCGGCTA TCTGACCTAT GTGGCGTCCC AGAGCGTCTT CGCCTTGGCT AGGGGCTCCC TAGACGTGGG CTTATACGGG AGGCCGTACG ACGGTAGGTG GCTTCAGTAC AACGAAACGG CTCGGCGGGC TCGGCTGGGC TTGTTGACCA TGTCCGCGGC CCTCGCCTCT TTGCAGGTGA ACGCCACGGG CGGTCTCTGG TACTACATAC GGGGCTTCAA CGGGTCGCTG GGGCCCTACC CGCTCGCCGC CACCTACAGC AACTGGAACC TCGATGTGGT GAGAACGGGC GTTGCGATAC CTTGTGGCTC GGACACGCTG TACATCCGGC TGGTTCCGCT GAACGCCAAG GCGGCGCGGT TCGTTGTCCA GTCGCAGGTG GACGTAGGCG TGCCCTACGT CTTGACGTAT CTGGGACACA CCAGCGACAT AGTGACTGTG GGCGCCGGCT ATAACGAGAT CCTCTCGGAC ATGTTGCGCA TGGGGAACCC AAACGGCAAG CTGGCCCTAT ACGCCTTCGA CTCCTCCGTC CTAGGGCCGA ACCTCTGTTA CAAGACCGAC GACCAGATAG CCCAGGTCGT CCGAAGCGGC ATAAAGCCGT TGCCGTGGTA CGCCGAGGTG TTCACGTGCC CCTTCTGCAT GTTGCACTTC CAGGTTCCGC CCGGTTTTCT GCAGAAGGGG AGGACCTCGG AGGTTCTGAT CACATATACG ACGAGCGGCT CCATATCGGC GCCGCAGTTT AAGGTGGTTA TTTCCAGGCG GGCCGGCTAC AGCGAAATCC CGATATATGT AAACGCGTAC ACCACAAGCG ACCCGCGCGC CGTCTTTCCG GCCTACTTCG GCGCACGAGA CATCGCTACG TGGCAAAGTG CCGTTGTGGC CACAGACGGG GACTGCTATC CGGCGCCTCA AGGCGGCGTC GTGGACTACG GCGTGGCTGT GACGACGAAC TATCTGCCCC CCGGCACAAC GATTAGCCGG AGCGTAAGGA TATACGCCAC CTACGCGCCT AACACCCAGA GGTACGGCTT CAACGTGTTT ACATCATACA GCTGGCCCTA CTCCGACGTC GGCTACAAGG TCGAGGCGCT GGTTCTGCCC GAGGACGTCA GCTATATCAG GCCGGCTAGG CTGGACATCT ACGCGTCTGA CCCCAGCTCC ACACACCCGA TATGTGCCTC GATGTATGCC GTTCAAGTGG TCAACCCAGT GCTTGTGTGG TCGTGGTGGA ACATCGGCGG TGGGTATGTG TGGAACGGGA GAACTTGGGA CGTGGGGTAT GTGGTCTACA GCGGACGCCA GTGGTACCTC ATGTCCTTTG CCCTATCTCC CTCCGGCATC GCACAGTGGG CTGTGTACCA CTACAACTCG ACGGGTAGGC CTATGCGGTT GCTGGGGGTC ACCACGAGAT CTGGCGTCAC GTGGCTACAG AACTTCTACA TAGTGCTGGG GAGCGCCATA GTGGATAACC CGGGGAGCAC GTCGTCCTAC TGGACCGAGG CGGCGTACTA CGCCTACGTT AGAGTCCGCC CTTGGGTGTA TCCGGAGCCC ACGGTCTCGC TCTCGGGGTT GGACACGCCG CCTCTCGTGC AGCCGCAGCG CAACGACATA GTTGCCCTAA ACAGGAGCGA GGTTAGGCTT 
AGGCTTGACT CGACCCTAGA CATGGCGGGG GCGTTCGTGC GCAGGATGAA CGCCACCTTG TGGGTCAACG CCTCGGCGGC TGTTCAGAAA AGCGGCGGCG TCACCTTGAC CCACGAGCGG ACTTTGTGGT ACCTCGTAAA CGTGAGCCAC AGCGAGCCCA GAGCTGCGCT CTCGTCGCAG TTCTACCTGT ACTACGTGTT GGGAAACGCC GTGAGGAACT ACACAATCCA AAACGTCGCT CTGTACGACT ACGGCGATAG GTGGGCTAGG TTCAACGTGA CTTTTACGGT GCCCCGCCGC GCCCCCTACG CCGTCTTGAT CTCCGTGGCC GGGTCGGTGG TGGCGAAGTT GGCTGTTAGC AACGCGGCGC CCCGTGTCTA CTACCTCAAC ATCCCGAACA ACGACGGCAC ATACACATAC TACGTGATGA ACTACGGCAA CCTCACGGCC GTCTTCTACC TCCCCTGGGG CACCGTGACC TCCTTTGACG ACAGCTGGAA CCCACAGGCG CTGGGCTACT CGGGCTACTT TGGGGCGCTG TACGACGTGA CAAACGGCAG AGACAGGTGG CAGGTCCTAG CGATACCGCC AGGGGGACTC GTGAAATTCA AGACGTCTAG CAGTGTCGAC CTCAAGGCCG CATCCCCAAG CTGGCAACAG CCATACGTCC AAGACTTGTA TCCACAGTTG CTTCGTTGGT GTGAGCTGGA GAAGATATAT AAGATATATA GGTTTTTGGT CCCAACCAAC ATAACAACCA GCTACTACAT ATTCACAATC GACGGATATC TGCCGCGGAG CCTCAGCGGC ATCTACATAT TTGGTCCATT CACGCCCGGC TGGGTCTCTG TACCCTACTA CATAGAGAGG GATCCTCGGG GCAACAAACT GCCGAGGATT TGGGTTAGGG TAGACGCGCC TCCTGGGAAG CTTGACTACA GCGGCATGTT GGCCGTGCTT TGTAGCCAGG GACCAGGCGA GAGTAGCAAA GATGTAGTGT TTGGAACCGG CTACTGGGGG ACCGGCACGT CCTACGCCGA TATAAGCCAG CTGGCCAACC TTCTGACTTT CCCCGATGGC TATACGGTAG ATATTAAGCC GCTTGTGCAG TCGAGCATAG TGGGTAGCTG GGGCTTAAGC AACTCGACGT ATCTTCCGAG CGTGTGGCCT TGTAGCGCAG ATCAGACGCA TGTGGCTATT TACTACGTCG GGGGGTCGCC GCCGTATTGG TGGCATTTCG ACGGGTTCTG TTTCGACAGG CATATCTGGG AGAAGCAGGG CTACACGCCC TACCTTGCGA GCATTTCCAT TTCTAGGCAG TTCGTGCTGT ATAGGATGTG GGATATGTCG TTTAGCTACG ACGGCATGTG GCGCCGGTCG TACCCCAACA AGCCTGTTAA TTCAAACAGT CCCTACGTGG CGTACAGCTA TGTGGTTAGC GATTTTGGGT TTGAGTGGCG TATTCGGCCT TTTGCGTGGC CCGAGCCCTA TGTGAGGGAG TAG
|
Protein sequence | MRGQALVLVG LILVAALAVA MMAFYALQSS ASLAPSKPSY GYISRSWPDL VKLAGGYLTY VASQSVFALA RGSLDVGLYG RPYDGRWLQY NETARRARLG LLTMSAALAS LQVNATGGLW YYIRGFNGSL GPYPLAATYS NWNLDVVRTG VAIPCGSDTL YIRLVPLNAK AARFVVQSQV DVGVPYVLTY LGHTSDIVTV GAGYNEILSD MLRMGNPNGK LALYAFDSSV LGPNLCYKTD DQIAQVVRSG IKPLPWYAEV FTCPFCMLHF QVPPGFLQKG RTSEVLITYT TSGSISAPQF KVVISRRAGY SEIPIYVNAY TTSDPRAVFP AYFGARDIAT WQSAVVATDG DCYPAPQGGV VDYGVAVTTN YLPPGTTISR SVRIYATYAP NTQRYGFNVF TSYSWPYSDV GYKVEALVLP EDVSYIRPAR LDIYASDPSS THPICASMYA VQVVNPVLVW SWWNIGGGYV WNGRTWDVGY VVYSGRQWYL MSFALSPSGI AQWAVYHYNS TGRPMRLLGV TTRSGVTWLQ NFYIVLGSAI VDNPGSTSSY WTEAAYYAYV RVRPWVYPEP TVSLSGLDTP PLVQPQRNDI VALNRSEVRL RLDSTLDMAG AFVRRMNATL WVNASAAVQK SGGVTLTHER TLWYLVNVSH SEPRAALSSQ FYLYYVLGNA VRNYTIQNVA LYDYGDRWAR FNVTFTVPRR APYAVLISVA GSVVAKLAVS NAAPRVYYLN IPNNDGTYTY YVMNYGNLTA VFYLPWGTVT SFDDSWNPQA LGYSGYFGAL YDVTNGRDRW QVLAIPPGGL VKFKTSSSVD LKAASPSWQQ PYVQDLYPQL LRWCELEKIY KIYRFLVPTN ITTSYYIFTI DGYLPRSLSG IYIFGPFTPG WVSVPYYIER DPRGNKLPRI WVRVDAPPGK LDYSGMLAVL CSQGPGESSK DVVFGTGYWG TGTSYADISQ LANLLTFPDG YTVDIKPLVQ SSIVGSWGLS NSTYLPSVWP CSADQTHVAI YYVGGSPPYW WHFDGFCFDR HIWEKQGYTP YLASISISRQ FVLYRMWDMS FSYDGMWRRS YPNKPVNSNS PYVAYSYVVS DFGFEWRIRP FAWPEPYVRE
|
| |
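The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in their permitted start codons). It strips the whitespace that separates the 10-base blocks in the record. The helper names are hypothetical, and only short excerpts from the start and end of the gene are used here rather than the full sequence.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third position
# varying fastest. The amino acid string is the one for the standard code;
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as blocked in the record.
print(translate("ATGAGAGGCC AGGCGCTGGT TCTGGTTGGT"))  # MRGQALVLVG

# Last five codons of the gene sequence, including the stop codon TAG.
print(translate("TATGTGAGGGAGTAG"))  # YVRE
```

Both excerpts translate to the corresponding ends of the listed protein sequence (MRGQALVLVG at the N-terminus, YVRE at the C-terminus).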