Gene Tneu_0873 details

Gene Information

Locus tag: Tneu_0873
Symbol:
ID: 6164525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 778466
End bp: 780118
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 64%
IMG OID: 641668029
Product: peptidase S16 lon domain-containing protein
Protein accession: YP_001794256
Protein GI: 171185337
COG category: [R] General function prediction only
COG ID: [COG1750] Archaeal serine proteases
TIGRFAM ID:
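
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(778466, 780118)
print(length)                     # 1653
print(protein_length_aa(length))  # 550
```

Under these assumptions the computed values agree with the Gene Length (1653 bp) and Protein Length (550 aa) fields in the record.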


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTGAGGA AGGCTCTGGC GTTGATCCTC CTAGCCGCCC TGGCGCTGGC CGCGTGGCAG 
ACCTACACGG TGAAGGTGGC CTCGGCGGAG ATAAACGCGC TGGCCGTTGG GCCCTCCGGA
GGCGCCGTCT TGCCCATCAA GATCACCCTC ATCACGCCGG GGGACGGGAG GGCGTACGTG
GCAGGGGTCC CAGAGGCTGG CCAGGGCTTC GGCCCCTCGG CGCAGATTGC GCTCTACGTC
GCGTCTAGGT ACTCGGGCAA GCCCTACACG AACTACACCG CCCTGCTGAG GGTGTTGGCC
AGCGACGCCC AGGTGGGAGG CCCCTCGGCC AGCGGCTACA TAACAGTGGC CATGTACGCC
CTCATGAACG GCCTGGAGCT TAGAAACGAC ACAGCGATGA CGGGTATAAT ACTGCCCGAC
GGGCTTATAG GGCCGGTGGG CGGGGTCTCC CAGAAGGTGT CGGCGGCCGC GGAGAGGGGG
ATAAAGACCG TGTTGGTCCC CATTGGCGAG AAGCCGAGCG CCGTGCGCGG CGTTAAGGTG
GTGGAGGTGG GCACGGTGGA AGACGCCATC TACTACCTCA CTGGGCATAG GGTCGAGACG
CCGCGGCCTT CGGCGGTGGA CGACACGGCG TTTAGGGAAA TATCGCGGGA CCTCTTCAAC
GCCGTCTACA GCTACTACAA CGCGACGGTG GGGAGGGGGT ACGTAGACGA GGCCTTGATC
GATAGGCTGA AGAGCCGGGG CGACTACTAC GCCGCCGCCT CGTTGATATA CCAGGGGATC
GTGAGGTACT ACAGCGACCA AGCCTCCTCG TCTAGGAGAG CCGCCAGGGA GCTCTACGAC
AAGGCCCTCC AGCTGGCGAA GGAGGCCGAG GCGGAGCTCT CCAAGATCCC CACGACCGTC
AACAACCTAG ACCTCGTGGT GGCGGCCTAC ACCAGGGTAT ACGAGGTCTA TCTCCAGGCC
AACTCCACCT CGGCCAACCC AGGCGCTATG TACGCCCGGG CCGTCACCCT CAAGCCTTGG
GTCGACGAGG CGAGGAGGAT GGCCTACGGC CCCGCCGTAA ACGAGAGCAA GCTGGCGGAG
ATCGCGAGGA TGTACCTAGA CTACGCCAAG GCCATGTACG CCTATCTGGA GACCACCTCT
GACGTTCCCC TAGGCGATTA CTCCACAGCA GTGCAGCTGG CGGAGGACCT CTACGGGAGA
GGTCTCTACC TGGCCTCCAT TGCCAACTCC ATAGAGATAA TCGCAGAGTC CGCCGCATCT
CTAATGTCGG CTGCCCCCGA GAAGTACCTG GAGGTGGCCA GGGAGAGGGC TCTCACCAAC
ATGGCCCGCG CCGCCCAGTG CGGCTACACG AACACGTTGC CGCTGAGCTA TCTACAGTTC
GGCGACTACT ACAGCCAGCA GCCCGACGGC GTCAAGTACG CGTTGATGTA CTACATCACA
GCCTCCGTCT ACTCGACGGC GATGGGCGAC GCGGCTTGCT TCGCGAAAAG CGGCGTGGTC
TACCAAAAGC CAAGCTTCGC CCCGCCTGAG CCGGCGGCGC CGGCGGCTCA TACCGCCGCC
GTTGCTAGGC AGGAGGGGGG AAAAAGCCTG TGGTTGCCGC TTGTCCTCGC ACTACTGGCG
GCCCTCGCCC TGGTCTACAG CGCTAGACGG TAA
 
Protein sequence
MVRKALALIL LAALALAAWQ TYTVKVASAE INALAVGPSG GAVLPIKITL ITPGDGRAYV 
AGVPEAGQGF GPSAQIALYV ASRYSGKPYT NYTALLRVLA SDAQVGGPSA SGYITVAMYA
LMNGLELRND TAMTGIILPD GLIGPVGGVS QKVSAAAERG IKTVLVPIGE KPSAVRGVKV
VEVGTVEDAI YYLTGHRVET PRPSAVDDTA FREISRDLFN AVYSYYNATV GRGYVDEALI
DRLKSRGDYY AAASLIYQGI VRYYSDQASS SRRAARELYD KALQLAKEAE AELSKIPTTV
NNLDLVVAAY TRVYEVYLQA NSTSANPGAM YARAVTLKPW VDEARRMAYG PAVNESKLAE
IARMYLDYAK AMYAYLETTS DVPLGDYSTA VQLAEDLYGR GLYLASIANS IEIIAESAAS
LMSAAPEKYL EVARERALTN MARAAQCGYT NTLPLSYLQF GDYYSQQPDG VKYALMYYIT
ASVYSTAMGD AACFAKSGVV YQKPSFAPPE PAAPAAHTAA VARQEGGKSL WLPLVLALLA
ALALVYSARR
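
The gene and protein sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to join such blocks, compute GC content, and translate a CDS so that the two sequences can be compared. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Assumption: table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(seq_text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First display row of the gene sequence from the record above.
first_row = "ATGGTGAGGA AGGCTCTGGC GTTGATCCTC CTAGCCGCCC TGGCGCTGGC CGCGTGGCAG"
dna = clean(first_row)
print(len(dna))        # 60
print(translate(dna))  # MVRKALALILLAALALAAWQ
```

The translated prefix matches the first 20 residues of the protein sequence shown above. Applying `gc_content` to the full cleaned gene sequence would give a value to compare against the GC content field.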