Gene Tneu_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1199 
Symbol 
ID6165281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1084254 
End bp1085264 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content68% 
IMG OID641668348 
Productglycoprotease family metalloendopeptidase 
Protein accessionYP_001794573 
Protein GI171185654 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.834375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGTCC TCGGCGTTGA GTCGACCGCC CACACCTTCA GCATAGGGGT CGTTAAAGAC 
GGCGTGGTGC TCGGCCAACT GGGGAAGACC TACATCCCGC CGGGGGGCGG GGGGATACAC
CCCCGCGAGG CGGCTGAGCA CCACGCCAGG GTGGCCCCCT CCATACTCCG CCAGCTCCTG
GGCCAGCTGG GGGTGGGGCT GTCGGACATC GGCGCGGTGG CCTACGCCGC CGGCCCAGGT
CTAGGCCCCG CCCTCAGGGT GGGGGCCGTA CTGGCGAGGG CCTTGGCCAT TAGGCTGGGC
GTGCCGGTTG TGCCCGTGCA CCACGGCGTG GCGCACATCG AGGTGGCCCG CTACGCCACC
GGCGCGTGCG ACCCGCTGGT GGTTCTGATC TCCGGCGGCC ACACGGTGGT GGCGGGGTAC
TCCGATGGGC GCTATAGGGT TTTCGGCGAA ACCCTCGACG TGGCTATCGG AAACGCCATT
GACATGTTTG CGAGGGAGGT GGGGCTGGGC TTCCCGGGGG TGCCGGCGGT GGAGAAATGC
GCCGAGTCCG CGGAGACGGT GGTGCCCTTC CCCATGCCGA TAGTTGGGCA GGACCTCTCC
TATGCGGGGC TCGCCACCCA CGCGCTTCAG CTCGTGAAGA GGGGGGTCCC CCTCCCCGTG
GTCTGCAGAT CGCTTGTGGA AACCGCCTAC TACATGCTTG CGGAGGTGGT GGAGAGGGCG
CTGGCCTATA CGAGGAAGAG GGAGGTGGTG GTGGCGGGGG GCGTCGCGAG GAGCAGGCGG
CTGAAGGAGA TCCTGCGGGC CGTGGGCGAG GAGCACGGCG CCGTTGTGAA GGTTGTCCCC
GACGAATATG CGGGCGACAA CGGGGCCATG ATAGCCCTCA CCGGCTACTA CGCCTATAGA
CGCGGCGTAT ACACCACGCC GGAGGGCAGC TTCGTGAGGC AGAGGTGGAG GCTAGACAGC
GTGGACGTGC CCTGGTTCCG CGACCTCTGC CCGGTCACAA CGTATATATA G
 
Protein sequence
MLVLGVESTA HTFSIGVVKD GVVLGQLGKT YIPPGGGGIH PREAAEHHAR VAPSILRQLL 
GQLGVGLSDI GAVAYAAGPG LGPALRVGAV LARALAIRLG VPVVPVHHGV AHIEVARYAT
GACDPLVVLI SGGHTVVAGY SDGRYRVFGE TLDVAIGNAI DMFAREVGLG FPGVPAVEKC
AESAETVVPF PMPIVGQDLS YAGLATHALQ LVKRGVPLPV VCRSLVETAY YMLAEVVERA
LAYTRKREVV VAGGVARSRR LKEILRAVGE EHGAVVKVVP DEYAGDNGAM IALTGYYAYR
RGVYTTPEGS FVRQRWRLDS VDVPWFRDLC PVTTYI