Gene Tneu_1921 details


Gene Information

Locus tag: Tneu_1921
Symbol:
ID: 6164869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1693149
End bp: 1694330
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 54%
IMG OID: 641669084
Product: proteinase inhibitor I4 serpin
Protein accession: YP_001795282
Protein GI: 171186363
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4826] Serine protease inhibitor
TIGRFAM ID:
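
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1693149, 1694330)
print(length)                     # 1182
print(protein_length_aa(length))  # 393
```

Both results agree with the Gene Length and Protein Length fields of this record.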


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.705743
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTGT GGGCTCTGCT ACCGGTTACG TTGGCTTTGG TAGTGTTGCT GATCTTCATC 
GTGTCTCTTA AGGAGGCGTC CACGCACTCT CCTCCCACGT CTCACACATC TGCTCCTGCG
CCTCTGGAGA GCGGTAGCTA TGGCGACTTC GCCGTCAGTT TCTATAAGAG AATAGCCTTG
GAGAGACTTC GCGAAAACCT TGTCCTCTCG CCGTATTCCG TGTATAAAGC CTTTGCCATG
GCCTACGCGG GCGCGGCTGG GGAGACTAGG GAGGAGATCG GGAGGGTGTT CGGCTTCGGC
GACGACGTCT GCGCCTTGGC TCAGGCTGGG CGGGGGGTCG AGGAGGCGGT GGCCGCGTGG
GTGCAACTGG GATTCCCGCT GAGGGAGGAG TACGAGCGGG AGCTGTCTTG TCTAGGGGCG
GAGCTGAGGC GGGTGGATTT CGCGGAGCGC TCCGCGCTGG TCGAGATAAA CAGATGGATA
GAGGAAAAGA CCAGGGGATA CGTAAAGGAC TTGATCCCTA CTGACTACCC GCGTAGCCAG
GATATACGCG TTGTATTGAC TTCAGCTCTG TTTTTCAACG GTAGTTGGTG GCCGCTCCAG
TTTGGGCGTA TTGGGAAGAG AGAATTCCAG GGGGTAGGCC CGGTGGAGTT TATGAAGCTT
GACCTTGGGT CTTGCGTCCC CTCGCTCAGG GGGCGCGTCT CTGCCGATCT CACGGTGGTG
GAGCTCCGGT TTGAAAATAC AGATGTAGCT ATGTACGTCA TAATGCCAAA GTCGCTTGAG
GACTATGTAA AAGGCTTGAC CTATGAGAAG TTGAAGAGAG ACATCACCGA ATTGCCGGAC
GAGATCGTCG CCGTAACGAT GCCTCTATTT ACCGCAGAGT TTAAAGACTC TCTCAAAAAG
GTATTAATCG ACATAGGCAT ACGCTCAGCA TTTGACAAAA CCAGAGCAAA CTTCACCCGG
ATGTCGCCTA TCAGGATTTA TATAGACGAA GTTTTCCACG GCGCGTACAT AAAAGCTGAT
GAAAAAGGTG TTGTAGCAAC AGCCGCCACT GCCGTCGTAT TTGTGCCCGT TTGTGCTAAG
GTTGGAGGCA TGGAGGTGGT AATAGACAGG CCCTTCCTCT TCGTCTTGGC GGATCGGAGA
GACGGCACGA TCTACTTCAT AGGGCATGTG GTTAAGCCCT AG
 
Protein sequence
MRLWALLPVT LALVVLLIFI VSLKEASTHS PPTSHTSAPA PLESGSYGDF AVSFYKRIAL 
ERLRENLVLS PYSVYKAFAM AYAGAAGETR EEIGRVFGFG DDVCALAQAG RGVEEAVAAW
VQLGFPLREE YERELSCLGA ELRRVDFAER SALVEINRWI EEKTRGYVKD LIPTDYPRSQ
DIRVVLTSAL FFNGSWWPLQ FGRIGKREFQ GVGPVEFMKL DLGSCVPSLR GRVSADLTVV
ELRFENTDVA MYVIMPKSLE DYVKGLTYEK LKRDITELPD EIVAVTMPLF TAEFKDSLKK
VLIDIGIRSA FDKTRANFTR MSPIRIYIDE VFHGAYIKAD EKGVVATAAT AVVFVPVCAK
VGGMEVVIDR PFLFVLADRR DGTIYFIGHV VKP
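
The sequences above are printed in blocks of ten characters. A minimal Python sketch for working with them follows: it strips the spacing, computes GC content, and translates codons. It assumes the amino acid assignments of translation table 11 (identical to the standard genetic code; only the permitted start codons differ), and all function names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for codons enumerated in TCAG order (tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above, as formatted in the record.
first_line = "ATGAGGCTGT GGGCTCTGCT ACCGGTTACG TTGGCTTTGG TAGTGTTGCT GATCTTCATC"
print(translate(clean_sequence(first_line)))  # MRLWALLPVTLALVVLLIFI
```

The printed residues match the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives the fraction that the record reports, rounded, in its GC content field.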