Gene Tneu_1937 details


Gene Information

Locus tag: Tneu_1937
Symbol:
ID: 6164706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1709033
End bp: 1710073
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 62%
IMG OID: 641669100
Product: peptidase M24
Protein accession: YP_001795298
Protein GI: 171186379
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
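
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions match the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1709033, 1710073)
print(length, protein_length_aa(length))  # 1041 346
```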


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.469872
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAACG TGGGCAAGCT CGTAAAAGCC TTCGCAGGCC GGTACAGATA CCTCATCTTA 
ACCAGGTCGC CAAATCTGGC ATACGCGGTG GGGATGCCCG ACGCCCTAGG CCTAGTCCTA
GATCTTGAGA CAGGCCACTC TACCCTCTAC GTATCTCGGC TGGACTACGC ACGTGCAAGC
GCCTTCGCCG AGGTTGACAA GGTTGTGGCG GTGGCCTCTG CCGAGATACC GCCCCGGAGA
CCGGGCGAGG AGCTCGTCGT CGCCTCTAGC CTATCAGACG TCCTAAGACA GGTCTCGCAG
GGGGGCGCCG CCGCGTCGGA TAACAAAGAG CTTGGGGTGG ACGTAAGCGC CGAGATCGCG
GAGCTTAGGG CGGCCAAGGA GGAGTGGGAG GTGGAGATGA TGAGGGAGGC GCTTAAAATC
GCCGAAGGCG CATACGTCAA ACTAGCCGAG CTGAGGCTCA TAGGCATGAG GGAGCGGGAC
GTGGCCGCCC TCATATATAA ATGGTTCCTA GAGGAGGGGG CAGACGGAGT CGCCTTCGAC
CCAATCGTCG CGTCGGGGCC AAACGGCGCG TATCCGCACT ACAGATTCGG CGACAGGAAG
ATAGCCTACG GCGACTACGT TGTGGTAGAC ATCGGCGCGA AGAGGGGGGT CTACTGCTCA
GACATCACCA GGACCTTGGC GGTGGGGCAG GGAGGCGCGT TGAGAGATGC CGTGTACGCC
GTATATGAGG CCGTCAAAGC CGCTGAAAAA GTTGCGAGGG AGGGGGCGGC TGCCGCCGAG
GTGGACAAGG CGGCGCGGGA CGTCATCGCC GAGTACGGCT TCGGCCAATA CTTCATACAC
TCGACGGGAC ACGGCGTCGG GGTGGAGGTG CACGAGCCGC CTAGGCTATA CGCGGCCTCG
CGAGATGTGC TCAAGAGGGG GCACGTGGTG ACGATAGAGC CGGGCGTCTA CATAGAGGGG
GTGGGGGGCG TGAGGATAGA GGACATGGTA TACATCAACG GCGGAGCCGC GGTCCTAAAC
AGGCTACCCC ACATCCTCTA G
 
Protein sequence
MNNVGKLVKA FAGRYRYLIL TRSPNLAYAV GMPDALGLVL DLETGHSTLY VSRLDYARAS 
AFAEVDKVVA VASAEIPPRR PGEELVVASS LSDVLRQVSQ GGAAASDNKE LGVDVSAEIA
ELRAAKEEWE VEMMREALKI AEGAYVKLAE LRLIGMRERD VAALIYKWFL EEGADGVAFD
PIVASGPNGA YPHYRFGDRK IAYGDYVVVD IGAKRGVYCS DITRTLAVGQ GGALRDAVYA
VYEAVKAAEK VAREGAAAAE VDKAARDVIA EYGFGQYFIH STGHGVGVEV HEPPRLYAAS
RDVLKRGHVV TIEPGVYIEG VGGVRIEDMV YINGGAAVLN RLPHIL
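
The gene and protein sequences above are printed in blocks of ten characters, so they need whitespace stripped before use. The following is a minimal sketch, not the database's own method, of how the listing could be cleaned, its GC percentage computed, and the gene translated for comparison with the protein sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons).

```python
# Standard codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAACAACG TGGGCAAGCT CGTAAAAGCC"))  # MNNVGKLVKA
```

Passing the full gene sequence to `translate` and the cleaned protein listing to a string comparison would check the two listings against each other; `gc_percent` on the full gene can be compared with the GC content field. Ambiguous bases such as N are not handled and would raise a `KeyError`.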