Gene Tneu_0240 details


Gene Information

Locus tag: Tneu_0240
Symbol:
ID: 6166045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 212351
End bp: 214069
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 60%
IMG OID: 641667403
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001793639
Protein GI: 171184720
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
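
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
print(gene_length_bp(212351, 214069))  # 1719
print(protein_length_aa(1719))         # 572
```

Both results match the Gene Length and Protein Length fields in the record.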


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.393632
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTTC TTGCGAAAAG GGCTCTCTCG GTAAGATCCG CCTATGCGCC CCGCCTGGGG 
CCCCGCAGGG AGATCTACTA CATCAGCGAT ATTACCGGCG TTCCCCAGCT CTGGAGGTTT
GATGGACATG TCCACGACAT GGTGCTCCCC TGGGAGGAAA GGGTGTCGGA GTACCGGGTT
GCAGATGACG GCACTCTCGC GTTCACCAGC GACGTGGCGG GCGACGAAAG GTGGCGTCTC
TACGTGCTTG AGGGCGAGGA GGCGGCGCCC GTCTCCGCTG AGGGGGTTAA CAGCCTGGGG
GCTTGGTCGC CCGACTCGCA GAAGCTGGCT TTCACCTCCA CCAGGGATGA CCAGCAGAAC
TTCCACCTAT ACCTCTACGA CAGAGCTACG AGGTCTATCG CCAAGCTCGC CGAGATACCC
GGCATCAACG TCGTGGAGGA GTGGTCGGAG GTGGGTCTCT TCGTCACGCA CTATGAGACA
AACCTCGAAA GCGCCATACT GCTCTACAGA GGCGGGGAGG TGCGGGAGTT GACCAAGAGG
AGCCCGGACA CCATGAGCCT CTCCCCTAGA TACATCGGAG GGGGGAAGCT ACTCTATCTA
ACAAACGAGG GGTGGGAACA CATGGGGGTG GCCCAGATGG ATCTGACGAC CGGCTCCTGG
AAATACCTAA TTCAGCTGGA CAGAGACGTC GAGTTCTTCG ACGTGTGGGG GAGCTACCTC
GTCTTTTCTC TAAACGAGGA GGGGTCGTCG GGTCTCTACA TGATGCACAT GCCCTCGGGC
CTAACCCACA AGATAGCTAC GCCGAGGGGC GTCGTGACAT CTCTACAGTA CAGAGAGGGG
CTCATCCTTT TCTCCCTCTC CAGCATAAAC AGAGGCCATG AGGTCTACAT CCACCAGGGA
GGGGCACTGA GGCAACTCAC GAGATCCCCC AGGTTCGGCC TACAGCTGGA GTCCCTACCC
GACCCGGAGT CCGTGTGGTA CGTGAGCCAC GACGGTAGGA AGATCCAGGC TAACATCTAC
AAGCCGCCGG GCGCGGCTAG GGGAGTCGTC GTCTACCTCC ACGGAGGCCC CGAGAGCCAA
GATAGGCCGG AGCTCAAGCC GCTGGTGCTG GCCTTGCTCA TGTCGGGCTT CGTCGTGGCG
GCGCCTAACT ACAGAGGGAG CGCCGGCTTC GGAAAAAGCT TCCTCCGCCT CGACGACTTA
GACAAGAGGT GGGACGCGAT AAAGGACGTC GAAGCCTTCG CCAGGTGGCT CACCGCTGAG
GGCATCGCTA AGGCGAAGCC CTGCGTAATG GGGGGGTCCT ACGGCGGCTA CCTCACGTTG
ATGGCCCTCG CCACAGCCCC CGACCTCTGG GCATGCGGCG TGGAGATAGC CGGCATCTTC
AACCTGGTGA CGTTTCTGGA GAGGACCGCG CCTTGGAGGA GGAGGTACAG AGAGGCGGAA
TACGGATCTC TCGACAGACA TCGCGATCTC TTGCTCCAGC TGAGCCCAGC GACGTACGTG
GACAAAATCA CGGCCCCCCT CCTAGCCGTC CACGGCGCAA ACGACATACG CGTGCCCATC
CACGAAGCCG AGCAGCTGGC CAAGAGGCTG GGGGAGCTGG GGAGAGAGGT TAAACTCTTG
GTGCTCCCCG ACGAGGGCCA CGTCATTACA AAGGTGGAAA ACCGGGTCAA GGTCTACACG
GAGGTTTTGA AGTTCGTAGA ACGGCACCAG GTTTATTAA
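
A GC content figure like the one in the Gene Information table can be recomputed from a sequence laid out in space-separated blocks of ten bases, as above. The sketch below is one simple way to do it, assuming GC content is defined as the percentage of G and C bases over the total sequence length; `gc_percent` is a hypothetical helper name, and the short input is just the first two blocks of the gene sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence (9 of 20 bases are G or C).
print(gc_percent("ATGAGCCTTC TTGCGAAAAG"))  # 45.0
```

Passing the full gene sequence to the same function gives the value for the whole CDS.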
 
Protein sequence
MSLLAKRALS VRSAYAPRLG PRREIYYISD ITGVPQLWRF DGHVHDMVLP WEERVSEYRV 
ADDGTLAFTS DVAGDERWRL YVLEGEEAAP VSAEGVNSLG AWSPDSQKLA FTSTRDDQQN
FHLYLYDRAT RSIAKLAEIP GINVVEEWSE VGLFVTHYET NLESAILLYR GGEVRELTKR
SPDTMSLSPR YIGGGKLLYL TNEGWEHMGV AQMDLTTGSW KYLIQLDRDV EFFDVWGSYL
VFSLNEEGSS GLYMMHMPSG LTHKIATPRG VVTSLQYREG LILFSLSSIN RGHEVYIHQG
GALRQLTRSP RFGLQLESLP DPESVWYVSH DGRKIQANIY KPPGAARGVV VYLHGGPESQ
DRPELKPLVL ALLMSGFVVA APNYRGSAGF GKSFLRLDDL DKRWDAIKDV EAFARWLTAE
GIAKAKPCVM GGSYGGYLTL MALATAPDLW ACGVEIAGIF NLVTFLERTA PWRRRYREAE
YGSLDRHRDL LLQLSPATYV DKITAPLLAV HGANDIRVPI HEAEQLAKRL GELGREVKLL
VLPDEGHVIT KVENRVKVYT EVLKFVERHQ VY
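
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it builds the codon table from the amino acid string of NCBI translation table 11 (the table named in the Gene Information section), reads codons from the first base, and stops at the first stop codon. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest),
# paired with the amino acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1 up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first protein block.
print(translate("ATGAGCCTTC TTGCGAAAAG GGCTCTCTCG"))  # MSLLAKRALS
```

The sketch does not handle alternative start codons, which table 11 allows and which are read as methionine at the first position; this gene starts with ATG, so that case does not arise here.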