Gene Tneu_1820 details


Gene Information

Locus tag: Tneu_1820
Symbol:
ID: 6164604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1599294
End bp: 1600454
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 63%
IMG OID: 641668983
Product: metallophosphoesterase
Protein accession: YP_001795183
Protein GI: 171186264
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID:
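
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record: Start bp 1599294, End bp 1600454.
length = gene_length_bp(1599294, 1600454)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.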


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00514924
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTCC TACACATCTC GGATGCGCAC CTCGGCAGAG CCCAGTACCA CCTCCCGGAG 
AGGGAGGAGG ACTACTTCAG AGCGTTTGAG GAGGCGCTGA GGAGGGGGAG GGGGGCAGAC
GCCGTCTTGA TAACGGGCGA CCTCTTCGAC CTCAAGAGGC CCTCCACGAG GGCGCTCGTG
AAGTTTGTGG AGGCCGTGGA GGCGGCGGGC GCGCCTGTCT ACCTAATCGG GGGAAACCAC
GACTTCAGCT ACGTCCGGTA TAGGGCTGAG GCGGAGAGGT GTCCGCGGCC GGCGGAGTGC
CTCTACGACA CGGCGCTGAG GCTTCTAGAT AGGCTGAGGC TGGCGAAGCT CCTCTGTTGG
GAGTCCGTAG ACGCCGGGGG CGTTCACATA TTTGGGGCAT GCGCAACGCC TAGGGACTAC
GCGGCGGAGT ACCGCCGGGC TCTCCAGAGG ATGCCGCCGG GGGCTGTCCT CGCCATACAT
CAGGCGGTGG AGGGGGTCAA GGCCAGGTAC CCCGCCGAAG ACGACGAATA CACCATGCCC
CAGGAGGTTT TCCAAGGCCT GCCGTATGTA CACATCGCCG CTGGCCACAT ACACGACCAT
CTGGCGAGGC ATCCGGTAGG CGCCGTGTGG GCGGGGTCGC TTGAGGTCTG GGACGTCGGG
GAGTTCGAGA CCTGGGACTA CAGAGGGGGC TTTGAGAAGG CGCAGGACAG AGCTGAGAAA
GGCGCTGTCT TAATAGACGT AGCCGGTAGG GCGGTCTCCC TCAGAGCTAT CCCCATCCCC
CCTGGGAGGC CTCTGTACAG GGTCAGGCTC TATGTCAGGG AGAGGAGGGA GGCCTACGGG
GCTGCGGAGG AGGCGGCGAA GCTTTTTGAC AAACCGGGGG CGGTGGTTAG AGTCGAGGTG
TGGGGTACGT TGGAGGAGGC TCTAAGGCCT AGGCAGATGG CTACCTTGTT TACAAAAGCC
CTCTACGTCG ACGTTGTTGA CAGAACCGCC GCCCCGCAGA GGGCCGTGTC TCTAAGGGGG
TCCGCCATGG AGGAGCTGTG GCGGCTGATG AGGGAGAAGC TTGGGCAACA CGCCGAGGTG
GTGCTCAGGG CTATGGAGCT CCTTAGAGAG GGGGAGAAGG AGGCGGCGTA CAAGCTCATC
CTCAAGGCGC TTTATGATTA G
 
Protein sequence
MKLLHISDAH LGRAQYHLPE REEDYFRAFE EALRRGRGAD AVLITGDLFD LKRPSTRALV 
KFVEAVEAAG APVYLIGGNH DFSYVRYRAE AERCPRPAEC LYDTALRLLD RLRLAKLLCW
ESVDAGGVHI FGACATPRDY AAEYRRALQR MPPGAVLAIH QAVEGVKARY PAEDDEYTMP
QEVFQGLPYV HIAAGHIHDH LARHPVGAVW AGSLEVWDVG EFETWDYRGG FEKAQDRAEK
GAVLIDVAGR AVSLRAIPIP PGRPLYRVRL YVRERREAYG AAEEAAKLFD KPGAVVRVEV
WGTLEEALRP RQMATLFTKA LYVDVVDRTA APQRAVSLRG SAMEELWRLM REKLGQHAEV
VLRAMELLRE GEKEAAYKLI LKALYD
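
The relationship between the gene sequence, the protein sequence and the GC content field can be illustrated with a minimal sketch. It is not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in which codons may act as start codons), and it assumes the input contains only A, C, G and T. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used to group bases, and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError for codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGAAGCTCC TACACATCTC GGATGCGCAC CTCGGCAGAG CCCAGTACCA CCTCCCGGAG"
print(translate(first_row))  # MKLLHISDAHLGRAQYHLPE
print(round(gc_content(first_row), 1))
```

The translated row matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives the figure to compare against the GC content field.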