Gene Tneu_1005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1005 
Symbol 
ID6165080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp895986 
End bp897881 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content61% 
IMG OID641668158 
Productmetallophosphoesterase 
Protein accessionYP_001794383 
Protein GI171185464 
COG category[R] General function prediction only 
COG ID[COG1409] Predicted phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000460676 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000515762 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGCAGC TGTACCTCCT CCTCGCCGCG GCCCTCGCCC TAGCCGCCTC CATCGTCGAC 
CCCAGGTGGG CCGTCCCCGC CTACGTAACC CCCGGCGGCG CCTTCAACAT CACGCTGGAT
GCGCCAACCG CTGTGAAAAG CGTAGCCCTA GCCGCCCCCG GCGTGGAGAA GCCGCTGGAG
CTCAGCTTCA ACGCCTCTGG AGCCGTCATC ACGGCTCGCG TCCCGGCGGA CGTCCCCCCC
GGCCTCTACG ACCTCGTAAT CAACGGCGGT GAGATATACG AGCCCAAGGC CGTCTGGGTC
GGCAACGTGA CGGGGCCCCT TAGGATAATC CAGCTCACCG ACATACACGT GGGAGTTGAG
CTAGACATGG CCTCCATATA CCGCCTAACC CACGCCGCTC TCTACGCCAG CTCCAGCCCC
TACGACGTCG TGTTCCTCAC CGGCGACCTC GCAGACGTGG GAGGCCAGCC CTGGCAGTAC
GCCCTGCTGG TCAGATACAC CTCCACAATT ACAAAGCCCA TCTTCGCCGT GCCGGGGAAC
CACGACCACG CCGGCGACGA CCCCCTCAAT AACTACAGGA GGTATGTGGG GCCCCCCTAC
TGGTACAGAG TCGTCGGGCC CTACCTAATA ATCGGCCTAG ACAGCGGCCA CGACGGCTAT
CTAACAGAGG AGCAGGTCAA GTTCTACGGA GACGTCCTGA GGCGCTACCC AGACAAGGTG
AAGATAGTCC TAATACACCA CCCGCCCTTC TACATACGCG ACGCCTACGT CGCCGAGATC
TACCGCGGCC CCCAGGACAT AGACAGACTC AGCAGAGACC CCACCGGGAG GAGGAACTAC
TACATCGTCT ACACCAGCTA CCTCTACAAC CGCCCCACCT ACGAGAGGTT CCTCAACCTA
ACGATTAGGT ACAGAGTAGC CCTGGTGATG GCCGGCCACG TACACCCCGG CAACTCCACC
GTGGTCATAA ACGGCACCTA CTTCGTCACC ACCAGGACGT TGGGAGGGTC TGTGGACACC
TCCCACGGCT TTAGAACCTA CGTCGTCTAC CCAGACGGCC GCGTCCAGAT AAACCCAGAA
ACCCTCACCT ACAAAAACTA CGCCGTCGTG GCCCAAGGCG CCAAAGCCGC CCAGATATAC
GCCGACAGCG ACCTCCTCCC CGGCACAATA GCCATAGACC TACCCGGCCA GTACCAAGGC
CTCAAAGCCC TAAACGGCAC AGCCCAGCTA GTAGAAGCGA AGAAACACCC ACTAGCCAAA
TACACGCGGT ACTACATCTC TACAGCAGGC AAGCCCATCT GGATAGCCAT CGGAGACTAC
GCCCCAGCCC CCACCCTATC GGTAGAAAAG ATAATGCCCA GGTCCCCCAC CCCCGGCGAC
GTGGTGACAG TCACGATAAA GGCAGAGGAC CCCAACGTCG GCATACCCTT CCTAACGGTA
GACGGCAAGA AGATACTCGC CTCCTACCCA GGAGAGCAGC CGGTATACCA ATACAGATTC
AGATACGACA AGCCAACCAC ACTACAGATA CAAGCCCCAG GAGGCCAACC CACAACAGTA
CAGATAGGCC AAACCACGCA GCCGCCAACA ACTACGCCGA CACCCACCCC GACACCCACA
GCCACCCCAA CCCCAAGCCC CACGCCAAGT AAAACACCAA CGCCCACGCC AACTACACCA
CCCCCAACCA CCCCGACGGC CACCTCTCCA ACACCTACGC CCACCCCCAC CCGCACGCCA
ACCCCCACCG CAGCTCCGGC GCCAGCCCCA TTCCCCACTG AAGCCGCCGT GCTAATCGCA
ATAGCCGCGG TCGGCGCCGC GCTACTGGCG GCCCTCGCCA AAACAGGCAA AAAGAAGGCA
GAAACCGGCG GAACCAGGGT ATACGGAGAA AGATAA
 
Protein sequence
MRQLYLLLAA ALALAASIVD PRWAVPAYVT PGGAFNITLD APTAVKSVAL AAPGVEKPLE 
LSFNASGAVI TARVPADVPP GLYDLVINGG EIYEPKAVWV GNVTGPLRII QLTDIHVGVE
LDMASIYRLT HAALYASSSP YDVVFLTGDL ADVGGQPWQY ALLVRYTSTI TKPIFAVPGN
HDHAGDDPLN NYRRYVGPPY WYRVVGPYLI IGLDSGHDGY LTEEQVKFYG DVLRRYPDKV
KIVLIHHPPF YIRDAYVAEI YRGPQDIDRL SRDPTGRRNY YIVYTSYLYN RPTYERFLNL
TIRYRVALVM AGHVHPGNST VVINGTYFVT TRTLGGSVDT SHGFRTYVVY PDGRVQINPE
TLTYKNYAVV AQGAKAAQIY ADSDLLPGTI AIDLPGQYQG LKALNGTAQL VEAKKHPLAK
YTRYYISTAG KPIWIAIGDY APAPTLSVEK IMPRSPTPGD VVTVTIKAED PNVGIPFLTV
DGKKILASYP GEQPVYQYRF RYDKPTTLQI QAPGGQPTTV QIGQTTQPPT TTPTPTPTPT
ATPTPSPTPS KTPTPTPTTP PPTTPTATSP TPTPTPTRTP TPTAAPAPAP FPTEAAVLIA
IAAVGAALLA ALAKTGKKKA ETGGTRVYGE R