Gene Hneap_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1996 
Symbol 
ID8535155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2139340 
End bp2140398 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content57% 
IMG OID646384378 
ProductNUDIX hydrolase 
Protein accessionYP_003263865 
Protein GI261856582 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00586] mutator mutT protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.42027 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCCTC CGGAAGAAAC ACGAATCGCA CTCGCAGTTT TACCGGCCGG ACCGAATCAG 
GCCGGTCTTC CCCAATATTG GCTTGAGCGC CGCCCCGATT CTGCGCATCT GGGCGGGATG
CTGGCGTTTC CCGGTGGCAA GTGCCAGCCG GATGAATCTC CCACAGATGC ATTGGCTCGC
GAACTGTTTG AGGAACTCGG TATCCTGCCG CAAGCGTCGC GGTTGCTTAT GGAAATTCCC
TGGGTTTACT CGGCCAATTC AAGCGATCTT GAAGGCAAAC CGAAATCCAA GCACCTTCGC
CTCATTGTCT ATCGAGTCGA AAAGTGGCAA GGCGAACTTC ATGGCCGCGA AGGACAATCG
GTAACAGCTC AAACACTGGA TTGCAGTCGG CATGGCGAGT GGATGAGCGC CTTGCCACCT
GCCAATCGGG GGATTGTCGC TGCCTTGTGC CTGCCGCCTC GAATAGCAAT TACAGCTGCG
TGCGGTGCGG GCGATGCCGG GTTTTCCGTT TGGCATCAGG CATTGGTCAA GACAGCCAAT
GCGCTTAGAC AGCAATTTCG ATCGTCATTT GGGGGGCGAT CATCCATCGT GCAATTACGT
CCCGGTCGGG ATCTAAGCAT GGCCCAGTGG ACCGCCGCAG TGGCCACCGT TCAGTCGTTT
GAGTTGTCCG CGTGGGTGAA TGCGAGTTTG GATATTGCCA TATCGTGTCG CGCAGATGGT
GTTCACTTAA ATCGACACCG TCTGGCCTCG GTGGATCGGG AAGCCCTGGC GAATTGGCGT
GCACAAAATC GTTGGGTTAG TGCATCCGGC CATACCTTGG AAGAAGTGCG ATTGGCCAAT
GAGGTCGGCG TCGATGCCTT GCTGATTTCT CCCATCCTAC CAACGTTAAG CCATCCGGGA
GAATCCGGAA TCGGTTGGGC ACAGTTCGCG GAATTGACTC GCGAAGCCAC CATGCCCACC
TATGCGCTTG GTGGCATCTT GGAAACGCAC CTGCCCCAGG TGCAAATGTT GGCAGGGCAG
GGGGTCGCCG CCATTCGTGG CTATTGGATG GACTCTTGA
 
Protein sequence
MSPPEETRIA LAVLPAGPNQ AGLPQYWLER RPDSAHLGGM LAFPGGKCQP DESPTDALAR 
ELFEELGILP QASRLLMEIP WVYSANSSDL EGKPKSKHLR LIVYRVEKWQ GELHGREGQS
VTAQTLDCSR HGEWMSALPP ANRGIVAALC LPPRIAITAA CGAGDAGFSV WHQALVKTAN
ALRQQFRSSF GGRSSIVQLR PGRDLSMAQW TAAVATVQSF ELSAWVNASL DIAISCRADG
VHLNRHRLAS VDREALANWR AQNRWVSASG HTLEEVRLAN EVGVDALLIS PILPTLSHPG
ESGIGWAQFA ELTREATMPT YALGGILETH LPQVQMLAGQ GVAAIRGYWM DS