Gene Hneap_1851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1851 
Symbol 
ID8535009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1986171 
End bp1987316 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content58% 
IMG OID646384232 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_003263720 
Protein GI261856437 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.641505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGCAA GCGATTTTCA CAGCCGCCTG CTCGACTGGT TCGATCGGCA CGGCCGGCAC 
GATTTGCCCT GGCAGCATCC CAGGACACCC TACCGCGTCT GGATCTCCGA GATCATGTTG
CAGCAAACGC AAGTGGCGAC GGTGATCGGG TATTTCAATC GATTCATGCA GCGCTTCCCA
TCGTTGGACG TATTGGCTGC CGCACCGGTG GATGAGGTGT TGGCGTTGTG GTCTGGACTT
GGCTATTACG CCCGCGCCCG CAATCTGCAC GCCGCCGCGC AAATCATGGC GCAGCAAGGC
GTTCCAGAAA CGCGCGCCGG TTGGCAGGCA TTGCCTAGTG TAGGGCCTTC CACAGCCGCT
GCGATCATGG CGCAGGCATT CGATGTGCCG GAAACGATTC TGGATGGAAA CGTTAAACGC
GTGCTGGCAC GCCATGCCGG TATAGATCGT CCCATCGAGC AGGCTTCGAC TATTCAGGCG
CTGTACGAAG TTGCCAAATT ACATACGCCG CAGACCCGCG TGGCCGATTA CACGCAGGCT
ATTATGGATC TCGGTGCAAC CCTGTGCACA CGCCATTCGC CGGGATGCTC GGCTTGCCCC
GTTTCTGCGG ACTGCGTGGC GTTCGCGTCC AATCGGGTTG AATCCTTGCC CGTTCGGCGC
AGGCGTCAAC CCGTGCCGAC CCGTCGTGCT GTCTTCATGG CGATCGAGGA TGAAGCGGAA
CGCCTGATGC TGATTCGCCG TCCTCCGACG GGGATCTGGG GTGGGTTGTG GTGTCTGCCC
GAGTATATTC CCGCCGATGA CTGTGTGTCG CAGGAACACA CACTCGATTC GGCCGTTTCA
GGTAAACAGG CGAAAGATAA TTCAGGCCAG CAGTCGATGC TTTCGATACA TCGGCTGAAA
CAGCGCGGCC CAACTTCGCC CTTGACAACA TTCGAACATC GATTCACGCA TTATTTGCTC
GATGCCCGCA TTGATCACAT GGTGGTTTCT CGCACATCGA GCGTCGAGGA CAATCCCGAT
GTGCTCTGGT TGCCATTGAT TGAGTTGGCA GCGCGCGCGC CGCTTTTGGG GTTGCCCAAA
CCAATGAGCC GTTTTTTGTC GGACTATCCG GCGCTGAAGT CATCCCAGAG TTCATGCACG
CCATAA
 
Protein sequence
MSASDFHSRL LDWFDRHGRH DLPWQHPRTP YRVWISEIML QQTQVATVIG YFNRFMQRFP 
SLDVLAAAPV DEVLALWSGL GYYARARNLH AAAQIMAQQG VPETRAGWQA LPSVGPSTAA
AIMAQAFDVP ETILDGNVKR VLARHAGIDR PIEQASTIQA LYEVAKLHTP QTRVADYTQA
IMDLGATLCT RHSPGCSACP VSADCVAFAS NRVESLPVRR RRQPVPTRRA VFMAIEDEAE
RLMLIRRPPT GIWGGLWCLP EYIPADDCVS QEHTLDSAVS GKQAKDNSGQ QSMLSIHRLK
QRGPTSPLTT FEHRFTHYLL DARIDHMVVS RTSSVEDNPD VLWLPLIELA ARAPLLGLPK
PMSRFLSDYP ALKSSQSSCT P