Gene Hneap_0024 details


Gene Information

Locus tag: Hneap_0024
Symbol:
ID: 8533137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 26886
End bp: 29708
Gene Length: 2823 bp
Protein Length: 940 aa
Translation table: 11
GC content: 59%
IMG OID: 646382403
Product: excinuclease ABC, A subunit
Protein accession: YP_003261937
Protein GI: 261854654
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
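
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and a CDS that ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS that ends with one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(26886, 29708)
print(length_bp)                  # 2823
print(protein_length(length_bp))  # 940
```

Both results match the Gene Length and Protein Length fields of this record.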


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGATC CGTTTATCCG CATTCGCGGC GCCCGCACAC ACAATCTGAA AAATATCGAT 
GTCGATTTAC CGCGCGATGC GCTGATCGTG ATCACGGGGT TATCCGGTTC GGGTAAGTCA
TCGTTGGCTT TCGACACGCT GTTTGCTGAA GGTCAGCGCC GGTACGTGGA ATCGCTCTCG
GCCTACGCGC GGCAATTTTT GTCGATGATG GACAAGCCGG ATGTGGATCA CATCGAAGGC
CTTTCGCCCG CCATTTCCAT CGAGCAAAAA ACCACCTCGC ATAACCCGCG CTCGACCGTG
GGTACGGTGA CGGAAATCTA CGATTATCTG CGCCTGCTCT ATGCCCGCGC CGGTATTCCC
CGCTGCCCGG AACACGGCGT TGATTTAACC GCGCAAACGG TTTCGCAGAT GGTCGATCAG
GTGCTGGCGC TGCCCGAAGA TTCCCGCTGG TTGCTGCTCG CGCCGGTCAT CGAGAACCGC
AAGGGCGAGC ACCATAAGCT CTTCGATGAA TGGCGCTCGC AGGGCTTTTT GCGGGTGCGC
ATCAATGGCA CCGTGTATGA AATCGATGAG ATTCCAGCAC TCGACCCGAA AGGCAAAAAC
ACGATTGAAG TCGTGGTTGA TCGCATCAAA ATCCGCCCGG ATATTGGCCT GCGGCTGGCC
GAATCGTTCG AAACCGCGTT GCGCATGGCC GATGGCCGCA CCTTTGTCGT CAATATGGAT
GATGACAGCG CGCATACGTT CTCTGCCAAG CACGCCTGCC CGATCTGCGG CTATGCCCTG
GCCGAACTCG AACCCCGTTT GTTCTCGTTC AACAACCCCG TCGGTGCCTG CCCAAGCTGC
GATGGCCTGG GCGTAAAGCA GTTTTTCGAC CCGCAAAAGG TCATCGTCAA CGCCGAACTC
TCGCTGGCCG CCGGTGCGAT TCGCGGCTGG GATCGGCGCA ATGTCTATTA CTACCAGCTT
TTGCTCGGGC TATCGAAACA CTACGGCTTT GATTTAGACA CGGCCTGGCA GGATCTGCCG
GAATCGATTC AAAACGCCAT TTTGCAAGGC AGTGGCCGCG AGAAAATCAC CTTCACCTAC
CTCAGCCCCA AAGGTAAAGC CAGCACCAAG GATCACCCCT GGGAAGGCGT GCTGCCCAAC
ATGGATCGCC GTTATCGGGA AACCGATTCG CAGGCGGTGC GCGAAGAGCT GGCCAAATAT
ATTTCCGAGA GCGCTTGCCC CAGTTGCGGC GGCACCCGCC TGAACTTGGC GGCGCGAAAC
GTATTCATCG AAAACGAAAA CCTGCCCGCC ATCACTCATC GCTCGATCAG CGAGGCGCTC
GCGTTTTTTC AAACGCTTGA CCTGCCGGGC GCGCGCGGTG AAATTGCCGG GCGCATCGTG
CGGGAAATCA GCGCGCGGCT GGGCTTTTTG AACGATGTGG GCCTCACTTA TCTCTCGCTG
GATCGCTCGG CCGAAACGCT TTCCGGCGGC GAAGCCCAAC GCATTCGGCT CGCGAGCCAG
ATCGGCTCGG GGCTCGTCGG CGTCACCTAC ATTCTCGATG AACCCTCTAT CGGCCTGCAT
CAACGCGATA ACGACCGCTT GCTCGGCACC CTCAACCATC TGCGCGAGCT GGGCAACACC
GTGATCGTGG TCGAACATGA CGAAGACGCG ATTCGGGCGG CCGATTTCGT ACTCGACATT
GGCCCCGGCG CGGGCGTACA TGGGGGCCAT ATTGTGGCGC AAGGCACGCC CGAACAAATC
GCCGCCAACG CCGATTCGCT GACCGGCGCG TACCTTTCCG GCAAGAAAAA AATCACCGTG
CCCAAGCGTG TGACGCCCGA TGCCGAACGC TGGCTCAAGC TGTATGGCGC GCGGGGCAAT
AACCTCGTCG GCGACACGCT CGAAATCCCG ATGGGGCTCC TCACCTGCAT TACCGGCGTA
TCGGGTTCCG GTAAATCCAC CCTGATCAAC GACACGCTCT ATCTCATTGC CGCACGCGAT
CTGAACGGTG CCAGCACGCG ACCCTCCGCG TATGAACGCA TCGAACACCT CAACCAGCTC
GACAAAGTCA TCGACATCAA CCAATCCCCC ATCGGCCGCA CGCCACGCAG CAATCCGGCC
ACCTACACCG GCCTGTTCAC GCCGATTCGC GAGTTGTTCG CGGGCACGCA AGAGGCACGT
TCACGCGGTT ACACGCCGGG GCGGTTCTCG TTCAATGTGA AAGGCGGACG CTGCGAAGCC
TGCCAGGGCG ATGGCGTCAT CAAGGTCGAA ATGCACTTTT TGCCAGATGT GTATGTTGCT
TGCGATGTCT GCCACGGCAA GCGTTACAAC CGCGAAACGC TGGATATCCG CTACAAAGGC
AAAACCATCA GCGACGTGCT GGCCATGACC GTCGAAGACG CCGCGCCCTT TTTCGATGCC
GTGCCCGCCC TGCATCGCAA ACTCAACACA CTGCTCGATG TGGGCTTGGG CTACCTCACG
CTCGGCCAAA ACGCCACCAC CCTCTCCGGC GGCGAAGCCC AGCGCGTCAA ACTCGCCAAG
GAACTGTCCA AACGGGATAC CGGCAACACG TTGTATATTC TCGATGAACC AACCACCGGC
CTGCATTTTG CCGACGTCGA GCAACTGCTC GCGGTACTCT ACCGCCTGCG CGATCAGGGC
AACACCGTCG TGGTCATCGA ACATAATCTC GATGTAATCA AAACCGCCGA CTGGATTGCG
GATTTGGGCC CAGAAGGCGG CAGCGGTGGC GGGCAAATCA TCGCCAGCGG CACACCAGAA
ACCGTGGCAA AAATTGCCGG ATCGCATACC GGACGGTATC TCGCCCGATT ACTCGACCAC
TGA
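
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The sketch below shows one way to strip that layout and compute a GC fraction comparable to the GC content field of this record. It is an illustration only: the helper names are hypothetical, and the database may compute its figure differently (for example, with different rounding).

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of seq."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(base in "GC" for base in counted) / len(counted)


# First line of the gene sequence, kept in its block layout.
first_line = "ATGGCCGATC CGTTTATCCG CATTCGCGGC GCCCGCACAC ACAATCTGAA AAATATCGAT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_content(seq):.1%}")   # GC fraction of this first line only
```

Passing the full gene sequence through `clean_sequence` and `gc_content` gives the figure for the whole gene.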
 
Protein sequence
MADPFIRIRG ARTHNLKNID VDLPRDALIV ITGLSGSGKS SLAFDTLFAE GQRRYVESLS 
AYARQFLSMM DKPDVDHIEG LSPAISIEQK TTSHNPRSTV GTVTEIYDYL RLLYARAGIP
RCPEHGVDLT AQTVSQMVDQ VLALPEDSRW LLLAPVIENR KGEHHKLFDE WRSQGFLRVR
INGTVYEIDE IPALDPKGKN TIEVVVDRIK IRPDIGLRLA ESFETALRMA DGRTFVVNMD
DDSAHTFSAK HACPICGYAL AELEPRLFSF NNPVGACPSC DGLGVKQFFD PQKVIVNAEL
SLAAGAIRGW DRRNVYYYQL LLGLSKHYGF DLDTAWQDLP ESIQNAILQG SGREKITFTY
LSPKGKASTK DHPWEGVLPN MDRRYRETDS QAVREELAKY ISESACPSCG GTRLNLAARN
VFIENENLPA ITHRSISEAL AFFQTLDLPG ARGEIAGRIV REISARLGFL NDVGLTYLSL
DRSAETLSGG EAQRIRLASQ IGSGLVGVTY ILDEPSIGLH QRDNDRLLGT LNHLRELGNT
VIVVEHDEDA IRAADFVLDI GPGAGVHGGH IVAQGTPEQI AANADSLTGA YLSGKKKITV
PKRVTPDAER WLKLYGARGN NLVGDTLEIP MGLLTCITGV SGSGKSTLIN DTLYLIAARD
LNGASTRPSA YERIEHLNQL DKVIDINQSP IGRTPRSNPA TYTGLFTPIR ELFAGTQEAR
SRGYTPGRFS FNVKGGRCEA CQGDGVIKVE MHFLPDVYVA CDVCHGKRYN RETLDIRYKG
KTISDVLAMT VEDAAPFFDA VPALHRKLNT LLDVGLGYLT LGQNATTLSG GEAQRVKLAK
ELSKRDTGNT LYILDEPTTG LHFADVEQLL AVLYRLRDQG NTVVVIEHNL DVIKTADWIA
DLGPEGGSGG GQIIASGTPE TVAKIAGSHT GRYLARLLDH
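
The protein sequence can be checked against the gene sequence by translating codon by codon. Translation table 11 assigns amino acids to codons in the same way as the standard genetic code and differs only in which codons may act as starts. Because this gene begins with ATG, the minimal sketch below needs no special handling for the first codon. The function name is illustrative.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (block layout) is ignored; codons containing characters
    other than A/C/G/T are rendered as 'X'.
    """
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCCGATC CGTTTATCCG CATTCGCGGC"))  # MADPFIRIRG
```

The output matches the first ten residues of the protein sequence. Translating the whole gene sequence the same way should give a 940-residue protein if the record is internally consistent.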