Gene Hneap_2090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_2090 
Symbol 
ID8535249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2237151 
End bp2239655 
Gene Length2505 bp 
Protein Length834 aa 
Translation table11 
GC content58% 
IMG OID646384468 
ProductDNA topoisomerase I 
Protein accessionYP_003263955 
Protein GI261856672 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.475207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAAA CGCTTGTTAT CGTTGAGTCG CCCGCCAAAG CGAAAACCAT TAAGAAATAC 
CTCGGCCCCG GCTACGAGGT GCTCGCCTCC TACGGCCATG TGCGCGACCT GATCCCCAAA
GATGGCGCCG TGGATACCGC GCACGACTTT GCGATGAACT ACACCCTGAT CGACAAGAAC
GTGCGCCACG TCGATGCCAT TAAAAAAGCG CTTAAATCAT CCGACATTCT CCTGCTCGCA
ACCGACCCCG ATCGCGAAGG CGAAGCCATT TCCTGGCATT TGCGCGAACT CTTGGCGGAA
GCAGGCTTGC TGAAGAACAA GACGGCGCAA CGGGTGGTGT TCTACGAAAT CACCAAAAAA
GCAGTACAGG AAGCGGTCGC GCATCCACGC GATCTTTCCA TCGATCTGAT CAATGCCCAG
CAGGCACGCC GCGCACTCGA TTACCTCGTG GGCTTCAACT TGTCGCCGCT CTTGTGGAAA
AAGATCAATC CGGGGCTGTC CGCCGGACGG GTGCAAAGCC CCGCCCTGCG CCTGATCGTG
GAGCGCGAGG CCGAGATCGA AGCGTTCAAT CCGCAGGAAT ACTGGACCAT ACTCGCCGAC
TGCCAGGCCG ATCGGGCTGC CGAAAAGAGC CGATTCAATG CACGCCTGCT GACACTGGAC
GGGCAAAAGG CCGAACAATT CACGCTGACC AATGAAACGG ATGCGCAATC TGCGCGCAGC
CGTATCCTCG AAGCCGCCGC CGGCACACTC ACCGTCCGTT CGGTGGAAAA GCGCGAACGC
AAGCGCAATC CGGCCCCTCC GTTTACCACC TCGACGCTCC AGCAGGAAGG GGTTCGTAAA
CTCGGGCTCT CCGCCTCGCG CGTCATGCGC CTGGCGCAGG AGTTGTACGA AGGCGTCGAC
ATCGGCGCGG GCACCGTGGG TTTGATTACC TACATGCGAA CCGATGCCGT GACCCTCTCC
GAGGACGCGC TCACGCAGAT TCGGGCACAT ATCGGCGATA AGTACGGTGC GGCCTACCTT
CCCGCCAGCC CGAACCGCTA CAAGACCAAA TCCAAAAACG CACAAGAAGC GCACGAGGCC
ATCCGTCCGA CATCAGCGGC ACATACCCCG GATTCGGTAC GCGCTTTCCT GAACAAGGAT
CAGTTCCGCC TGTATGAGAT GATTTTCAAG CGCGCGGTCG CTTCGCAAAT GACACCGGCA
GTCTACGATC AAGTTTCCGT TGATCTCGCC GTCAATGATC AGCATAGCTT TCGCGCCAAT
GGCTCTACCC TCAAATTCCC GGGCTTCATC GCGCTTTATC GCGAGGATGA AGACGATGCC
TCAGGGGACA ATGACGAAGA TCGCCGCCTG CCGCCGCTGA CCGTAGGCGA CAAGATTGCC
CTAAACGACA TCGCTGCCGA CCAGCATTTT ACCGAGCCAC CGCCGCGCTT TACCGAGGCG
AGTCTGGTCA AGACGCTGGA AGAGTATGGT ATCGGTCGCC CTTCGACCTA TGCCAGCATC
ATCTCCACCC TCCAGGCGCG TGAGTACGTT CTGCTCGACC AACGGCGCTT CAAACCCACC
GATATGGGCC GGGTCGTTAA TGGCTTTCTG ACCGACTACT TCCGCGATAT TGTCGATTAC
GAATTCACCG CCAAACTGGA AGATGATCTG GATGCCGTGT CCCGCGGCGA ACGCGACTGG
GTGCCATTGA TGCGCGAGTT CTGGACGCCA TTCCATGACC GTGTCGAGCA CACCAATGAG
AATGTCACCC GGCAGGAGGC CGCCCAGGGG CGTGAACTGG GGATCGATCC CAAATCGGGC
AAGCCCGTCT CCGTCCGACT CGGGCGGTTC GGTCCCTTCG CCCAGATCGG CACCAAGGAC
GACGAGGAAA AACCAAAGTT CGCCTCGCTC AAGCGCAGCC AGAGCATTGC CACCATCACG
CTGGATGAAG CCCTGGACCT GTTTCAGTTG CCGAGAAAAC TGGGCGAAAC CCCGGAAGGC
GAACCTGTCG AGGTCGCCAT TGGTCGGTTC GGCCCCTTTG TAAAGTTCGG CAAAATGTAC
GCTTCGCTTG GCAAGGATGA CGATCCGTAC ACCATCGAAC TGCCGCGCGC GCTCGAAATC
ATCGAGATCA AAAAGCTCGC TGAGAAGAAT CGCTACATCA CCCAGTTCGA TAATGGGGTG
TCGGTACAAA ACGGTCGCTA TGGCCCCTAC ATCACCGATG GCAAAAAGAA CGCCAAAATC
CCCAAGGACA AAGATCCAAA ATCCCTGACA CTGGAAGAGT GCGTCGCCTT GCTTGCAGCT
GCCCCTGAGA AGAAATCGGC GCGCGGCAAA ACCGCTGCGA AAGCGACGGC CAAGAAGACG
GCAGCCCCAA AAGCGACGGC TCCGAAAACA ACAGCCAAAA AGTCAACGAC GCCGCGCGCC
AAAAAGGCCA GCGCGACCGA TGCACAAGCG CCTGCAAAGA AGACGCCGGT AAAAAAGCCC
GCCACCGCTC GCAGCAAGAA ACCCGCCGCG ACGATACCGG AATAA
 
Protein sequence
MSQTLVIVES PAKAKTIKKY LGPGYEVLAS YGHVRDLIPK DGAVDTAHDF AMNYTLIDKN 
VRHVDAIKKA LKSSDILLLA TDPDREGEAI SWHLRELLAE AGLLKNKTAQ RVVFYEITKK
AVQEAVAHPR DLSIDLINAQ QARRALDYLV GFNLSPLLWK KINPGLSAGR VQSPALRLIV
EREAEIEAFN PQEYWTILAD CQADRAAEKS RFNARLLTLD GQKAEQFTLT NETDAQSARS
RILEAAAGTL TVRSVEKRER KRNPAPPFTT STLQQEGVRK LGLSASRVMR LAQELYEGVD
IGAGTVGLIT YMRTDAVTLS EDALTQIRAH IGDKYGAAYL PASPNRYKTK SKNAQEAHEA
IRPTSAAHTP DSVRAFLNKD QFRLYEMIFK RAVASQMTPA VYDQVSVDLA VNDQHSFRAN
GSTLKFPGFI ALYREDEDDA SGDNDEDRRL PPLTVGDKIA LNDIAADQHF TEPPPRFTEA
SLVKTLEEYG IGRPSTYASI ISTLQAREYV LLDQRRFKPT DMGRVVNGFL TDYFRDIVDY
EFTAKLEDDL DAVSRGERDW VPLMREFWTP FHDRVEHTNE NVTRQEAAQG RELGIDPKSG
KPVSVRLGRF GPFAQIGTKD DEEKPKFASL KRSQSIATIT LDEALDLFQL PRKLGETPEG
EPVEVAIGRF GPFVKFGKMY ASLGKDDDPY TIELPRALEI IEIKKLAEKN RYITQFDNGV
SVQNGRYGPY ITDGKKNAKI PKDKDPKSLT LEECVALLAA APEKKSARGK TAAKATAKKT
AAPKATAPKT TAKKSTTPRA KKASATDAQA PAKKTPVKKP ATARSKKPAA TIPE