Gene Hneap_0907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_0907 
Symbol 
ID8534048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp973927 
End bp977067 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content51% 
IMG OID646383292 
ProductProtein of unknown function DUF2309 
Protein accessionYP_003262797 
Protein GI261855514 
COG category[S] Function unknown 
COG ID[COG3002] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAC TCCCTTTGGG TAAACGCCTG AAAATTCGTT CCATGGTGCA CATGGCTGCC 
GAGCCAATCC CCAATTTCTG GCCTATGCGG ACGTTCATTC ACCATAACCC GCTCCATGGT
CTGGAGCATC TGCCCTTCGA ACAGGCGGTT CGCCAGGGTG AAAAACTCTT TCACGCACGG
GGATTTTTGC CGCGTGAGGA TTACCAGCGC TATCACAAGG AAGGCCGAGT TGATCAAAAC
AGCATAAAGC GTGACATAGC CGATTTTATT TCAAAACAAG AAACGCTCAA CGGTTTGGAT
TTGGCATCGT TACTTAGCGA CTTGATGTGT TCGGTTAAGA ACAAAGTAAC TAGAACGCGC
GCGCTCGCCG ATCATGATGA TGTGTTCCAA GCCTTGCACG GGAAACAACT GGAAAATGCA
GAGGCGCTCG ATCTTAAAGC GCTAACTCAA CGCTTATGTG CGCAGTTTGC ACCAGAGCGC
CCCCTGTACG AAGCCATCGA TTTGCTGTTC GGCACGCAAA TGGGCACCAC ACTCGATGAA
TTGGTAATCA AAAGCTGTCT CGACTTTTTC GATGAAGGCC AATCAACCAT CCAAATGCCC
GGCCGCCACC AAGGATTGTT CGCAGCTTGG ACGGCCCTGG CAAAACGTAA TTTACGCCTG
TTTTTACGTG GCATGCATAT CAAACAGATC CTCGATCAGG ACGATACGCC AGAGGGCATC
ATCGCCTACA TTCTCGACGA ACTGGGCATT GAGGAGGCTC ACTGGGATGG GCTGATTACC
CGCGAACTGA CTCGTTTGCA TGGCTGGGCA GGTTTTATTC GCTGGCGCTC CTCTTCCAAG
CACTACTATT GGGCCGAGCA GTACCCGGGG GATCTCATCG ATTTCCTGGC CATCCGGCTC
GTTTTAGGCT TGGCCTTGAT CCGTGAACAT AGCCGTCAAA AGCGCACACC GATGACAGTC
AAAGTGCTGC AAGAATATAT CGAAGGGCAC ACCGCCGAAT GCTATCTGCG TCAAGCCTAT
TACGGTGGCT GCATATTGCC CGCGTTTGCT CATGATGTTG ATGATGCGCT GTCGCATAAA
AAGCCTCAAA GGATCAACAA CATTCTTCCG GGCTACCTGC GCCAACAACG CCAATTCGAG
GCAACACGAC AAGCTGATGC GCTTCGTGAT TTAGCCAGCA AAGCGGGGCA AACCGATGCC
CTTATGGCGC TGAATGCGCC TCAAATCAAG CAACTCATGA CACTTATCGA GGCGTTTGAA
AACGAAGAAG GCATGATCTG GCTTCGTGCG ATGGAATCGG TCTATCGACG GGAAATCATC
AACCAGATTC AACTGTATGC ACCGCATAAA AAAGAAAAAC GGCCCTTTGC CCAGGCATTG
TTCTGTATCG ATGTGCGCTC CGAGCCGATA CGCCGTAATC TGGAAACGGT AGGCGAGTAT
CAAACCTATG GTATCGCCGG GTTTTTTGGT GTTCCGGTAA GCTATATTGG CCTTGGCAAG
GGCAGTGAAG TTAATCTTTG CCCGGTGGTC ATTACCCCTA AAAATCTGGT GCTTGAAGTG
CCCGTGGGTG CCACAAGCAT TGAAACAGAC TTTTATTCTT CCGCCGACCA TGTGCTACAT
GAGATGAAAA GCTCGATCCT TTCACCCTAC TTCACGGTTG AAGCGGCCGG TTTGCTGTTT
GGTTTCGACA TGATCGGCAA AACCATTGCC CCGCGACGCT ACACCCAAAT ACGCAATCAT
ATCGAACCAA AAGCACAGGC AACTCGTTTG CTGGTGGATA AACTCACCCG CGAACAAGCC
GACTCAATCG TCCGTTCGCT GCAACGCGCC ATGATTGTGC GCGCCATTCA TCAGGAATTT
GGCATCGAAC GCGAAGCAGT CACCGATGCC ATGATCCGCG AACTGCGCGA AGCGGCCATG
GATAACTATC ACGAACAGAC CGAATTCGCG CGCCGTTTCG CCTTGAGCCC AACGGCCGAA
ACTCAGTTTA TCGCAGGGCT GAAAAAGGAC TATAAAATCA ATCGCTCATT TGTTTCCATG
CAAATGGAAC GCTTGGCCAG GATTGGTTTT AGCCTCGATG AACAGGTGTT TTATGTTGAC
AAGGCACTCA CATCCATCGG ATTGACCGAA AACTTTTCAC GCTTTGTTTT ATTGGCCGGT
CACGGCAGCA CTTCCGACAA TAATCCCTAC GAATCCGCGC TTGACTGCGG TGCATGTGGT
GGTAGTCATG GGCTGGTTTC TGCCCGGGTG CTTGCCCACA TGGCCAATAA GCCTGAAGTA
CGTCGCAGAC TGGCCAAGCA AGGCATCCAG ATACCTGAAG ATACTTGGTT TGTGTCCGTC
ATGCACAACA CCACAACCGA TCAATTGTCA CTGCAAGACC TTGATTTGCT TCCAAACAGC
CATCTTGTTT ACCTCGAACG CTTGCGTAAC GGCTTACGTG CGGCCACCCG TTTGTCAGCG
GCAGAACGCT TGCCTGCTCT GCTTGATCAT CCTTCGCCCA ACATCGACAC ATTATCGGCA
CAAAAACAAA TTGAGCGGAA TGCCAGCGAC TGGACCCAAG TTCGGCCAGA GTGGGGCTTG
GCGAGAAATG CGAGCGTCGT CGCCGGCGGC CGACATTTGA CCGAGGGTGC GAACTTAAGC
GGTCGAACGT TTTTGCAGTC TTACGATTAT CGACTCGATC CCAAAGGCCG CCACCTTGAA
AACATTCTCA GCAACCCGCT AATTATCGGC CAGTGGATCA ATCTTGAGCA TTATTTCTCA
GCGGTAGATA ACGAACACTT TGGCAGTGGC AGCAAGGCCT ATCACAACGT CGTAGGTCGT
TTTGGTGTGG TTACGGGTAA TTTAAGTGAC TTGCGAACAG GGTTACCAGC ACAGTCGGTG
CTTAAAGATG GACGCCCATA CCACGAGCCC ATCCGTCTCT TGGCGATTAT CGAAGCACCC
GCAGCATTCA CCCTCGAAGT AGCGGGTCGA TTGCCCAAGG TGATGTCCCT GATTACCAAC
GGCTGGATCA CTGTTGTTGT CGTTGATCCG GAAACGGGCG ATCGTCTTTT TTATGATCGC
GGCGAATGGT ACAATCTCAA CAATGATCCG CAGTACACGC CCTCGGTCAA ACCCTTGCTT
GAAGAAGAAC TCAGCGCATG A
 
Protein sequence
MSKLPLGKRL KIRSMVHMAA EPIPNFWPMR TFIHHNPLHG LEHLPFEQAV RQGEKLFHAR 
GFLPREDYQR YHKEGRVDQN SIKRDIADFI SKQETLNGLD LASLLSDLMC SVKNKVTRTR
ALADHDDVFQ ALHGKQLENA EALDLKALTQ RLCAQFAPER PLYEAIDLLF GTQMGTTLDE
LVIKSCLDFF DEGQSTIQMP GRHQGLFAAW TALAKRNLRL FLRGMHIKQI LDQDDTPEGI
IAYILDELGI EEAHWDGLIT RELTRLHGWA GFIRWRSSSK HYYWAEQYPG DLIDFLAIRL
VLGLALIREH SRQKRTPMTV KVLQEYIEGH TAECYLRQAY YGGCILPAFA HDVDDALSHK
KPQRINNILP GYLRQQRQFE ATRQADALRD LASKAGQTDA LMALNAPQIK QLMTLIEAFE
NEEGMIWLRA MESVYRREII NQIQLYAPHK KEKRPFAQAL FCIDVRSEPI RRNLETVGEY
QTYGIAGFFG VPVSYIGLGK GSEVNLCPVV ITPKNLVLEV PVGATSIETD FYSSADHVLH
EMKSSILSPY FTVEAAGLLF GFDMIGKTIA PRRYTQIRNH IEPKAQATRL LVDKLTREQA
DSIVRSLQRA MIVRAIHQEF GIEREAVTDA MIRELREAAM DNYHEQTEFA RRFALSPTAE
TQFIAGLKKD YKINRSFVSM QMERLARIGF SLDEQVFYVD KALTSIGLTE NFSRFVLLAG
HGSTSDNNPY ESALDCGACG GSHGLVSARV LAHMANKPEV RRRLAKQGIQ IPEDTWFVSV
MHNTTTDQLS LQDLDLLPNS HLVYLERLRN GLRAATRLSA AERLPALLDH PSPNIDTLSA
QKQIERNASD WTQVRPEWGL ARNASVVAGG RHLTEGANLS GRTFLQSYDY RLDPKGRHLE
NILSNPLIIG QWINLEHYFS AVDNEHFGSG SKAYHNVVGR FGVVTGNLSD LRTGLPAQSV
LKDGRPYHEP IRLLAIIEAP AAFTLEVAGR LPKVMSLITN GWITVVVVDP ETGDRLFYDR
GEWYNLNNDP QYTPSVKPLL EEELSA