Gene Hneap_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1987 
Symbol 
ID8535146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2127020 
End bp2129941 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content59% 
IMG OID646384369 
ProductProtein of unknown function DUF2339, transmembrane 
Protein accessionYP_003263856 
Protein GI261856573 
COG category[S] Function unknown 
COG ID[COG5373] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0361094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGATTA CACTCACGTT TGTGGGGCTG ATTTTCGGGA TGGCGATGGC GGGAATCTGG 
GGCGCGGCGT TTGGCGCACT GACCGGCTTT CTCGTAGCGC AGGTCAGTCG GTTGAATCGG
CAGGTTCAGG CCTTGATCGC CGATCAGATA TTGTTGCGAG ATGAATTGCG TCATCTGTCT
CAACCTCCGG CGCAGCGCGC GTCGCCATCC GAAGCGTCGA ATCCGCCCGC CGCGCCCGAA
CCCGCCGTGG ACGAAGTGGC GTCGTGCGCA CCCCAATCTA AACCCGATCT TGTGCCATCG
CACGAACCTT TGCCGGAATC TTCATCCGTG CCATTGAAGA CGGTTGGCGA GTCGCCTGTG
ATCGCGCGAA CCAACGCTTC GTTGCCTGAA AAAGACCCTG AAAACGGTGC ATCGGTCTCG
GCCTGGGGCG CGCCCTCGTC GGCCGAAACC GGAGCGCCAG ATGGCTTGTC GCGCCTGTGG
TCGAGCGTGT ACCGATTCCT CACCGAGGGC AATGTGGTCG CCAAGATCGG GGTGATCGTG
CTGTTTTTCG GCTTGGCCTT CCTGCTGAAA TACGCCGCCG ATCAGGCCCT GTTCCCGATC
AGCGTCCGCT TGACGCTTGT GGGTATCGGC GGGTTGGTGC TGCTCGGCAT CGGCTGGTAT
TTGCGTGAGC GACATACCGG TTATGCCCTT GTTTTGCAGG GCGGGGGCAT TGGTCTGACG
TATCTCACGT TATACGCCGC ATTTCGTCTG TACGGGTTGT TGCCGGCGGG CGTAACGATG
GGGCTGATGC TTCTGGTTGT GGCCGCTGCC GCAGTGCTGG CGGTCGTTCA GGATGCCCGG
AGCCTTGCCG TGCTGGGGAT TATTGGCGGT TTTCTTGCGC CGATTCTGGC GGGTAGCGAC
AGCGGAAGGC ATGTCGATCT GTTCAGTTAT TACCTTGTGC TCGATTTCGG CATCGTGTTC
GTCGCCTGGC GCAAGGCGTG GCGTGAACTC AACCTGCTGG CCTTTTTGTT TACGTTCGTG
ATCGGCACAA TCTGGGGTGG GTTGAACTAT AAGCCTGCGC TGTTTTCGAC AACCGAACCT
TTCCTCATCG GTTTTTATCT GATCTTTCTG GCGACAGCCC TATTGTTTGC GCGTCAGCAA
CGGGCAGGCG GTCAGCGTGA TTATGTGCAA TCCACCCTCG TGTTCGGCCC GCCCCTGGTC
GGGTTTGGGC TGCAAGCCGC TCTGGTCCAA AATTTCGAAT ACGGTCTGGC CTGGAGCGCA
TTCGGCTTGG GCGCGCTCTA TCTTGTGTTG TGGCTGGGGT TGCGGCGCGC TGTCGGTGAG
TATTTCAAGA TTCTGAACGA TGCTTTCCTG CTATTGGGCC TCGGTTTTGT TTCTTTGGCC
GTGCCGTTCG CGTTCGATGG GCAATGGACC AGCACCACCT GGGCGTTGGA AGGCGCGGCC
ATGCTTTGGG TCGGATTGCG GCAGGGGAAA ACTTGGCCGG TTGTGTTTGG CCTTTTGCTG
CAACTCGGTG CCGGCGTGGC GTTTGGCGAC GATCCTTCAT CGCTCGACCC GACACACTGG
CCTTTGCTGG ATGGCTATTT CCTGAGTGGC GGTTTGATTG CGCTTTCCGG GTTGGCGAGC
GCGTATTTGT TGCGTGATTG GCGAAACTGG GTGCCGGTCC CAGCCCTGTT GACCCTTTGG
GGATTGGCCT GGTGGTTCGG CACCGGTTTC TACGATCTGG CACATGTGGC GAGCTGGCTT
CAGCCGTTCA CGCTGTGGCT GATGTTCGCC AGTGCTTCGA TGCTGCTGGT CCAATGGGTG
CGCGGGCGGC TGCAGGACTG GTCGATTCTG CGCTATACGC TCGCGCTGCA AACCGTCTGG
ATGTGGGCGC TGGGCGGCTT GATTTTGCTG TTGAATATGA GCCCGTTCCA CGAAGGAGGC
TGGTTTGCAT GGCTATTGGC GTTCGCCACA CTCTATGGCG GGCTCTACTG GAGCGAGCGT
CGAAGCGAAT CCGTTTTCGC ATCCGAGCGC TTGCATGGTT TGGGCTTGTG GCTGCTGGCG
CCTGTGTTGG CGCCCCAATT GGCGGATGCG ATCTTCCGGG GACTGTTCGG GTTTACGCTG
TATTTCGGCT GGTTCGATTT GGGGCCAAGG GGCATCATGC ACCCGAACAC GCCCGGCGTT
TGGACGGCCA TGAGTTGGGG ATTGATACCG GTGTTGCTGC TGAGTTGGGT TGGTTCCGCC
CGTCACTGGC CCTTCGCTGA ACGATTTGGC CATGCGGCTG ACTATCGTGG ATGGGTCGCA
TCGGGATTGG GTGTGTTCTT ACTGGGTTGG ATGTTTATCG TCCACGGGCT GTGGGTGACT
GATCCTGTTA TGGGTGCAGG CTGGGGCCAA CCAGCACAGA TCGGTTATTT GCCATTATTC
AATGCACTCG ATTTTGTTTC TGCTCTGGCC CTGTTCGCCC TATGGCGACA TGGAAGACTG
ACCGGTGCGT ATTTCCTGAA CTATGCCGGT GAGCGAACAC AACAGGTGCT GCATTGGTTG
ATGGGCGCTG CCGCGTTTGT CTGGCTCAAT GCCATGATTG CCCGCAGCCT GAATGCGTAC
GCCGGGCTGC CGCTCGATGA CGGGCGGTTC CTGCACGAGG CGCTGGCGCA AACGACATAC
TCTATTGCCT GGTCGTTGTT AGGCTTGATA CTGATCGTGC TGGCTAGCCG ACTGAAGCAA
CGTCGTTTAT GGTTGGTGGC CGCCGGGTTG CTGGGCGTTG TGGTGCTCAA ACTTTTTCTG
GTCGATTTGT CCGGCAGCGA TACACTCGCC CGGATCATCT CGTTTGTCGG CGTGGGCGTG
CTGCTCTTGC TGGCAGGTTA TATCGCCCCG ATACCGGCCA AACAGGCCCC ATTTGCCAAT
GATGACCAGA ACACAAATGA CTCCGAACAG AAGGCCGACT GA
 
Protein sequence
MMITLTFVGL IFGMAMAGIW GAAFGALTGF LVAQVSRLNR QVQALIADQI LLRDELRHLS 
QPPAQRASPS EASNPPAAPE PAVDEVASCA PQSKPDLVPS HEPLPESSSV PLKTVGESPV
IARTNASLPE KDPENGASVS AWGAPSSAET GAPDGLSRLW SSVYRFLTEG NVVAKIGVIV
LFFGLAFLLK YAADQALFPI SVRLTLVGIG GLVLLGIGWY LRERHTGYAL VLQGGGIGLT
YLTLYAAFRL YGLLPAGVTM GLMLLVVAAA AVLAVVQDAR SLAVLGIIGG FLAPILAGSD
SGRHVDLFSY YLVLDFGIVF VAWRKAWREL NLLAFLFTFV IGTIWGGLNY KPALFSTTEP
FLIGFYLIFL ATALLFARQQ RAGGQRDYVQ STLVFGPPLV GFGLQAALVQ NFEYGLAWSA
FGLGALYLVL WLGLRRAVGE YFKILNDAFL LLGLGFVSLA VPFAFDGQWT STTWALEGAA
MLWVGLRQGK TWPVVFGLLL QLGAGVAFGD DPSSLDPTHW PLLDGYFLSG GLIALSGLAS
AYLLRDWRNW VPVPALLTLW GLAWWFGTGF YDLAHVASWL QPFTLWLMFA SASMLLVQWV
RGRLQDWSIL RYTLALQTVW MWALGGLILL LNMSPFHEGG WFAWLLAFAT LYGGLYWSER
RSESVFASER LHGLGLWLLA PVLAPQLADA IFRGLFGFTL YFGWFDLGPR GIMHPNTPGV
WTAMSWGLIP VLLLSWVGSA RHWPFAERFG HAADYRGWVA SGLGVFLLGW MFIVHGLWVT
DPVMGAGWGQ PAQIGYLPLF NALDFVSALA LFALWRHGRL TGAYFLNYAG ERTQQVLHWL
MGAAAFVWLN AMIARSLNAY AGLPLDDGRF LHEALAQTTY SIAWSLLGLI LIVLASRLKQ
RRLWLVAAGL LGVVVLKLFL VDLSGSDTLA RIISFVGVGV LLLLAGYIAP IPAKQAPFAN
DDQNTNDSEQ KAD