Gene Hneap_1750 details


Gene Information

Locus tag: Hneap_1750
Symbol:
ID: 8534908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 1877815
End bp: 1880892
Gene Length: 3078 bp
Protein Length: 1025 aa
Translation table: 11
GC content: 56%
IMG OID: 646384132
Product: protein of unknown function DUF490
Protein accession: YP_003263620
Protein GI: 261856337
COG category: [S] Function unknown
COG ID: [COG2911] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
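
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1877815, 1880892)
print(length, "bp")                      # 3078 bp
print(protein_length_aa(length), "aa")   # 1025 aa
```

Both values agree with the Gene Length and Protein Length fields of this record.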


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATTGG AAAAATTGCA CATCGATACC CTCGATGGAT CGCTCTGTGG GCGGGGCAGC 
GTGGATTTTG CCCCCCAGCT CACAGCCACT ATTCACGGGC AGGCAAGGGG TTTGAATCCG
GCAAGGCTTG CGCCCGCGGC GGCGGGGCAG GTGGGCTTTG ATTACCAGTT TTCATTTGCC
CAAAAAGACG ACAAGGCAAG TAAGCCCACA ACGCCTGAGA TGCAATTTAA GCTCACTGAG
CTTGGAGGGC ATCTAGCCAA GCTGCCCTTC GATGGGCTAA CGGTGGATGG TTCGATGGCG
AATCAACAGG TCTCGCTGGA TATCAGCAAC GGGACATTGG CGGGCGGTGC GCTCAAGGCA
AAGGGCGAAC TGGGCTTGAC CGGCGCACGG CCCGTGGCGC TCTCGCTGGA TTTGGATCGA
GGAGCCCTGC GCGATATGTT GGCATCAACT GGCGTGGTCG CCGAGGGAGC GATCAGCACG
CATCTTAAAG TGAACGGTTC ATTGGGTATC GATCCGTTGC GGGATGCTCA AATAAGTTTT
GACTGGTCAA TACCGAGCAC GGTGCTGCTT ATGCCTGCAA CTGCGGCTAA TCAAAAAACG
GTTCGGGTGC CGTTGTCATT GGCGTTACAC GGCAGTTTTG CCGATCAGCG GCTTGATGTG
AAGCAGGCAA AAATCAATCT GGCCGATGCG TCCATAAATG CCAAAGGCCG CGTGACGATG
GCGGATTTGG CGAAAAATAC GCCCGTCGAT TTGACCGTGA ATGCCCACAT CCCGCAACTC
GCGCAGATTC CCTGGCAGGC ATTCAATCTG CCCGATTTAA CCGGCAGTAT TGAGCTTAAA
ACCGAGGTAA CGGGCAGCGT GCAGCAACCG AACGCCACCG TTGATTTGCG CGCCCGTAAG
CTAGCCTACG CGCAATGGCA GTTGGCTAAC TTGAGCCTGA ACGGGCGGGT GAAACAGCAG
CAGACATCTG TCTACGATCT TTCCCTGCGT GCCGATCAGC TGACGCAAGA GAATGCCAAA
AAAACCGCTG ATGTGCGTTT GAATAAGCTC ACTTTGGATG CACAAGGCCA GTGGCCGCAA
TTTGTGTTCA ATGCGCAAAG CGCCGATGTT GGAGGCGGCT TCAGTGCAGC GCAGCGTTTT
AGGCTGACTG CGCAGAGCCC GCAGGGGGAA GTGCGCCTGG CGATTGATGG TTTACTGGCT
CAAATTGGGA GTACACCCAT ATTGAGTTGG AGCGGGCAAA TCAAACACCT GGACGTACTG
CCCGCGCGCT TTGAAGGTAA ATCGATACCG TCATGGCATC TTGAAAAACC CGCAGATCTG
ACCTTATCAA AGCAGCAACA AACCTTGGGT AAAACGTGTC TGGCCACGGA TGCGAGTAAA
AAATACCAGT CGGGTCATCT GTGCATTGAC CTAGCCAGAA ACTTGGCGAG TCAGGAAAAA
AGCAAGGGTC AGATGGATGC CGATTTGCCC TTGGCGCTGA TCACGCCCTG GCTGCCGATT
GCCGCTGATT TGCCCGGACG CGTGCGCGTA ACAGCCAATG GGTCGATCAC GGCGGGGCAA
CTGGGCGGCT CGCTGAAATT GAGCTTGCCC GACAGCGAAT TTCGTCTGCC TGACACTTTG
GATAATCAAG CCTACCACTA CAAGAACGTG GATCTGACCG CCCGTGTTCA ATCGGGTGTC
GTGAATGTAG CAGTTGCCGC TGATGTACCG CAATTGCTCA ACATCAAAGG CGGTGGCACC
GTGGGGCTAG CCGGGGCAAA ACCCCTGGCG CTCGATCTTA CGGCAGCGTT GCCCAGCGTG
CGCGTTTTAC AAGGCTTCTT GCCGCAAGTT GCTGGGCTTA AGGGACAGGC GCGTGCCGAT
CTCAAGGTTG CCGGTACGCT GGATCAACCC AAGCCCAGTG GAAAGCTCAC GGTCGATCAA
TTGGCGTTCA CCCTGCCCGA TACCGGAGTG GCTTACGATC AAGGCACGCT CAATGCGCAG
ATCGACAGCA ACGGACAACT TGTTTTTTCC GGTGGTTTGA ACGGTTTGGT GGCCCAAGAC
AGTGCCAATA CAATACCGGC AAACAAGCAG CCATCGGCGG TCGCAAAGGG CCATCTGCGT
ATTCAAGGGA CTGGCGATTT GGCCAAGCTG CCTCAATGGG AGGTGCAGGC GCAGATTCAA
GGGCAAGACG TTCCGGTATT GCGTCTGCCA AGCCTTCTGA TCGATGCCAG TCCTGATTTG
ACGCTGGATG CCAGCAAGGC GGGCGCAAAA ATCGGCGGTT CGATCACACT GCCCACGGTG
ACCGCGCGCA TTGAAAAACT CCCCGACGCG GTGGTCAAAT CAACCAATGA TTTGGTGATT
GTGGGTGAGA AAAAACTTAC CCCCACAACG GCTTATCCCG TAACAGCGGA TATCAAACTG
ATTCTGGGCC AAGCAGTTTC GCTCGCCGGC ATGGGTTTTT CGACCGGTTT GACCGGTACG
CTTAACCTGC GCCTGCGTCC GAACGCGCCC TTGGCTGCAT TCGGCGAAAT TGATTTAATC
AACGGCACGT ACAAAGCCTA TGGGCAAAAT CTGGCCGTCA AACAGGGGCG GTTACAGTTT
GTCGGGCCGT TGGGCGATCC GGGTATTGCG GCCACTGCGC AACGGGTCGT TGGTGACACC
ACGGTCGGGC TGAACATCAC TGGCACGCTG TATCAACCCA AAACAACCGT GTTTTCATCG
CCCTCCTTGC CCGAATCCGA TGCCCTGTCG ATACTGCTGA CTGGTAAGCC CTTGAGCGAT
TCGGGATCGG GGGATCGGGC CATGCTGATG AATGCTATTG CCGGTCTCGG TGTGGCGCAG
GGCAACGATA TCGTGCGCGA CATCGGCCAG AAGTTCGGCT TTGATTCGGT TGGTTTGGAC
ACCTCCGGGG GATTTGGCGA TACGCAGCTT TCCCTAGGTA AACAAATCGG TGACCGTCTC
TTCGTGCGGT ACGCGGTGGG TGTGGTCAAT GGCTTGAGTG AACTCATCAC GCAATACAAA
TTAAGCAATC TGTTTTCAAT CGAAATCACC ACGAGTCCAG ACGCGACCGG CGGTGATCTG
ATCTACCGGA TTCACTGA
 
Protein sequence
MSLEKLHIDT LDGSLCGRGS VDFAPQLTAT IHGQARGLNP ARLAPAAAGQ VGFDYQFSFA 
QKDDKASKPT TPEMQFKLTE LGGHLAKLPF DGLTVDGSMA NQQVSLDISN GTLAGGALKA
KGELGLTGAR PVALSLDLDR GALRDMLAST GVVAEGAIST HLKVNGSLGI DPLRDAQISF
DWSIPSTVLL MPATAANQKT VRVPLSLALH GSFADQRLDV KQAKINLADA SINAKGRVTM
ADLAKNTPVD LTVNAHIPQL AQIPWQAFNL PDLTGSIELK TEVTGSVQQP NATVDLRARK
LAYAQWQLAN LSLNGRVKQQ QTSVYDLSLR ADQLTQENAK KTADVRLNKL TLDAQGQWPQ
FVFNAQSADV GGGFSAAQRF RLTAQSPQGE VRLAIDGLLA QIGSTPILSW SGQIKHLDVL
PARFEGKSIP SWHLEKPADL TLSKQQQTLG KTCLATDASK KYQSGHLCID LARNLASQEK
SKGQMDADLP LALITPWLPI AADLPGRVRV TANGSITAGQ LGGSLKLSLP DSEFRLPDTL
DNQAYHYKNV DLTARVQSGV VNVAVAADVP QLLNIKGGGT VGLAGAKPLA LDLTAALPSV
RVLQGFLPQV AGLKGQARAD LKVAGTLDQP KPSGKLTVDQ LAFTLPDTGV AYDQGTLNAQ
IDSNGQLVFS GGLNGLVAQD SANTIPANKQ PSAVAKGHLR IQGTGDLAKL PQWEVQAQIQ
GQDVPVLRLP SLLIDASPDL TLDASKAGAK IGGSITLPTV TARIEKLPDA VVKSTNDLVI
VGEKKLTPTT AYPVTADIKL ILGQAVSLAG MGFSTGLTGT LNLRLRPNAP LAAFGEIDLI
NGTYKAYGQN LAVKQGRLQF VGPLGDPGIA ATAQRVVGDT TVGLNITGTL YQPKTTVFSS
PSLPESDALS ILLTGKPLSD SGSGDRAMLM NAIAGLGVAQ GNDIVRDIGQ KFGFDSVGLD
TSGGFGDTQL SLGKQIGDRL FVRYAVGVVN GLSELITQYK LSNLFSIEIT TSPDATGGDL
IYRIH
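
The gene and protein sequences above are related by translation with table 11, and the GC content field can be recomputed from the gene sequence. The sketch below is one way to check this without external libraries. It assumes the sequence is pasted in with its blank-separated 10-base groups, and it uses the standard codon-to-amino-acid assignments, which table 11 shares (the two tables differ only in which codons may act as start codons; this gene starts with ATG, so the distinction does not matter here). The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    gc_count = sum(base in "GC" for base in dna)
    return 100.0 * gc_count / len(dna)


# First and last groups of the gene sequence from this record.
print(translate("ATGTCATTGG AAAAATTGCA CATCGATACC"))  # MSLEKLHIDT
print(translate("ATCTACCGGA TTCACTGA"))               # IYRIH
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field.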