Gene Hneap_1059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1059 
Symbol 
ID8534206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1144401 
End bp1146278 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content53% 
IMG OID646383443 
Productvon Willebrand factor type A 
Protein accessionYP_003262942 
Protein GI261855659 
COG category 
COG ID 
TIGRFAM ID[TIGR03503] conserved hypothetical protein TIGR03503 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0405627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGTAC CACCGACAAG CCCGCTCATC AGAATGCATG CATCCACCAT TTGCGGGCTA 
CTTATTCTGC TGATTACCGG GCTGGCTTTC GTACCCGCAC AGGCAGCAAC GCCTCCGGAA
CTTCATGTGT TAATCGACGT TTCCGGCAGC ATGAAGCAAA CCGACCCAAA CAATCTAAGA
CGCCCCGCCT TGCGCTTACT GGGTGATTTG CTGCCTCCCT CTGCGCAAAT TGGCATCTGG
TTTTTCGGTG ACAAAGTCAG CCTTATGTTA AAAACGGCCG GTGCTGATCC AAAAGTAAAG
GAACGCGTCC GTCAAACCGC CAAAAAAATA CGATCCAACG AACCCTTCAC CGATATTCCT
GCTGCCCTAG CCGCGGCTGC CGCCACCTGG AACGATGGCA CGGACCGTAA CATTCTGTTG
TTGTCCGATG GCATGGTTGA TATATCACCG GAAAAAGCCA TTAATGTTCG AGCCCAAGAA
GAACTTTTGC AGAAACTCGT GCCGCAACTC AGGGCCGAAC ATATTCGCGT GCACACGATC
GCCTTGTCCA AAGATGCCGA CAGTAAGCTG CTTTCCCAGA TTGCAGCCGA CACGGGCGGC
ATTTTTGTCG AAGCCGACTC GGCCGACGCA TTGCAACGCG CTTTTCTCAA GATATTCGAA
GCGGCTGCCC CACGTGACGG CCTTCCACTG AAAGACAACA AGTTCCTGGT TGATCGTGCC
GTCAAAGAAC TGACCATCCT CGCGTTCCGA GACAAACCGG AAAACCAGAC AAAACTCAAG
CTACCCGATG GTCAAATCGT TGACGCCCAA GGCAGCAGGT CCAAAATGGG TTGGCGTTGG
GATGACAGTG GCGGCCGTGA TCTGGTTACC ATCGAAAATC CGCCGGCTGG AGCGTGGCAG
ATAATCGGCG GACTCGATCC GGACAACCGA GCGCTGATCA TCACTGATCT CAAACTTCAC
CTGACGCCCT TGCCGACGCG TATCTACCCG GGGGAACGGA TCGATGGCGC GCTGATGCTG
ACCAATCATG ATCAACCGAT CACCAAGGCC GCATTAACGC AGACAATCAA CGCAACGATC
GATGAACAAA AAGGAAGCGA ACTGATTCAG TCGATCAAAC TTAATGATCA GGGCGCAGAC
CCGGACATTA TCGGCGGTGA TGGAAAATTC AACTATCAGC TCCATCTGAT TGATCCTGCG
GGTATTTACA GCCTGGTCGC CACGGCAAGC AGCCCGACCT TCCAACGTGA TTGGCGCCAG
AATTTTGCTA TGGCACCGTT GCCACCAGTG CAACTCAAAC TCGTTTCATC CGTAGTCGAA
GTATCACCAG CCGAATCTTC GAATAGCCCG GATGGACACC CCGCCCAACC CGTCAAAAAG
ACCCTGCGTC AGATTCAAGT CACTCAGGAT CCCACGGTAC TTGAACCTAA TACGGCCAAA
TTAATCGGAC AATGGCAATG CGAACGGCCC CATTCAAATA AGCCCGATGC AAATAAACCC
GATTCAAATA AATATGGCGC GCCCATACCC ATCGAATGGC ATTTAACCGA CACCAGCATG
ATGTTCCCTG CGCCCGATGA GACGACGGCA GACTGTGTGC TGAACGCTGT GCTTAAAGCA
AAACTGACGA CGCACAGAGA CATCGATCTG GTGCTTGAAC CATTCAAGCT ACCCGCGCTA
TCTCACCCCA TGCCACAACC CAAGCAGGCA GAACCTGTCC AACAGGCGCA AGAGGGTGCT
TCGGGTAGGA TCAACTGGAT GCTCATCATC GTTATCAATG CCGTGGCGCT GCTCCTGATC
GGACTGGGGA CATGGCTTTG GAAACGAAAA GTGAAACGTA CACACCGCCA ACTGCTTGAA
GAGGCACAAT CAGTATGA
 
Protein sequence
MPVPPTSPLI RMHASTICGL LILLITGLAF VPAQAATPPE LHVLIDVSGS MKQTDPNNLR 
RPALRLLGDL LPPSAQIGIW FFGDKVSLML KTAGADPKVK ERVRQTAKKI RSNEPFTDIP
AALAAAAATW NDGTDRNILL LSDGMVDISP EKAINVRAQE ELLQKLVPQL RAEHIRVHTI
ALSKDADSKL LSQIAADTGG IFVEADSADA LQRAFLKIFE AAAPRDGLPL KDNKFLVDRA
VKELTILAFR DKPENQTKLK LPDGQIVDAQ GSRSKMGWRW DDSGGRDLVT IENPPAGAWQ
IIGGLDPDNR ALIITDLKLH LTPLPTRIYP GERIDGALML TNHDQPITKA ALTQTINATI
DEQKGSELIQ SIKLNDQGAD PDIIGGDGKF NYQLHLIDPA GIYSLVATAS SPTFQRDWRQ
NFAMAPLPPV QLKLVSSVVE VSPAESSNSP DGHPAQPVKK TLRQIQVTQD PTVLEPNTAK
LIGQWQCERP HSNKPDANKP DSNKYGAPIP IEWHLTDTSM MFPAPDETTA DCVLNAVLKA
KLTTHRDIDL VLEPFKLPAL SHPMPQPKQA EPVQQAQEGA SGRINWMLII VINAVALLLI
GLGTWLWKRK VKRTHRQLLE EAQSV