Gene Information |
Locus tag | Hneap_1987 |
Symbol | |
ID | 8535146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2127020 |
End bp | 2129941 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646384369 |
Product | Protein of unknown function DUF2339, transmembrane |
Protein accession | YP_003263856 |
Protein GI | 261856573 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
|
|
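The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither is stated in the record, but both agree with the listed values. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(2127020, 2129941)
print(length)                     # 2922
print(protein_length_aa(length))  # 973
```

Because the gene is on the minus strand, Start bp and End bp give the lower and upper replicon coordinates, not the direction of transcription. The length calculation is the same for either strand.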
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0361094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGATTA CACTCACGTT TGTGGGGCTG ATTTTCGGGA TGGCGATGGC GGGAATCTGG GGCGCGGCGT TTGGCGCACT GACCGGCTTT CTCGTAGCGC AGGTCAGTCG GTTGAATCGG CAGGTTCAGG CCTTGATCGC CGATCAGATA TTGTTGCGAG ATGAATTGCG TCATCTGTCT CAACCTCCGG CGCAGCGCGC GTCGCCATCC GAAGCGTCGA ATCCGCCCGC CGCGCCCGAA CCCGCCGTGG ACGAAGTGGC GTCGTGCGCA CCCCAATCTA AACCCGATCT TGTGCCATCG CACGAACCTT TGCCGGAATC TTCATCCGTG CCATTGAAGA CGGTTGGCGA GTCGCCTGTG ATCGCGCGAA CCAACGCTTC GTTGCCTGAA AAAGACCCTG AAAACGGTGC ATCGGTCTCG GCCTGGGGCG CGCCCTCGTC GGCCGAAACC GGAGCGCCAG ATGGCTTGTC GCGCCTGTGG TCGAGCGTGT ACCGATTCCT CACCGAGGGC AATGTGGTCG CCAAGATCGG GGTGATCGTG CTGTTTTTCG GCTTGGCCTT CCTGCTGAAA TACGCCGCCG ATCAGGCCCT GTTCCCGATC AGCGTCCGCT TGACGCTTGT GGGTATCGGC GGGTTGGTGC TGCTCGGCAT CGGCTGGTAT TTGCGTGAGC GACATACCGG TTATGCCCTT GTTTTGCAGG GCGGGGGCAT TGGTCTGACG TATCTCACGT TATACGCCGC ATTTCGTCTG TACGGGTTGT TGCCGGCGGG CGTAACGATG GGGCTGATGC TTCTGGTTGT GGCCGCTGCC GCAGTGCTGG CGGTCGTTCA GGATGCCCGG AGCCTTGCCG TGCTGGGGAT TATTGGCGGT TTTCTTGCGC CGATTCTGGC GGGTAGCGAC AGCGGAAGGC ATGTCGATCT GTTCAGTTAT TACCTTGTGC TCGATTTCGG CATCGTGTTC GTCGCCTGGC GCAAGGCGTG GCGTGAACTC AACCTGCTGG CCTTTTTGTT TACGTTCGTG ATCGGCACAA TCTGGGGTGG GTTGAACTAT AAGCCTGCGC TGTTTTCGAC AACCGAACCT TTCCTCATCG GTTTTTATCT GATCTTTCTG GCGACAGCCC TATTGTTTGC GCGTCAGCAA CGGGCAGGCG GTCAGCGTGA TTATGTGCAA TCCACCCTCG TGTTCGGCCC GCCCCTGGTC GGGTTTGGGC TGCAAGCCGC TCTGGTCCAA AATTTCGAAT ACGGTCTGGC CTGGAGCGCA TTCGGCTTGG GCGCGCTCTA TCTTGTGTTG TGGCTGGGGT TGCGGCGCGC TGTCGGTGAG TATTTCAAGA TTCTGAACGA TGCTTTCCTG CTATTGGGCC TCGGTTTTGT TTCTTTGGCC GTGCCGTTCG CGTTCGATGG GCAATGGACC AGCACCACCT GGGCGTTGGA AGGCGCGGCC ATGCTTTGGG TCGGATTGCG GCAGGGGAAA ACTTGGCCGG TTGTGTTTGG CCTTTTGCTG CAACTCGGTG CCGGCGTGGC GTTTGGCGAC GATCCTTCAT CGCTCGACCC GACACACTGG CCTTTGCTGG ATGGCTATTT CCTGAGTGGC GGTTTGATTG CGCTTTCCGG GTTGGCGAGC GCGTATTTGT TGCGTGATTG GCGAAACTGG GTGCCGGTCC CAGCCCTGTT GACCCTTTGG GGATTGGCCT GGTGGTTCGG CACCGGTTTC TACGATCTGG CACATGTGGC GAGCTGGCTT CAGCCGTTCA CGCTGTGGCT GATGTTCGCC AGTGCTTCGA TGCTGCTGGT CCAATGGGTG 
CGCGGGCGGC TGCAGGACTG GTCGATTCTG CGCTATACGC TCGCGCTGCA AACCGTCTGG ATGTGGGCGC TGGGCGGCTT GATTTTGCTG TTGAATATGA GCCCGTTCCA CGAAGGAGGC TGGTTTGCAT GGCTATTGGC GTTCGCCACA CTCTATGGCG GGCTCTACTG GAGCGAGCGT CGAAGCGAAT CCGTTTTCGC ATCCGAGCGC TTGCATGGTT TGGGCTTGTG GCTGCTGGCG CCTGTGTTGG CGCCCCAATT GGCGGATGCG ATCTTCCGGG GACTGTTCGG GTTTACGCTG TATTTCGGCT GGTTCGATTT GGGGCCAAGG GGCATCATGC ACCCGAACAC GCCCGGCGTT TGGACGGCCA TGAGTTGGGG ATTGATACCG GTGTTGCTGC TGAGTTGGGT TGGTTCCGCC CGTCACTGGC CCTTCGCTGA ACGATTTGGC CATGCGGCTG ACTATCGTGG ATGGGTCGCA TCGGGATTGG GTGTGTTCTT ACTGGGTTGG ATGTTTATCG TCCACGGGCT GTGGGTGACT GATCCTGTTA TGGGTGCAGG CTGGGGCCAA CCAGCACAGA TCGGTTATTT GCCATTATTC AATGCACTCG ATTTTGTTTC TGCTCTGGCC CTGTTCGCCC TATGGCGACA TGGAAGACTG ACCGGTGCGT ATTTCCTGAA CTATGCCGGT GAGCGAACAC AACAGGTGCT GCATTGGTTG ATGGGCGCTG CCGCGTTTGT CTGGCTCAAT GCCATGATTG CCCGCAGCCT GAATGCGTAC GCCGGGCTGC CGCTCGATGA CGGGCGGTTC CTGCACGAGG CGCTGGCGCA AACGACATAC TCTATTGCCT GGTCGTTGTT AGGCTTGATA CTGATCGTGC TGGCTAGCCG ACTGAAGCAA CGTCGTTTAT GGTTGGTGGC CGCCGGGTTG CTGGGCGTTG TGGTGCTCAA ACTTTTTCTG GTCGATTTGT CCGGCAGCGA TACACTCGCC CGGATCATCT CGTTTGTCGG CGTGGGCGTG CTGCTCTTGC TGGCAGGTTA TATCGCCCCG ATACCGGCCA AACAGGCCCC ATTTGCCAAT GATGACCAGA ACACAAATGA CTCCGAACAG AAGGCCGACT GA
|
Protein sequence | MMITLTFVGL IFGMAMAGIW GAAFGALTGF LVAQVSRLNR QVQALIADQI LLRDELRHLS QPPAQRASPS EASNPPAAPE PAVDEVASCA PQSKPDLVPS HEPLPESSSV PLKTVGESPV IARTNASLPE KDPENGASVS AWGAPSSAET GAPDGLSRLW SSVYRFLTEG NVVAKIGVIV LFFGLAFLLK YAADQALFPI SVRLTLVGIG GLVLLGIGWY LRERHTGYAL VLQGGGIGLT YLTLYAAFRL YGLLPAGVTM GLMLLVVAAA AVLAVVQDAR SLAVLGIIGG FLAPILAGSD SGRHVDLFSY YLVLDFGIVF VAWRKAWREL NLLAFLFTFV IGTIWGGLNY KPALFSTTEP FLIGFYLIFL ATALLFARQQ RAGGQRDYVQ STLVFGPPLV GFGLQAALVQ NFEYGLAWSA FGLGALYLVL WLGLRRAVGE YFKILNDAFL LLGLGFVSLA VPFAFDGQWT STTWALEGAA MLWVGLRQGK TWPVVFGLLL QLGAGVAFGD DPSSLDPTHW PLLDGYFLSG GLIALSGLAS AYLLRDWRNW VPVPALLTLW GLAWWFGTGF YDLAHVASWL QPFTLWLMFA SASMLLVQWV RGRLQDWSIL RYTLALQTVW MWALGGLILL LNMSPFHEGG WFAWLLAFAT LYGGLYWSER RSESVFASER LHGLGLWLLA PVLAPQLADA IFRGLFGFTL YFGWFDLGPR GIMHPNTPGV WTAMSWGLIP VLLLSWVGSA RHWPFAERFG HAADYRGWVA SGLGVFLLGW MFIVHGLWVT DPVMGAGWGQ PAQIGYLPLF NALDFVSALA LFALWRHGRL TGAYFLNYAG ERTQQVLHWL MGAAAFVWLN AMIARSLNAY AGLPLDDGRF LHEALAQTTY SIAWSLLGLI LIVLASRLKQ RRLWLVAAGL LGVVVLKLFL VDLSGSDTLA RIISFVGVGV LLLLAGYIAP IPAKQAPFAN DDQNTNDSEQ KAD
|
| |
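The gene sequence is shown in coding orientation: it begins with ATG and ends with TGA, even though the gene lies on the minus strand. The protein sequence can therefore be reproduced by translating it directly. The following is a minimal sketch, not the database's own procedure. Translation table 11 assigns codons to amino acids in the same way as the standard code and differs only in the permitted start codons. That difference does not matter here, because this gene starts with ATG. The name `translate` is illustrative, and the function assumes the input contains only A, C, G and T plus whitespace.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGATGATTA CACTCACGTT TGTGGGGCTG"))  # MMITLTFVGL
print(translate("AAGGCCGACT GA"))                     # KAD
```

The two fragments give the first ten and last three residues of the listed protein sequence. Passing the full gene sequence, with the line break removed, should yield the full 973-residue protein under the same assumptions.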