Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1750 |
Symbol | |
ID | 8534908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1877815 |
End bp | 1880892 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646384132 |
Product | protein of unknown function DUF490 |
Protein accession | YP_003263620 |
Protein GI | 261856337 |
COG category | [S] Function unknown |
COG ID | [COG2911] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTGG AAAAATTGCA CATCGATACC CTCGATGGAT CGCTCTGTGG GCGGGGCAGC GTGGATTTTG CCCCCCAGCT CACAGCCACT ATTCACGGGC AGGCAAGGGG TTTGAATCCG GCAAGGCTTG CGCCCGCGGC GGCGGGGCAG GTGGGCTTTG ATTACCAGTT TTCATTTGCC CAAAAAGACG ACAAGGCAAG TAAGCCCACA ACGCCTGAGA TGCAATTTAA GCTCACTGAG CTTGGAGGGC ATCTAGCCAA GCTGCCCTTC GATGGGCTAA CGGTGGATGG TTCGATGGCG AATCAACAGG TCTCGCTGGA TATCAGCAAC GGGACATTGG CGGGCGGTGC GCTCAAGGCA AAGGGCGAAC TGGGCTTGAC CGGCGCACGG CCCGTGGCGC TCTCGCTGGA TTTGGATCGA GGAGCCCTGC GCGATATGTT GGCATCAACT GGCGTGGTCG CCGAGGGAGC GATCAGCACG CATCTTAAAG TGAACGGTTC ATTGGGTATC GATCCGTTGC GGGATGCTCA AATAAGTTTT GACTGGTCAA TACCGAGCAC GGTGCTGCTT ATGCCTGCAA CTGCGGCTAA TCAAAAAACG GTTCGGGTGC CGTTGTCATT GGCGTTACAC GGCAGTTTTG CCGATCAGCG GCTTGATGTG AAGCAGGCAA AAATCAATCT GGCCGATGCG TCCATAAATG CCAAAGGCCG CGTGACGATG GCGGATTTGG CGAAAAATAC GCCCGTCGAT TTGACCGTGA ATGCCCACAT CCCGCAACTC GCGCAGATTC CCTGGCAGGC ATTCAATCTG CCCGATTTAA CCGGCAGTAT TGAGCTTAAA ACCGAGGTAA CGGGCAGCGT GCAGCAACCG AACGCCACCG TTGATTTGCG CGCCCGTAAG CTAGCCTACG CGCAATGGCA GTTGGCTAAC TTGAGCCTGA ACGGGCGGGT GAAACAGCAG CAGACATCTG TCTACGATCT TTCCCTGCGT GCCGATCAGC TGACGCAAGA GAATGCCAAA AAAACCGCTG ATGTGCGTTT GAATAAGCTC ACTTTGGATG CACAAGGCCA GTGGCCGCAA TTTGTGTTCA ATGCGCAAAG CGCCGATGTT GGAGGCGGCT TCAGTGCAGC GCAGCGTTTT AGGCTGACTG CGCAGAGCCC GCAGGGGGAA GTGCGCCTGG CGATTGATGG TTTACTGGCT CAAATTGGGA GTACACCCAT ATTGAGTTGG AGCGGGCAAA TCAAACACCT GGACGTACTG CCCGCGCGCT TTGAAGGTAA ATCGATACCG TCATGGCATC TTGAAAAACC CGCAGATCTG ACCTTATCAA AGCAGCAACA AACCTTGGGT AAAACGTGTC TGGCCACGGA TGCGAGTAAA AAATACCAGT CGGGTCATCT GTGCATTGAC CTAGCCAGAA ACTTGGCGAG TCAGGAAAAA AGCAAGGGTC AGATGGATGC CGATTTGCCC TTGGCGCTGA TCACGCCCTG GCTGCCGATT GCCGCTGATT TGCCCGGACG CGTGCGCGTA ACAGCCAATG GGTCGATCAC GGCGGGGCAA CTGGGCGGCT CGCTGAAATT GAGCTTGCCC GACAGCGAAT TTCGTCTGCC TGACACTTTG GATAATCAAG CCTACCACTA CAAGAACGTG GATCTGACCG CCCGTGTTCA ATCGGGTGTC GTGAATGTAG CAGTTGCCGC TGATGTACCG CAATTGCTCA ACATCAAAGG CGGTGGCACC GTGGGGCTAG CCGGGGCAAA ACCCCTGGCG CTCGATCTTA CGGCAGCGTT GCCCAGCGTG CGCGTTTTAC AAGGCTTCTT GCCGCAAGTT GCTGGGCTTA AGGGACAGGC GCGTGCCGAT CTCAAGGTTG CCGGTACGCT GGATCAACCC AAGCCCAGTG GAAAGCTCAC GGTCGATCAA TTGGCGTTCA CCCTGCCCGA TACCGGAGTG GCTTACGATC AAGGCACGCT CAATGCGCAG ATCGACAGCA ACGGACAACT TGTTTTTTCC GGTGGTTTGA ACGGTTTGGT GGCCCAAGAC AGTGCCAATA CAATACCGGC AAACAAGCAG CCATCGGCGG TCGCAAAGGG CCATCTGCGT ATTCAAGGGA CTGGCGATTT GGCCAAGCTG CCTCAATGGG AGGTGCAGGC GCAGATTCAA GGGCAAGACG TTCCGGTATT GCGTCTGCCA AGCCTTCTGA TCGATGCCAG TCCTGATTTG ACGCTGGATG CCAGCAAGGC GGGCGCAAAA ATCGGCGGTT CGATCACACT GCCCACGGTG ACCGCGCGCA TTGAAAAACT CCCCGACGCG GTGGTCAAAT CAACCAATGA TTTGGTGATT GTGGGTGAGA AAAAACTTAC CCCCACAACG GCTTATCCCG TAACAGCGGA TATCAAACTG ATTCTGGGCC AAGCAGTTTC GCTCGCCGGC ATGGGTTTTT CGACCGGTTT GACCGGTACG CTTAACCTGC GCCTGCGTCC GAACGCGCCC TTGGCTGCAT TCGGCGAAAT TGATTTAATC AACGGCACGT ACAAAGCCTA TGGGCAAAAT CTGGCCGTCA AACAGGGGCG GTTACAGTTT GTCGGGCCGT TGGGCGATCC GGGTATTGCG GCCACTGCGC AACGGGTCGT TGGTGACACC ACGGTCGGGC TGAACATCAC TGGCACGCTG TATCAACCCA AAACAACCGT GTTTTCATCG CCCTCCTTGC CCGAATCCGA TGCCCTGTCG ATACTGCTGA CTGGTAAGCC CTTGAGCGAT TCGGGATCGG GGGATCGGGC CATGCTGATG AATGCTATTG CCGGTCTCGG TGTGGCGCAG GGCAACGATA TCGTGCGCGA CATCGGCCAG AAGTTCGGCT TTGATTCGGT TGGTTTGGAC ACCTCCGGGG GATTTGGCGA TACGCAGCTT TCCCTAGGTA AACAAATCGG TGACCGTCTC TTCGTGCGGT ACGCGGTGGG TGTGGTCAAT GGCTTGAGTG AACTCATCAC GCAATACAAA TTAAGCAATC TGTTTTCAAT CGAAATCACC ACGAGTCCAG ACGCGACCGG CGGTGATCTG ATCTACCGGA TTCACTGA
|
Protein sequence | MSLEKLHIDT LDGSLCGRGS VDFAPQLTAT IHGQARGLNP ARLAPAAAGQ VGFDYQFSFA QKDDKASKPT TPEMQFKLTE LGGHLAKLPF DGLTVDGSMA NQQVSLDISN GTLAGGALKA KGELGLTGAR PVALSLDLDR GALRDMLAST GVVAEGAIST HLKVNGSLGI DPLRDAQISF DWSIPSTVLL MPATAANQKT VRVPLSLALH GSFADQRLDV KQAKINLADA SINAKGRVTM ADLAKNTPVD LTVNAHIPQL AQIPWQAFNL PDLTGSIELK TEVTGSVQQP NATVDLRARK LAYAQWQLAN LSLNGRVKQQ QTSVYDLSLR ADQLTQENAK KTADVRLNKL TLDAQGQWPQ FVFNAQSADV GGGFSAAQRF RLTAQSPQGE VRLAIDGLLA QIGSTPILSW SGQIKHLDVL PARFEGKSIP SWHLEKPADL TLSKQQQTLG KTCLATDASK KYQSGHLCID LARNLASQEK SKGQMDADLP LALITPWLPI AADLPGRVRV TANGSITAGQ LGGSLKLSLP DSEFRLPDTL DNQAYHYKNV DLTARVQSGV VNVAVAADVP QLLNIKGGGT VGLAGAKPLA LDLTAALPSV RVLQGFLPQV AGLKGQARAD LKVAGTLDQP KPSGKLTVDQ LAFTLPDTGV AYDQGTLNAQ IDSNGQLVFS GGLNGLVAQD SANTIPANKQ PSAVAKGHLR IQGTGDLAKL PQWEVQAQIQ GQDVPVLRLP SLLIDASPDL TLDASKAGAK IGGSITLPTV TARIEKLPDA VVKSTNDLVI VGEKKLTPTT AYPVTADIKL ILGQAVSLAG MGFSTGLTGT LNLRLRPNAP LAAFGEIDLI NGTYKAYGQN LAVKQGRLQF VGPLGDPGIA ATAQRVVGDT TVGLNITGTL YQPKTTVFSS PSLPESDALS ILLTGKPLSD SGSGDRAMLM NAIAGLGVAQ GNDIVRDIGQ KFGFDSVGLD TSGGFGDTQL SLGKQIGDRL FVRYAVGVVN GLSELITQYK LSNLFSIEIT TSPDATGGDL IYRIH
|
| |