Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1309 |
Symbol | |
ID | 8534465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1417441 |
End bp | 1420743 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646383701 |
Product | Protein of unknown function DUF2126 |
Protein accession | YP_003263191 |
Protein GI | 261855908 |
COG category | [S] Function unknown |
COG ID | [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.544002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACTC GCGTTGCGCT GCATCACAAA AGCCGGTACC TGTTCGACCG GTCAGTCGTA CTTTCACCGC ATGAGATCCG ACTGCGTCCG GCCGCCCATT GCCGCACCCC GATTTTGGGT TACTCCCTGC AGGTATTGCC CGGAAAGCAC TTCGTTAACT GGCAGCAAGA CGTTTATGGA AACTTCGTCG CCCGTCTGAC ATTCACCGAG CCAACAGTCG CATTGGATGT GACGGTCGAT TTGATCGTCG ATATGACCGT CATCAATCCG TTTGATTTCT TTGTAGAGCC ATGGGCTGAA CATTATCCGT TTGCCTATCC CGATGTGATG AAGGCTGAAT TGGCACCCTT TCTTGAGGCG GCGCCACTCG ATAAGCACTT CAGGAAATGG TTGGCAGAAT TTACGCAGGC CATGCCGAAA GATTTGACGA TCAATGATTT TCTGGTTCGG ATCAATCAAA AAGTTCAGAG TAGCGTCTCC TATCTCATTC GGATGGAACC GGGGGTGCAA ACGCCGGAAG AGACGCTCTC CAACGGTTCC GGATCCTGTC GTGATAGCGC CTGGCTTCTG GTGCAGGCGC TTCGCCACCT CGGTATTGCG GCGCGTTTTG TCTCCGGTTA TCTGATCCAG CTGGTGGCAG ATCAGCCTTC GCTTGATGGC CCGAGTGGGC CGACGGCTGA TTTTACGGAT CTGCACGCCT GGTGCGAGGC ATTTATCCCG GGGGGCGGTT GGATCGGTCT GGATGCTACA TCCGGCCTGT TGGCTGGCGA GGGGCACATC CCGCTAGCGG CCACAGCACA ACCTACCTCG GCGGCGCCCA TTACAGGCAT GACGAGCGTT TGTGAATCCG AGCTGATCGT TGAGATGAGT GTCACTCGGA TTCATGAAGA TCCGCGTGTC ACCAAACCGT ATACCGAAAC CCAATGGCAG GCGATTCAGA GCCTGGGCTA CTGGGTTGAT TCCGAACTGC AAGCCAACGA TGTCCGTCTG ACCCAAGGGG GCGAACCGAC GTTCGTATCC ATTGATAACA TGGATGCGCC GGAATGGAAT ACCGATGCAC TGGGTGATCA GAAATGGGTG CTTGCTCAGG ATTTGATGGC GCGTTTGAGC CGGCAGTTTG CACAGGGCGG GGTGCGTTAT TACGGTCAGG GAAAATGGTA TCCCGGTGAG CCGTTGCCAC GCTGGGCGCT CAGTGTTTTC TGGCGTACCG ATGGCGTGCC CGTCTGGCAC GATCCTGAGT TGATGGCAGA AAATCCGAAA CCGGATGCCC AACCAATTCC CAAGGCTCGG GCATATCACT TCGCCCATCT GTTGGCGTCG CGTTTGCAGC TTTCGACCGA TTACGTGATT CCGGCTTATG AAGATCCATT GCTGGCGCTG TCCATGGAAA CCGCATTGCC GGTGAATCTC GATCCCATGA AAATTGATCT CAAGGATACG GGCAAGCGAG GGCAGGTTTC GCGCAAGTTG CAATCAGGAT TGGGTGATAT CGTCGGTTAT GTTCTGCCGC TCAAAGCGCT GGATTCCAGC AAAACCCAGT GGGCCTCAAG CCTTTGGCCA TTGCGCACCG AACGTCTCTT CCTGTTGGGC GGTGATTCAC CGCTGGGCCT GAGATTACCG CTCGACAGCC TGCCTTGGGT TCATCCCAAA GCGCAACCGG TCAATTTTCC GGTTGATCCG TTCGCATCGC GTCAGGTTTT GGGTGATTAT CCGCGCACGC CTGCTTCGCT CAAACTCAAT CCAGCCGAAC CGCCACCCGC GCAAATCAAC CCGGAAGAAG TCATTCACAC TGCATTGGCA CTGGAAGTTC GGAATGGTGT GCTGTACGTT TTCATGCCGC CGATCACTAC GTTGGAGGCA TGGCTGGAAC TGGTGTCTGC CATTGAATTC TGTGCGAAAA CGCTGAAACA GCCGATACGG ATCGAGGGCT ACACGCCGCC ACGGGATCCA CGCTTGCAGT CGCTCTCTGT TACCCCAGAT CCCGGCGTCA TCGAGGTCAA TATTCACCCG GCCAGTGACT GGTCGACGCT CGAGCAGAAC ATGCACCTGC TTTACGAATC GGCGCGCCTC GCCCGTCTGG GCGCAGAGAA GTTCATGCTG GATGGCCGGC ATACGGGAAC AGGTGGTGGC AATCATGTCA CATTGGGCGG TGCAACTCCG GCCGATAGTC CGTTTCTGCG CCGCCCGGAC TTGCTCAAGA GCCTGCTGAC CTATTGGCAA CAGCATCCGG CGCTGTCCTA TTTGTTCTCG GGTCAGTTCA TTGGCCCGAC GAGTCAGGCA CCGCGTGTCG ATGAAGCTCG CGACGACACC TTAGGTGAAC TGGAAATTGC CTTCCAGCAT CTTGATCTGG CCTTTCCGTC CGGTACGGAG TCCGATCAGC CTTGGCTGAT TGATCGGCTG TTGCGACATC TGCTGGTTGA TTTAACCGGC AATACGCACC GTGCCGAATT CTGCATCGAC AAGCTGTATT CCCCGGATTC TTCCACTGGT CGCTTGGGTT TGCTTGAGTT GCGAGCGTTT GAAATGCCGC CACACCAGCG CATGAACCTC GTTCAATCTT TGCTGTTACG CGCACTTGTG GCGCGTTTCT GGAAAACACC GTTGCATAGT CGTCTCGTCG ATTGGGGAAC ATCGCTGCAT GATCGCTTCA TGCTTCCGCA TTTTATAGAG CAGGATATGC GCGATATCTG TACCGACCTG CGCGAATCAG GCTATGCGTT TGACGATGCG TGGTTTTTGC CGTTCCTCGA ATTCCGCTTT CCTCGATATG GCAGCGTGAC CTACGACGGC GTGATCATTG AAATTCGTCA AGCCATAGAG CCTTGGCACG TGCTGGGTGA AGAAATGGCA GCCGGTGGCA CGGCGAGATA TGTCGATTCA TCCATTGAGC GCGTGCAAAT TAAGGTGCAA AACTTGATCG GCAATCGGCA TTACGTGACC TGCAACGGGC GGCGAGTGCC GCTGCACCCA ACGGGCGTGC CGGGTGAGTT CGTCGCGGGC GTTCGTTTTA AGGCCTGGGC GCCGTATTCG GCACTGCATC CGACCATCGG AGTGCAGGCA CCGCTTAGGT TCGATCTGGT CGATGGCTGG AATAACCGGG CCCTGGGCGG TTGTACCTAT CATGTCAGCC ATCCGGGTGG TCGAAGCTAC GATACTTTCC CGATCAATGC CTTGGAGGCG GAAGCTCGAC GTCGCTCACG CTTTTGGGAT CATGGTCACA CACCGGGCCA GTTCACGGTG CCGCAGGAAC AGATCAATCC ACGGTTCCCG CTTACGCTCG ATTTGCGTTG GCAGCCCAGT TAA
|
Protein sequence | MTTRVALHHK SRYLFDRSVV LSPHEIRLRP AAHCRTPILG YSLQVLPGKH FVNWQQDVYG NFVARLTFTE PTVALDVTVD LIVDMTVINP FDFFVEPWAE HYPFAYPDVM KAELAPFLEA APLDKHFRKW LAEFTQAMPK DLTINDFLVR INQKVQSSVS YLIRMEPGVQ TPEETLSNGS GSCRDSAWLL VQALRHLGIA ARFVSGYLIQ LVADQPSLDG PSGPTADFTD LHAWCEAFIP GGGWIGLDAT SGLLAGEGHI PLAATAQPTS AAPITGMTSV CESELIVEMS VTRIHEDPRV TKPYTETQWQ AIQSLGYWVD SELQANDVRL TQGGEPTFVS IDNMDAPEWN TDALGDQKWV LAQDLMARLS RQFAQGGVRY YGQGKWYPGE PLPRWALSVF WRTDGVPVWH DPELMAENPK PDAQPIPKAR AYHFAHLLAS RLQLSTDYVI PAYEDPLLAL SMETALPVNL DPMKIDLKDT GKRGQVSRKL QSGLGDIVGY VLPLKALDSS KTQWASSLWP LRTERLFLLG GDSPLGLRLP LDSLPWVHPK AQPVNFPVDP FASRQVLGDY PRTPASLKLN PAEPPPAQIN PEEVIHTALA LEVRNGVLYV FMPPITTLEA WLELVSAIEF CAKTLKQPIR IEGYTPPRDP RLQSLSVTPD PGVIEVNIHP ASDWSTLEQN MHLLYESARL ARLGAEKFML DGRHTGTGGG NHVTLGGATP ADSPFLRRPD LLKSLLTYWQ QHPALSYLFS GQFIGPTSQA PRVDEARDDT LGELEIAFQH LDLAFPSGTE SDQPWLIDRL LRHLLVDLTG NTHRAEFCID KLYSPDSSTG RLGLLELRAF EMPPHQRMNL VQSLLLRALV ARFWKTPLHS RLVDWGTSLH DRFMLPHFIE QDMRDICTDL RESGYAFDDA WFLPFLEFRF PRYGSVTYDG VIIEIRQAIE PWHVLGEEMA AGGTARYVDS SIERVQIKVQ NLIGNRHYVT CNGRRVPLHP TGVPGEFVAG VRFKAWAPYS ALHPTIGVQA PLRFDLVDGW NNRALGGCTY HVSHPGGRSY DTFPINALEA EARRRSRFWD HGHTPGQFTV PQEQINPRFP LTLDLRWQPS
|
| |