Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2221 |
Symbol | |
ID | 8535385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2390123 |
End bp | 2392504 |
Gene Length | 2382 bp |
Protein Length | 793 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646384601 |
Product | TonB-dependent receptor |
Protein accession | YP_003264083 |
Protein GI | 261856800 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0217178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGATA TGCCCCATTT TTCGCACGAT ACCTCAGCTT TTATTTACTT TTATATCAAT GCGTTACGTG TTGGCAACAA ACTTGCTCAT GGGTCTGGCT TTCTTTTTAA CCAGCTCATA AAACCAACAG TAACAGTCAC GATGAAATTC AAATTAAATC CGCTCGCGAT TCTGATTGCC GCCGCCCCAT TGACGGCGTT TGCCGCAACC GAACTGCCGC CGGTTGAGGT CACCTCGACC AAAGCGGCGA CCGTACCTGC CGGGGCCCAA GAACTCACCT CGGAACAAAT CAAGGCTGCC CGGGCCGCGA CCAGCGATTC GGCCCGTCTG TTGGCCGATT TCCCCGGTGT GAACCTATTC AGCGCGGGCG GTGTGTCTGC CCTGCCGATC ATTCATGGTC TTGCCGATGA TCGCCTGCGC ACGCAGGTCG ATGGCATGGA TTTGATCGCC GCCTGCCCCA ATCATATGAA TTCGCCGCTG TCGTACATCG ATCCGACCAA TGTCGGTCGG GTGCGTGTGT ATAGCGGCGT GGTGCCGGTT TCCGTCGGCG GGGACAGCAT CGGCGGCACG ATTGCAGTCG ACTCGCCCGC GCCCGTCTTC GCTCAGCCCG GCGAAGGCAC GATCACCCAA GGTGAAATCG GCGGGTTTTA CCGCAGCAAT GGCAGCGCCT ATGGCGGCAA TCTGGCCGCG ACGCTGGCCA CGCGCAATTT CAGCCTGAGC TATCGGGGTT CGAAGGCACA ATCCGACAAC TACAAGGCAG GCGGCGATTT CAAGCCGGGG GGCCTGTCTT CCGCCGATCC GAAGGATATT TCCACCGGAA TCAGCCACTG GCTTGAAGGC GACACGGTCG GCTCTTCGGC CTACAAATCG CAGAATCACG AACTGAGCGC CGCCTATCAA AACGATGCGG GCACGCAGAT GGTCGAACTG GGTGTCGGCA TCCAGCGTAT TCCCTACGAG AACTATCCCA ATCAGCGCAT GGACATGACG CAGAACAACA GCACGCAGTT CAACCTGCAT TACAAGGGGC TGTACGACTG GGGCACGCTG GACGCCCGCG CCTATAATCA GCACGTGCGT CATGAAATGG ACTTTAACGA CGACCGCCAG TTCTGGTACA TGACCGCGGC GGGCATGCCG ATGAACACCG AAGGTAAGAA CACCGGCGCC AAGCTGCAAG CCAATATCGA TCTCACAACG CGAGACGTTT TGAAGCTCGG CGCGGAATTG CAGAAATACC GCCTGGATGA CTGGTGGCCA CCATCCGGCA CGGGCGGCAT GTCACCCGGC ACGTTCTGGA ACATCAACGA CGGCCAGCGC GACCGCTTCG ACCTGTTTGC CGAATGGGAC GCGCACTGGA CGCCGCAATG GATGACGCAG ATTGGCGTGC GCAGCGATAC CGTCCGCATG AATGCAGGCC CGGTTTCCGG TTACAGCACC CAGATGGGCG TCACCAGCCA GAGTATGACA AGCTATGGGA TGACGGCCAA GGATGCCGCC ACCTTCAATG CCAGCGACCG CGAAAAAACC GATCACAACC TCAACTTCAG CGCGCTGGCT CGCTACACGC CGGATGCGAC CCAATCCTAC GAATTCGGCT TTGCGCGTCA GGCCCGCTCG CCGAATCTGT ACGAGCGCTA CAGTTGGTCG ACCTGGCCGA TGGCTGCGAT CATGAATAAC TTTGTCGGCG ACGGTAACGG CTATGTCGGC GACATCAACC TCAAGCCGGA AAAAGCCAAC ACGATCAGCG CCACCGCCGA TTGGCACGAC GCAACCGGCC AGCAATGGGG CGTGAAAGTC ACGCCGTACT ATTCCTATAT TCAGGATTAC ATCGACGCCC AGTGCCTGCC CGGCACAGTA TGCAAAACAG GCCAGTTCAA CGTGCTGCAA TACGTAAATC AGAATGCACG CATCTATGGT GCGGACTGGT CCGGTCATTA CCTGCTGGCC GAAACACAGA ACTTCGGGCG TTTCACTACT ACAGGCATGG TCAGCTACAC GCACGGCAGG AACACCACCA CCGACGACAG CCTGTATAAC GTGATGCCAT TGAATGCCAA ACTGGCACTG GTGCAGAACA TCGGTACGTG GCAAAACACC GTGGAAACCG TCCTCGTCGC GGCGAAAACG GATGTCTCCA GCGTGCGCAG CGAACAGGAG ACCGGCGGCT TTGCCCTCGT CAACCTGCGC ACCAGCTACA CCTGGAACAA ACTGCGCATC GATGTGGGCA TCGACAACCT GTTCAACCGT TACTACGCCC TGCCCCAGGG CGGTGTTTAT GTGGGTCAAG GCAAGACGAT GTCGATCAAT GGCGTTCCCT TCGGCGTGCC TGTTCCAGGC CCGGGTCGCT CGATTTACAC CAGCCTGAAT TACAGCTTCT AA
|
Protein sequence | MGDMPHFSHD TSAFIYFYIN ALRVGNKLAH GSGFLFNQLI KPTVTVTMKF KLNPLAILIA AAPLTAFAAT ELPPVEVTST KAATVPAGAQ ELTSEQIKAA RAATSDSARL LADFPGVNLF SAGGVSALPI IHGLADDRLR TQVDGMDLIA ACPNHMNSPL SYIDPTNVGR VRVYSGVVPV SVGGDSIGGT IAVDSPAPVF AQPGEGTITQ GEIGGFYRSN GSAYGGNLAA TLATRNFSLS YRGSKAQSDN YKAGGDFKPG GLSSADPKDI STGISHWLEG DTVGSSAYKS QNHELSAAYQ NDAGTQMVEL GVGIQRIPYE NYPNQRMDMT QNNSTQFNLH YKGLYDWGTL DARAYNQHVR HEMDFNDDRQ FWYMTAAGMP MNTEGKNTGA KLQANIDLTT RDVLKLGAEL QKYRLDDWWP PSGTGGMSPG TFWNINDGQR DRFDLFAEWD AHWTPQWMTQ IGVRSDTVRM NAGPVSGYST QMGVTSQSMT SYGMTAKDAA TFNASDREKT DHNLNFSALA RYTPDATQSY EFGFARQARS PNLYERYSWS TWPMAAIMNN FVGDGNGYVG DINLKPEKAN TISATADWHD ATGQQWGVKV TPYYSYIQDY IDAQCLPGTV CKTGQFNVLQ YVNQNARIYG ADWSGHYLLA ETQNFGRFTT TGMVSYTHGR NTTTDDSLYN VMPLNAKLAL VQNIGTWQNT VETVLVAAKT DVSSVRSEQE TGGFALVNLR TSYTWNKLRI DVGIDNLFNR YYALPQGGVY VGQGKTMSIN GVPFGVPVPG PGRSIYTSLN YSF
|
| |