Gene Information |
Locus tag | Hneap_0393 |
Symbol | |
ID | 8533515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 408537 |
End bp | 410513 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646382772 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003262297 |
Protein GI | 261855014 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
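The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 408537
END_BP = 410513


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1977 658"
```

Both values match the Gene Length (1977 bp) and Protein Length (658 aa) fields in the record.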
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.305694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACG AGCATCCCTC GCGACACGAT CAATTCACAG ACGACCAGCG TTCGCGCCGT GATTTTCTTC GTTATTTGCT GGCAACAGGC GCTCTGGCAG CCGGAGCGGG CGCGTTCAGT CGGGTCGGTC ATGCTGCCGG ATTTGCCGAC GCGCTGGGTT ATTCACCCAA GTATCCGGCT AACTTCAACG CCTTCAATTA TGTGAATCCC GACGCGCCCA AGGGTGGCCG GTTGTCGCTG TCGGTTTTTG GCAACTTCGA CTCGCTCAAC CCGTTTGTGT TGAAGGGCTT GGCCGCCTCT GGCCTGAATG AACTGTGTTT TGAAAGTCTG ACCGCGCGCG CCTGGGATGA ACCGTTCTCT GCCTATGGCT TGTTGGCCGA TCGGGCCGAT CTGGCCGCCG ATGGCTTGTC GATTACTTTT CGCATCAACG CCAAAGCCCG TTTTTATGAT GGCAGCCCCG TCACGGCGCG GGATGTGCTG TTCAGCTTCA ATACGCTCAT GAGCAAGCAG GCGCATCCGC GCTATCGGAT TTACTGGGCG GATATTGCCG GTACGGAGCT GACCGATGCG CGGCATGTTC GATTCAGCTT TAAGCGAGTC AACCCGGAAC TGCATCTGAT TATTGGTGAA TTGCCGGTTT TTTCCGAACG CTGGCTGGCG GGGCGTGATT TTGCCAGCCT GTCGCGTGAG GCGTTTTTAA CCAGCGGGCC GTATCGCGTG GGTCGGGTCG ATTACGGCAA TACAATCACC TATGAGCGCA ACCCCGATTA CTGGGCGAAG GATCTGGACG TGCGTCGCGG GCAGTTCAAC TTCGACCAAA TCGTGTTCAA ATACTACAAG GACGAAACCG TCTCGCTCGA AGCCTTTAAG GCGGGCGAAT TTGATGTCTT CTACGAGACC AACTCCAAGC GCTGGGCACG TGATTACACC GGCGCGAATT TCGATGATGG CCGTATTATT CGCCGGGAGA TACCGCATAA AAACAACGCA GGGATGCAGG GCTTCGGCAT GAACACCCGC CAAGCCTTGT TCTCGGATCG GCGCGTGCGG CGGGCGCTGG ATCTGGCGCT GGATTTTGGC TGGTCGAACA CGCATTTGTT CTACGGTCAG TACACGCGCT GCGACAGTTA TTTCAGTAAC AGCGAACTGG CTTGTCGCGG ATTGCCGCAA GGCGATGAAT TGGCATTGCT CGAACCCTTC AAGGCCCAGT TGCCGCCCGA GTTGTTTCGT GAACCCTATC AAGTGCCCGT CGCCAACAAT CCGGCCGAGC AGCGTGCCAA TTTGCTTAAA GCGCGTGATT TGTTGGCCGA GGCCGGTTGG CGGGTGGAAG AGGGCGTGCT CAAAAATGCC GAAGGCCAGC CGTTTTCCTT CGAAATCACG CTGGCCATGC GCGGTTTCGA GCGCATCGTC GCGCCCTATG CCTACAACCT CAAACGGCTC GGCATCGAAG TCAGTTACCG CACGATTGAT GTATCGCTGT ATCAACGCAA GATGGATCGG TTTGAATTTG ATATGGCCGT GGTAGCTTAT GGCGAGTCGC AATCGCCGGG CAATGAACTG CGCGATCGGT TCGGCAGTGC GGCCGCGCAT ACCGACGGTT CCAGCAACTA CATGGGCATT GATAGCCCTG TGGTGGACGC GCTGATTGAT CGGGTCATCT ACGCCAAGTC GCGGGCGGAA CTGGTCACGG CCTGTCGCGC GCTGGATCGC GTGCTGCTTT GGGGCGAATA TCTGGTGCCA AACTGGTACA TTGGTGCGCA TCGGCTGGCG TGGTGGAATC GATTCGGTTT TCACCAGCCG 
CTGCCGCTTT ATTTCGATGC CATGACTTGG GTCATGCAAA CATGGTGGCA AGTATATGAA CATCCGCAAA AACAATCGAA AGGGCACCTC GAAAAACCTA CTGCGCTGTC CGATTGCGTC GTCGCGTGCT CGTTTACGCG CAGCTGGCTT AAGCCCGCGC CTCGCCTATC TACTTGA
|
Protein sequence | MKNEHPSRHD QFTDDQRSRR DFLRYLLATG ALAAGAGAFS RVGHAAGFAD ALGYSPKYPA NFNAFNYVNP DAPKGGRLSL SVFGNFDSLN PFVLKGLAAS GLNELCFESL TARAWDEPFS AYGLLADRAD LAADGLSITF RINAKARFYD GSPVTARDVL FSFNTLMSKQ AHPRYRIYWA DIAGTELTDA RHVRFSFKRV NPELHLIIGE LPVFSERWLA GRDFASLSRE AFLTSGPYRV GRVDYGNTIT YERNPDYWAK DLDVRRGQFN FDQIVFKYYK DETVSLEAFK AGEFDVFYET NSKRWARDYT GANFDDGRII RREIPHKNNA GMQGFGMNTR QALFSDRRVR RALDLALDFG WSNTHLFYGQ YTRCDSYFSN SELACRGLPQ GDELALLEPF KAQLPPELFR EPYQVPVANN PAEQRANLLK ARDLLAEAGW RVEEGVLKNA EGQPFSFEIT LAMRGFERIV APYAYNLKRL GIEVSYRTID VSLYQRKMDR FEFDMAVVAY GESQSPGNEL RDRFGSAAAH TDGSSNYMGI DSPVVDALID RVIYAKSRAE LVTACRALDR VLLWGEYLVP NWYIGAHRLA WWNRFGFHQP LPLYFDAMTW VMQTWWQVYE HPQKQSKGHL EKPTALSDCV VACSFTRSWL KPAPRLST