Gene Hneap_0393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_0393 
Symbol 
ID8533515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp408537 
End bp410513 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content57% 
IMG OID646382772 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003262297 
Protein GI261855014 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.305694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACG AGCATCCCTC GCGACACGAT CAATTCACAG ACGACCAGCG TTCGCGCCGT 
GATTTTCTTC GTTATTTGCT GGCAACAGGC GCTCTGGCAG CCGGAGCGGG CGCGTTCAGT
CGGGTCGGTC ATGCTGCCGG ATTTGCCGAC GCGCTGGGTT ATTCACCCAA GTATCCGGCT
AACTTCAACG CCTTCAATTA TGTGAATCCC GACGCGCCCA AGGGTGGCCG GTTGTCGCTG
TCGGTTTTTG GCAACTTCGA CTCGCTCAAC CCGTTTGTGT TGAAGGGCTT GGCCGCCTCT
GGCCTGAATG AACTGTGTTT TGAAAGTCTG ACCGCGCGCG CCTGGGATGA ACCGTTCTCT
GCCTATGGCT TGTTGGCCGA TCGGGCCGAT CTGGCCGCCG ATGGCTTGTC GATTACTTTT
CGCATCAACG CCAAAGCCCG TTTTTATGAT GGCAGCCCCG TCACGGCGCG GGATGTGCTG
TTCAGCTTCA ATACGCTCAT GAGCAAGCAG GCGCATCCGC GCTATCGGAT TTACTGGGCG
GATATTGCCG GTACGGAGCT GACCGATGCG CGGCATGTTC GATTCAGCTT TAAGCGAGTC
AACCCGGAAC TGCATCTGAT TATTGGTGAA TTGCCGGTTT TTTCCGAACG CTGGCTGGCG
GGGCGTGATT TTGCCAGCCT GTCGCGTGAG GCGTTTTTAA CCAGCGGGCC GTATCGCGTG
GGTCGGGTCG ATTACGGCAA TACAATCACC TATGAGCGCA ACCCCGATTA CTGGGCGAAG
GATCTGGACG TGCGTCGCGG GCAGTTCAAC TTCGACCAAA TCGTGTTCAA ATACTACAAG
GACGAAACCG TCTCGCTCGA AGCCTTTAAG GCGGGCGAAT TTGATGTCTT CTACGAGACC
AACTCCAAGC GCTGGGCACG TGATTACACC GGCGCGAATT TCGATGATGG CCGTATTATT
CGCCGGGAGA TACCGCATAA AAACAACGCA GGGATGCAGG GCTTCGGCAT GAACACCCGC
CAAGCCTTGT TCTCGGATCG GCGCGTGCGG CGGGCGCTGG ATCTGGCGCT GGATTTTGGC
TGGTCGAACA CGCATTTGTT CTACGGTCAG TACACGCGCT GCGACAGTTA TTTCAGTAAC
AGCGAACTGG CTTGTCGCGG ATTGCCGCAA GGCGATGAAT TGGCATTGCT CGAACCCTTC
AAGGCCCAGT TGCCGCCCGA GTTGTTTCGT GAACCCTATC AAGTGCCCGT CGCCAACAAT
CCGGCCGAGC AGCGTGCCAA TTTGCTTAAA GCGCGTGATT TGTTGGCCGA GGCCGGTTGG
CGGGTGGAAG AGGGCGTGCT CAAAAATGCC GAAGGCCAGC CGTTTTCCTT CGAAATCACG
CTGGCCATGC GCGGTTTCGA GCGCATCGTC GCGCCCTATG CCTACAACCT CAAACGGCTC
GGCATCGAAG TCAGTTACCG CACGATTGAT GTATCGCTGT ATCAACGCAA GATGGATCGG
TTTGAATTTG ATATGGCCGT GGTAGCTTAT GGCGAGTCGC AATCGCCGGG CAATGAACTG
CGCGATCGGT TCGGCAGTGC GGCCGCGCAT ACCGACGGTT CCAGCAACTA CATGGGCATT
GATAGCCCTG TGGTGGACGC GCTGATTGAT CGGGTCATCT ACGCCAAGTC GCGGGCGGAA
CTGGTCACGG CCTGTCGCGC GCTGGATCGC GTGCTGCTTT GGGGCGAATA TCTGGTGCCA
AACTGGTACA TTGGTGCGCA TCGGCTGGCG TGGTGGAATC GATTCGGTTT TCACCAGCCG
CTGCCGCTTT ATTTCGATGC CATGACTTGG GTCATGCAAA CATGGTGGCA AGTATATGAA
CATCCGCAAA AACAATCGAA AGGGCACCTC GAAAAACCTA CTGCGCTGTC CGATTGCGTC
GTCGCGTGCT CGTTTACGCG CAGCTGGCTT AAGCCCGCGC CTCGCCTATC TACTTGA
 
Protein sequence
MKNEHPSRHD QFTDDQRSRR DFLRYLLATG ALAAGAGAFS RVGHAAGFAD ALGYSPKYPA 
NFNAFNYVNP DAPKGGRLSL SVFGNFDSLN PFVLKGLAAS GLNELCFESL TARAWDEPFS
AYGLLADRAD LAADGLSITF RINAKARFYD GSPVTARDVL FSFNTLMSKQ AHPRYRIYWA
DIAGTELTDA RHVRFSFKRV NPELHLIIGE LPVFSERWLA GRDFASLSRE AFLTSGPYRV
GRVDYGNTIT YERNPDYWAK DLDVRRGQFN FDQIVFKYYK DETVSLEAFK AGEFDVFYET
NSKRWARDYT GANFDDGRII RREIPHKNNA GMQGFGMNTR QALFSDRRVR RALDLALDFG
WSNTHLFYGQ YTRCDSYFSN SELACRGLPQ GDELALLEPF KAQLPPELFR EPYQVPVANN
PAEQRANLLK ARDLLAEAGW RVEEGVLKNA EGQPFSFEIT LAMRGFERIV APYAYNLKRL
GIEVSYRTID VSLYQRKMDR FEFDMAVVAY GESQSPGNEL RDRFGSAAAH TDGSSNYMGI
DSPVVDALID RVIYAKSRAE LVTACRALDR VLLWGEYLVP NWYIGAHRLA WWNRFGFHQP
LPLYFDAMTW VMQTWWQVYE HPQKQSKGHL EKPTALSDCV VACSFTRSWL KPAPRLST