Gene Hhal_0597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0597 
Symbol 
ID4710860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp672913 
End bp674814 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content67% 
IMG OID639855055 
Productextracellular solute-binding protein 
Protein accessionYP_001002185 
Protein GI121997398 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAGCA TGCTCCCATT CCCCCGCAGG CGCTGCGCTC CCCGGCATTC TCCGCCCTGT 
TGGGCGGCCG TGATGGCGGC GGTGTTCCTG ACGGTTCTGC TCCCGCTCTC CGCCGCGACC
GGCAACGAGG AGGACGCCCC GGGCGTCCAC GGCCTGGCGT TGCACGGCGA GCCCAAATAC
CCGCCGGACT TCTCCCACTT CGACTACGTC AACCCGAAGG CCCCCAAGGG GGGTACGGTC
ATCCGCGAGG CGCGGGGGAG TTACGATAGC CTCAACGGCT ACATCCTGCG CGGGACCAAA
CCGCCGGGTC TGGGGATGGT GATCGATACC CTGATGGTCC ACGCCGATGA TGAACCGTTT
TCTGTCTACG GGCTGATCGC CGAACGCGTG GAGGTGGCCG AGGACAACGC CTGGGTCGAG
TTCAAGCTGC GTGAGGAGGC CCGTTTCCAC GACGGTGAAC CGATCACCGC CGATGACGTG
GTCTTCAGTT TCGAAGTGCT CCGTGAGCAC GGGCACCCCA GGTTGCGCTC CTACTACCGC
CACGTCGAGT CCGCCGAGGC CGAGGGTCGT CATCGGGTGC GTTTCGAGTT CGCCCACGCC
GGCAATCCCG AATTGCCGCT GATCATGGGC GAGCTGCCGG TGCTGCCTCA GCATTACTGG
GAGGAACGCG ATTTCAGCCG GACCACCATG CAGCCGCCGT TGGGCAGTGG GCCCTACCGG
ATCGCCGAGG CCCGTCCCGG GCGCAGCATC ACCTACGAGC GGGTGGAGGA CTACTGGGCG
GAAGACCTGC CCGTGCGTCG CGGGCGGTTC AATTTCGACC GCCTGCGCTA CGACTTTTAC
CGAGACGCCA CCGTTGCGCT GGAGGCCTTC CGCGCCGGGG AGTTCGATCT GCGCGAGGAG
TATACTGCCC GCCACTGGGC CACCGGTTAC GAGACCTCGG CGCAGCGCGA AGGACGCATG
GTTCTCGAGG AGATCGAGCA CAGCCGCCCG GCGGGGATGC AGGGGTTCGT CTTCAACACC
CGTCGCCCGG TCTTCGAGGA CCGCGAGGTG CGTCGGGCAC TGAGCTACGC CTTCGACTTC
GAGTGGACCA ACCGCCAGCT GTTCCACTCG GCGTATACGC GTACGGCGAG CTACTTCGAA
AACTCGGAGT TGGCAGCTCG CGGTGCACCG GGCGAGGCCG AGCAGGCGAT CCTCGCCCCC
TTCCGGGAGG AGCTCCCCGA GGCGGTCTTC GAGCCGTACC GGCCGCCGGT CACCGACGGC
TCGGGCTGGA ACCGCGAGAA TCTGCTCAAG GCGCTCCGCA TCCTCAAGGA GGCCGGCTGG
TCGGTGGGGG ACGACGGCAT CCTGCGTCAC CGGGACAGCG GTCGGCCGCT GGTCTTCGAG
CTGCTCCTGG TCAACCCGAG CTTCGAGCGC GTGGCCCTGC CGTTCGTCCA GAACCTTCGG
CGCATCGGAG TCCTGGCGCG GGTGCGCACG GTGGATACCA CCCAGTACCA GTACCGGCTG
GATCACTTCG AGTTCGATAT GGCTGTGGTG GTGCTGCCGC AATCGCCCTC GCCGGGTCAC
GAGCAGGCCA TGTACTGGAG TTCCGAGGCC GCTGACGAAC CGGGCAGCCG GAACTACGCC
GGGGTGCGGG ACCCGGTGGT CGACGAACTG GTCGAACGCC TGGTCTCCGC TGAGGATCGC
GATGAGCTGG TCCACCTGAC CCGCGCCCTG GACCGCGTCC TACTCGCCGG CCACTACGTG
ATCCCCAACT GGCATACCCC GGTCCATCGG GTCGCCTACT GGGACAAGTT CGGGCGGCCG
GAGACGGCCC CGAAGTACGG ACTGGGGTTC GATACCTGGT GGGTCGACCC GGACAAGGAG
CAACGGCTGC GCGAGGCCAA CGGCCCGCGG TCGGTCCGAT AG
 
Protein sequence
MSSMLPFPRR RCAPRHSPPC WAAVMAAVFL TVLLPLSAAT GNEEDAPGVH GLALHGEPKY 
PPDFSHFDYV NPKAPKGGTV IREARGSYDS LNGYILRGTK PPGLGMVIDT LMVHADDEPF
SVYGLIAERV EVAEDNAWVE FKLREEARFH DGEPITADDV VFSFEVLREH GHPRLRSYYR
HVESAEAEGR HRVRFEFAHA GNPELPLIMG ELPVLPQHYW EERDFSRTTM QPPLGSGPYR
IAEARPGRSI TYERVEDYWA EDLPVRRGRF NFDRLRYDFY RDATVALEAF RAGEFDLREE
YTARHWATGY ETSAQREGRM VLEEIEHSRP AGMQGFVFNT RRPVFEDREV RRALSYAFDF
EWTNRQLFHS AYTRTASYFE NSELAARGAP GEAEQAILAP FREELPEAVF EPYRPPVTDG
SGWNRENLLK ALRILKEAGW SVGDDGILRH RDSGRPLVFE LLLVNPSFER VALPFVQNLR
RIGVLARVRT VDTTQYQYRL DHFEFDMAVV VLPQSPSPGH EQAMYWSSEA ADEPGSRNYA
GVRDPVVDEL VERLVSAEDR DELVHLTRAL DRVLLAGHYV IPNWHTPVHR VAYWDKFGRP
ETAPKYGLGF DTWWVDPDKE QRLREANGPR SVR